Detection of mosaic and population-level structural variants with Sniffles2.
Moritz SmolkaLuis F PaulinChristopher M GrochowskiDominic W HornerMedhat MahmoudSairam BeheraEster Kalef-EzraMira GandhiKarl HongDavut PehlivanSonja W ScholzCláudia M B CarvalhoChristos ProukakisFritz J SedlazeckPublished in: Nature biotechnology (2024)
Calling structural variations (SVs) is technically challenging, but using long reads remains the most accurate way to identify complex genomic alterations. Here we present Sniffles2, which improves over current methods by implementing a repeat aware clustering coupled with a fast consensus sequence and coverage-adaptive filtering. Sniffles2 is 11.8 times faster and 29% more accurate than state-of-the-art SV callers across different coverages (5-50×), sequencing technologies (ONT and HiFi) and SV types. Furthermore, Sniffles2 solves the problem of family-level to population-level SV calling to produce fully genotyped VCF files. Across 11 probands, we accurately identified causative SVs around MECP2, including highly complex alleles with three overlapping SVs. Sniffles2 also enables the detection of mosaic SVs in bulk long-read data. As a result, we identified multiple mosaic SVs in brain tissue from a patient with multiple system atrophy. The identified SV showed a remarkable diversity within the cingulate cortex, impacting both genes involved in neuron function and repetitive elements.
Keyphrases
- functional connectivity
- single cell
- resting state
- copy number
- loop mediated isothermal amplification
- high resolution
- case report
- label free
- high frequency
- electronic health record
- single molecule
- clinical practice
- rna seq
- artificial intelligence
- mass spectrometry
- big data
- brain injury
- sensitive detection
- subarachnoid hemorrhage
- health insurance