Genome-wide reconstruction of complex structural variants using read clouds.
Noah Spies, Ziming Weng, Alex Bishara, Jennifer McDaniel, David Catoe, Justin M Zook, Marc Salit, Robert B West, Serafim Batzoglou, Arend Sidow
Published in: Nature Methods (2017)
In read cloud approaches, microfluidic partitioning of long genomic DNA fragments and barcoding of the shorter fragments derived from them retains long-range information in short sequencing reads. This combination of short reads with long-range information represents a powerful alternative to single-molecule long-read sequencing. We develop Genome-wide Reconstruction of Complex Structural Variants (GROC-SVs) for SV detection and assembly from read cloud data and apply this method to Illumina-sequenced 10x Genomics sarcoma and breast cancer data sets. Compared with short-fragment sequencing, GROC-SVs substantially improves the specificity of breakpoint detection at comparable sensitivity. This approach also performs sequence assembly across multiple breakpoints simultaneously, enabling the reconstruction of events exhibiting remarkable complexity. We show that chromothriptic rearrangements occurred before copy number amplifications, and that rates of single-nucleotide variants and SVs are not correlated. Our results support the use of read cloud approaches to advance the characterization of large and complex structural variation.
Keyphrases
- single molecule
- copy number
- genome wide
- mitochondrial dna
- single cell
- dna methylation
- living cells
- atomic force microscopy
- label free
- electronic health record
- big data
- loop mediated isothermal amplification
- high throughput
- gene expression
- health information
- real time pcr
- circulating tumor cells
- young adults
- artificial intelligence
- data analysis
- quantum dots
- amino acid