Login / Signup

LinkedSV for detection of mosaic structural variants from linked-read exome and genome sequencing data.

Li FangCharlly KaoMichael V GonzalezFernanda A MafraRenata Pellegrino da SilvaMingyao LiSören-Sebastian WenzelKatharina WimmerHakon H HakonarsonKai Wang
Published in: Nature communications (2019)
Linked-read sequencing provides long-range information on short-read sequencing data by barcoding reads originating from the same DNA molecule, and can improve detection and breakpoint identification for structural variants (SVs). Here we present LinkedSV for SV detection on linked-read sequencing data. LinkedSV considers barcode overlapping and enriched fragment endpoints as signals to detect large SVs, while it leverages read depth, paired-end signals and local assembly to detect small SVs. Benchmarking studies demonstrate that LinkedSV outperforms existing tools, especially on exome data and on somatic SVs with low variant allele frequencies. We demonstrate clinical cases where LinkedSV identifies disease-causal SVs from linked-read exome sequencing data missed by conventional exome sequencing, and show examples where LinkedSV identifies SVs missed by high-coverage long-read sequencing. In summary, LinkedSV can detect SVs missed by conventional short-read and long-read sequencing approaches, and may resolve negative cases from clinical genome/exome sequencing studies.
Keyphrases
  • single molecule
  • single cell
  • copy number
  • electronic health record
  • genome wide
  • big data
  • dna methylation
  • machine learning
  • loop mediated isothermal amplification
  • artificial intelligence
  • nucleic acid
  • health insurance