svCapture: efficient and specific detection of very low frequency structural variant junctions by error-minimized capture sequencing.
Thomas E Wilson, Samreen Ahmed, Jake Higgins, Jesse J Salk, Thomas W Glover
Published in: NAR Genomics and Bioinformatics (2023)
Error-corrected sequencing of genomic targets enriched by probe-based capture has become a standard approach for detecting single-nucleotide variants (SNVs) and small insertions/deletions (indels) present at very low variant allele frequencies. Less attention has been given to comparable strategies for rare structural variant (SV) junctions, where different error mechanisms must be addressed. Working from samples with known SV properties, we demonstrate that duplex sequencing (DuplexSeq), which demands confirmation of variants on both strands of a source DNA molecule, eliminates false SV junctions arising from chimeric PCR. However, DuplexSeq could not address the frequent intermolecular ligation artifacts that arise during Y-adapter addition prior to strand denaturation unless multiple source molecules were required to support a junction. In contrast, tagmentation libraries coupled with data filtering based on strand family size greatly reduced both artifact classes and enabled efficient and specific detection of single-molecule SV junctions. The throughput of SV capture sequencing (svCapture) and the base-level accuracy of DuplexSeq provided detailed views of the microhomology profile and the limited occurrence of de novo SNVs near the junctions of hundreds of newly created SVs, suggesting end joining as a possible formation mechanism. The open source svCapture pipeline enables rare SV detection as a routine addition to SNV/indel detection in properly prepared capture sequencing libraries.
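The two filtering ideas named in the abstract, duplex confirmation and strand family size, can be illustrated with a minimal sketch. This is not the svCapture pipeline's code: the `JunctionCall` record, the function names, the read counts, and the threshold of 3 are all hypothetical, and the sketch assumes that a family-size filter means requiring a minimum number of read pairs derived from a single source strand.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class JunctionCall:
    """Hypothetical record for one candidate SV junction in one source molecule."""
    junction_id: str
    plus_reads: int   # read pairs in the family derived from one strand
    minus_reads: int  # read pairs in the family derived from the complementary strand


def is_duplex_confirmed(call: JunctionCall) -> bool:
    # Duplex rule: the junction must be observed on both strands of the molecule.
    return call.plus_reads > 0 and call.minus_reads > 0


def passes_family_size(call: JunctionCall, min_family_size: int) -> bool:
    # Assumed family-size rule: at least one strand family must reach the
    # minimum size, since late-arising PCR chimeras tend to form small families.
    return max(call.plus_reads, call.minus_reads) >= min_family_size


def filter_calls(calls: List[JunctionCall],
                 min_family_size: int = 3,
                 require_duplex: bool = False) -> List[JunctionCall]:
    """Keep only the calls that satisfy the selected filters."""
    kept = []
    for call in calls:
        if require_duplex and not is_duplex_confirmed(call):
            continue
        if not passes_family_size(call, min_family_size):
            continue
        kept.append(call)
    return kept


# Made-up example data, for illustration only.
calls = [
    JunctionCall("jxn_a", plus_reads=5, minus_reads=4),  # seen on both strands
    JunctionCall("jxn_b", plus_reads=1, minus_reads=0),  # singleton, one strand
    JunctionCall("jxn_c", plus_reads=6, minus_reads=0),  # large single-strand family
]

duplex_kept = [c.junction_id for c in filter_calls(calls, min_family_size=1, require_duplex=True)]
family_kept = [c.junction_id for c in filter_calls(calls, min_family_size=3)]
print(duplex_kept)
print(family_kept)
```

In this toy data the duplex rule keeps only the junction seen on both strands, while the family-size rule also keeps a junction supported by a large single-strand family, which is the situation the abstract describes for tagmentation libraries. Neither rule in this sketch models the ligation artifacts that the abstract says DuplexSeq alone could not remove.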