Login / Signup

Enhancing breakpoint resolution with deep segmentation model: A general refinement method for read-depth based structural variant callers.

Yao-Zhong ZhangSeiya ImotoSatoru MiyanoRui Yamaguchi
Published in: PLoS computational biology (2021)
Read-depths (RDs) are frequently used in identifying structural variants (SVs) from sequencing data. For existing RD-based SV callers, it is difficult for them to determine breakpoints in single-nucleotide resolution due to the noisiness of RD data and the bin-based calculation. In this paper, we propose to use the deep segmentation model UNet to learn base-wise RD patterns surrounding breakpoints of known SVs. We integrate model predictions with an RD-based SV caller to enhance breakpoints in single-nucleotide resolution. We show that UNet can be trained with a small amount of data and can be applied both in-sample and cross-sample. An enhancement pipeline named RDBKE significantly increases the number of SVs with more precise breakpoints on simulated and real data. The source code of RDBKE is freely available at https://github.com/yaozhong/deepIntraSV.
Keyphrases
  • electronic health record
  • single molecule
  • big data
  • deep learning
  • convolutional neural network
  • machine learning
  • dna methylation
  • artificial intelligence
  • optical coherence tomography
  • gene expression
  • high intensity