Correcting modification-mediated errors in nanopore sequencing by nucleotide demodification and reference-based correction.

Chien-Shun Chiou, Bo-Han Chen, You-Wun Wang, Nang-Ting Kuo, Chih-Hsiang Chang, Yao-Ting Huang
Published in: Communications biology (2023)
The accuracy of Oxford Nanopore Technology (ONT) sequencing has significantly improved thanks to new flowcells, sequencing kits, and basecalling algorithms. However, novel modification types that the basecalling models were not trained on can seriously reduce the quality. Here we report a set of ONT-sequenced genomes with unexpectedly low quality due to novel modification types. Demodification by whole-genome amplification significantly improved the quality but lost the epigenome. We also developed a reference-based method, Modpolish, for correcting modification-mediated errors while retaining the epigenome when a sufficient number of closely related genomes is publicly available (default: top 20 genomes with at least 95% identity). Modpolish significantly improved the quality not only of in-house sequenced genomes but also of public datasets sequenced by R9.4 and R10.4 (simplex). Our results suggest that novel modifications are prone to causing systematic ONT errors. Nevertheless, these errors are correctable by nucleotide demodification or Modpolish without prior knowledge of the modifications.
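The default selection rule stated in the abstract (top 20 genomes with at least 95% identity) can be illustrated with a minimal sketch. This is not Modpolish's implementation: the function name `select_reference_genomes`, its parameters, and the input format (pairs of genome name and identity expressed as a fraction) are hypothetical and chosen only for illustration. How identity is computed is not described in the abstract and is assumed to be given.

```python
from typing import List, Tuple


def select_reference_genomes(
    candidates: List[Tuple[str, float]],
    min_identity: float = 0.95,  # abstract's default: at least 95% identity
    top_n: int = 20,             # abstract's default: top 20 genomes
) -> List[Tuple[str, float]]:
    """Hypothetical helper: keep candidates that meet the identity
    threshold, then return the top_n most similar ones."""
    # Drop genomes below the identity threshold.
    eligible = [(name, ident) for name, ident in candidates if ident >= min_identity]
    # Sort by identity, highest first (stable for ties).
    eligible.sort(key=lambda pair: pair[1], reverse=True)
    # Keep at most top_n genomes.
    return eligible[:top_n]


if __name__ == "__main__":
    # Made-up identities, for demonstration only.
    demo = [("genome_a", 0.99), ("genome_b", 0.94), ("genome_c", 0.97)]
    print(select_reference_genomes(demo))
```

With the made-up input above, `genome_b` is excluded because its identity falls below the 95% threshold, and the remaining genomes are returned in order of decreasing identity.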
Keyphrases
  • adverse drug
  • patient safety
  • dna methylation
  • healthcare
  • single cell
  • quality improvement
  • machine learning
  • mental health
  • gene expression
  • emergency department
  • functional connectivity
  • resistance training