Aberration-corrected ultrafine analysis of miRNA reads at single-base resolution: a k-mer lattice approach.
Xuan Zhang, Pengyao Ping, Gyorgy Hutvagner, Michael Blumenstein, Jinyan Li
Published in: Nucleic Acids Research (2021)
Raw sequencing reads of miRNAs contain machine-made substitution errors and, in some cases, insertions and deletions (indels). Although the error rate can be as low as 0.1%, precise rectification of these errors is critically important: isoform variation analysis at single-base resolution, such as novel isomiR discovery, the study of editing events, differential expression analysis, and tissue-specific isoform identification, is very sensitive to the base positions and copy counts of the reads. Existing error correction methods do not work for miRNA sequencing data because the length and per-read coverage of miRNA reads differ from those of DNA or mRNA sequencing reads. We present a novel lattice structure combining k-mers, (k - 1)-mers and (k + 1)-mers to address this problem. The method is particularly effective for the correction of indel errors. Extensive tests on datasets with known ground-truth errors demonstrate that the method removes almost all of the errors without introducing any new ones, improving the data quality from one error in every 50 reads to one error in every 1300 reads (a roughly 26-fold reduction). Studies on experimental miRNA sequencing datasets show that the errors are often rectified at the 5' ends and the seed regions of the reads, and that the correction brings remarkable changes in miRNA isoform abundance, volume of singleton reads, overall entropy, isomiR families, tissue-specific miRNAs, and rare-miRNA quantities.
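The abstract does not describe how the lattice is built, so the following Python sketch is only an illustration of the general idea and should not be read as the authors' algorithm. It assumes that a lattice links each observed k-mer to the observed (k - 1)-mers reachable by deleting one base and to the observed (k + 1)-mers that reduce to it by one deletion, which is one plausible way such a structure could expose indel candidates. The function names (`count_kmers`, `deletion_neighbors`, `build_lattice`) and the toy reads are hypothetical.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every substring of length k across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def deletion_neighbors(kmer):
    """Return all strings obtained by deleting exactly one base."""
    return {kmer[:i] + kmer[i + 1:] for i in range(len(kmer))}


def build_lattice(reads, k):
    """Illustrative three-level structure (an assumption, not the paper's).

    levels: counts of (k-1)-mers, k-mers and (k+1)-mers
    down:   k-mer -> observed (k-1)-mers one deletion away
    up:     k-mer -> observed (k+1)-mers that reduce to it by one deletion
    """
    levels = {size: count_kmers(reads, size) for size in (k - 1, k, k + 1)}
    down = {}
    up = {}
    for kmer in levels[k]:
        down[kmer] = {s for s in deletion_neighbors(kmer) if s in levels[k - 1]}
        up[kmer] = set()
    for longer in levels[k + 1]:
        for shorter in deletion_neighbors(longer):
            if shorter in up:
                up[shorter].add(longer)
    return levels, down, up


# Toy input: two copies of one read plus a variant missing a single base.
reads = ["ACGTAC", "ACGTAC", "ACGAC"]
levels, down, up = build_lattice(reads, k=4)
```

In this toy input, the low-count 4-mer `ACGA` is linked upward to the more abundant 5-mer `ACGTA`, which is the kind of cross-level relationship a corrector could inspect when deciding whether a rare read carries a deletion. How counts are actually compared and how corrections are chosen is not stated in the abstract and is left out here.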