Login / Signup

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.

Matthew T ParkerKatarzyna KnopGeoffrey J BartonGordon Grant Simpson
Published in: Genome biology (2021)
Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment in a two-pass approach. This method, available in the software package 2passtools ( https://github.com/bartongroup/2passtools ), improves the accuracy of spliced alignment and transcriptome assembly for species both with and without existing high-quality annotations.
Keyphrases
  • single molecule
  • single cell
  • machine learning
  • rna seq
  • genome wide
  • healthcare
  • transcription factor
  • magnetic resonance
  • dna methylation
  • image quality
  • social media
  • data analysis
  • amino acid
  • bioinformatics analysis