Login / Signup

CapTrap-Seq: A platform-agnostic and quantitative approach for high-fidelity full-length RNA transcript sequencing.

Silvia Carbonell-SalaJulien LagardeHiromi NishiyoriEmilio PalumboCarme ArnanHazuki TakahashiPiero CarniciBarbara Uszczynska-RatajczakRoderic Guigó
Published in: bioRxiv : the preprint server for biology (2023)
Long-read RNA sequencing is essential to produce accurate and exhaustive annotation of eukaryotic genomes. Despite advancements in throughput and accuracy, achieving reliable end-to-end identification of RNA transcripts remains a challenge for long-read sequencing methods. To address this limitation, we developed CapTrap-seq, a cDNA library preparation method, which combines the Cap-trapping strategy with oligo(dT) priming to detect 5'capped, full-length transcripts, together with the data processing pipeline LyRic. We benchmarked CapTrap-seq and other popular RNA-seq library preparation protocols in a number of human tissues using both ONT and PacBio sequencing. To assess the accuracy of the transcript models produced, we introduced a capping strategy for synthetic RNA spike-in sequences that mimics the natural 5'cap formation in RNA spike-in molecules. We found that the vast majority (up to 90%) of transcript models that LyRic derives from CapTrap-seq reads are full-length. This makes it possible to produce highly accurate annotations with minimal human intervention.
Keyphrases
  • rna seq
  • single cell
  • high throughput
  • endothelial cells
  • high resolution
  • randomized controlled trial
  • induced pluripotent stem cells
  • nucleic acid
  • gene expression
  • single molecule
  • deep learning