Login / Signup

Splice_sim: a nucleotide conversion-enabled RNA-seq simulation and evaluation framework.

Niko PopitschTobias NeumannArndt von HaeselerStefan L Ameres
Published in: Genome biology (2024)
Nucleotide conversion RNA sequencing techniques interrogate chemical RNA modifications in cellular transcripts, resulting in mismatch-containing reads. Biases in mapping the resulting reads to reference genomes remain poorly understood. We present splice_sim, a splice-aware RNA-seq simulation and evaluation pipeline that introduces user-defined nucleotide conversions at set frequencies, creates mixture models of converted and unconverted reads, and calculates mapping accuracies per genomic annotation. By simulating nucleotide conversion RNA-seq datasets under realistic experimental conditions, including metabolic RNA labeling and RNA bisulfite sequencing, we measure mapping accuracies of state-of-the-art spliced-read mappers for mouse and human transcripts and derive strategies to prevent biases in the data interpretation.
Keyphrases
  • high density
  • rna seq
  • single cell
  • machine learning
  • high resolution
  • big data
  • single molecule
  • induced pluripotent stem cells
  • deep learning
  • pluripotent stem cells