Login / Signup

A multi-sample approach increases the accuracy of transcript assembly.

Li SongSarven SabunciyanGuangyu YangLiliana D Florea
Published in: Nature communications (2019)
Transcript assembly from RNA-seq reads is a critical step in gene expression and subsequent functional analyses. Here we present PsiCLASS, an accurate and efficient transcript assembler based on an approach that simultaneously analyzes multiple RNA-seq samples. PsiCLASS combines mixture statistical models for exonic feature selection across multiple samples with splice graph based dynamic programming algorithms and a weighted voting scheme for transcript selection. PsiCLASS achieves significantly better sensitivity-precision tradeoff, and renders precision up to 2-3 fold higher than the StringTie system and Scallop plus TACO, the two best current approaches. PsiCLASS is efficient and scalable, assembling 667 GEUVADIS samples in 9 h, and has robust accuracy with large numbers of samples.
Keyphrases
  • rna seq
  • single cell
  • gene expression
  • machine learning
  • deep learning
  • magnetic resonance
  • dna methylation
  • magnetic resonance imaging
  • mass spectrometry
  • computed tomography
  • contrast enhanced
  • network analysis