Login / Signup

Alevin-fry unlocks rapid, accurate and memory-frugal quantification of single-cell RNA-seq data.

Dongze HeMohsen ZakeriHirak SarkarCharlotte SonesonAvi SrivastavaRob Patro
Published in: Nature methods (2022)
The rapid growth of high-throughput single-cell and single-nucleus RNA-sequencing (scRNA-seq and snRNA-seq) technologies has produced a wealth of data over the past few years. The size, volume and distinctive characteristics of these data necessitate the development of new computational methods to accurately and efficiently quantify sc/snRNA-seq data into count matrices that constitute the input to downstream analyses. We introduce the alevin-fry framework for quantifying sc/snRNA-seq data. In addition to being faster and more memory frugal than other accurate quantification approaches, alevin-fry ameliorates the memory scalability and false-positive expression issues that are exhibited by other lightweight tools. We demonstrate how alevin-fry can be effectively used to quantify sc/snRNA-seq data, and also how the spliced and unspliced molecule quantification required as input for RNA velocity analyses can be seamlessly extracted from the same preprocessed data used to generate normal gene expression count matrices.
Keyphrases
  • single cell
  • rna seq
  • high throughput
  • electronic health record
  • gene expression
  • big data
  • genome wide
  • dna methylation
  • high resolution
  • poor prognosis
  • mass spectrometry