Login / Signup

Modular, efficient and constant-memory single-cell RNA-seq preprocessing.

Páll MelstedA Sina BooeshaghiLauren LiuFan GaoLambda LuKyung Hoi Joseph MinEduardo da Veiga BeltrameKristján Eldjárn HjörleifssonJase GehringLior Pachter
Published in: Nature biotechnology (2021)
We describe a workflow for preprocessing of single-cell RNA-sequencing data that balances efficiency and accuracy. Our workflow is based on the kallisto and bustools programs, and is near optimal in speed with a constant memory requirement providing scalability for arbitrarily large datasets. The workflow is modular, and we demonstrate its flexibility by showing how it can be used for RNA velocity analyses.
Keyphrases
  • single cell
  • rna seq
  • electronic health record
  • high throughput
  • working memory
  • public health
  • big data
  • machine learning
  • deep learning
  • data analysis