Modular, efficient and constant-memory single-cell RNA-seq preprocessing.
Páll MelstedA Sina BooeshaghiLauren LiuFan GaoLambda LuKyung Hoi Joseph MinEduardo da Veiga BeltrameKristján Eldjárn HjörleifssonJase GehringLior PachterPublished in: Nature biotechnology (2021)
We describe a workflow for preprocessing of single-cell RNA-sequencing data that balances efficiency and accuracy. Our workflow is based on the kallisto and bustools programs, and is near optimal in speed with a constant memory requirement providing scalability for arbitrarily large datasets. The workflow is modular, and we demonstrate its flexibility by showing how it can be used for RNA velocity analyses.