Secuer: Ultrafast, scalable and accurate clustering of single-cell RNA-seq data.
Nana WeiYating NieLin L LiuXiaoqi ZhengHua-Jun WuPublished in: PLoS computational biology (2022)
Identifying cell clusters is a critical step for single-cell transcriptomics study. Despite the numerous clustering tools developed recently, the rapid growth of scRNA-seq volumes prompts for a more (computationally) efficient clustering method. Here, we introduce Secuer, a Scalable and Efficient speCtral clUstERing algorithm for scRNA-seq data. By employing an anchor-based bipartite graph representation algorithm, Secuer enjoys reduced runtime and memory usage over one order of magnitude for datasets with more than 1 million cells. Meanwhile, Secuer also achieves better or comparable accuracy than competing methods in small and moderate benchmark datasets. Furthermore, we showcase that Secuer can also serve as a building block for a new consensus clustering method, Secuer-consensus, which again improves the runtime and scalability of state-of-the-art consensus clustering methods while also maintaining the accuracy. Overall, Secuer is a versatile, accurate, and scalable clustering framework suitable for small to ultra-large single-cell clustering tasks.
Keyphrases
- single cell
- rna seq
- high throughput
- machine learning
- high resolution
- working memory
- stem cells
- deep learning
- signaling pathway
- electronic health record
- induced apoptosis
- cell death
- bone marrow
- mesenchymal stem cells
- computed tomography
- big data
- neural network
- sensitive detection
- cell cycle arrest
- cell therapy
- pi k akt
- convolutional neural network
- endoplasmic reticulum stress
- genome wide
- quantum dots