GapClust is a light-weight approach distinguishing rare cells from voluminous single cell expression profiles.
Botao FaTing WeiYuan ZhouLuke JohnstonXin YuanYanran MaYue ZhangZhangsheng YuPublished in: Nature communications (2021)
Single cell RNA sequencing (scRNA-seq) is a powerful tool in detailing the cellular landscape within complex tissues. Large-scale single cell transcriptomics provide both opportunities and challenges for identifying rare cells playing crucial roles in development and disease. Here, we develop GapClust, a light-weight algorithm to detect rare cell types from ultra-large scRNA-seq datasets with state-of-the-art speed and memory efficiency. Benchmarking on diverse experimental datasets demonstrates the superior performance of GapClust compared to other recently proposed methods. When applying our algorithm to an intestine and 68 k PBMC datasets, GapClust identifies the tuft cells and a previously unrecognised subtype of monocyte, respectively.
Keyphrases
- single cell
- rna seq
- induced apoptosis
- high throughput
- cell cycle arrest
- machine learning
- physical activity
- deep learning
- gene expression
- weight gain
- stem cells
- genome wide
- dendritic cells
- endoplasmic reticulum stress
- endothelial cells
- oxidative stress
- cell proliferation
- mass spectrometry
- immune response
- mesenchymal stem cells
- bone marrow