A scalable sparse neural network framework for rare cell type annotation of single-cell transcriptome data.

Yuqi Cheng, Xingyu Fan, Jianing Zhang, Yu Li
Published in: Communications Biology (2023)
Automatic cell type annotation methods are increasingly used in single-cell RNA sequencing (scRNA-seq) analysis because they are fast and precise. However, current methods often fail to account for the class imbalance of scRNA-seq datasets and ignore information from smaller populations, leading to significant errors in downstream biological analysis. Here, we introduce scBalance, an integrated sparse neural network framework that incorporates adaptive weight sampling and dropout techniques for auto-annotation tasks. Using 20 scRNA-seq datasets with varying scales and degrees of imbalance, we demonstrate that scBalance outperforms current methods in both intra- and inter-dataset annotation tasks. Additionally, scBalance scales well when identifying rare cell types in datasets on the order of a million cells, as shown in the bronchoalveolar cell landscape. scBalance is also significantly faster than commonly used tools and comes in a user-friendly format, making it a superior tool for scRNA-seq analysis on the Python-based platform.
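The abstract names adaptive weight sampling as the way scBalance handles imbalanced data but does not spell out the procedure. As a rough illustration of the general idea only, and not the scBalance implementation, the sketch below draws training cells with probability inversely proportional to the size of their cell type, so that a rare type appears in a batch about as often as a common one. The function name `balanced_sample_indices`, the toy labels, and the inverse-frequency weighting are all assumptions made for this example.

```python
import random
from collections import Counter


def balanced_sample_indices(labels, batch_size, seed=0):
    """Sample cell indices with weight 1 / (size of the cell's class).

    Hypothetical helper for illustration; not part of scBalance.
    """
    counts = Counter(labels)
    # Every class ends up with the same total weight, no matter how
    # many cells it contains.
    weights = [1.0 / counts[label] for label in labels]
    rng = random.Random(seed)
    # Sampling is with replacement, so rare cells can be drawn repeatedly.
    return rng.choices(range(len(labels)), weights=weights, k=batch_size)


# Made-up toy data: 95 cells of a common type, 5 cells of a rare type.
labels = ["T cell"] * 95 + ["rare cell"] * 5
batch = balanced_sample_indices(labels, batch_size=2000)
rare_fraction = sum(labels[i] == "rare cell" for i in batch) / len(batch)
print(f"rare cells make up {rare_fraction:.2f} of the sampled batch")
```

In the toy data the rare type is 5% of the cells, yet it should make up roughly half of each sampled batch, which is the effect a balanced sampler aims for before the batch is passed to a classifier.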
Keyphrases
  • single cell
  • rna seq
  • neural network
  • high throughput
  • machine learning
  • weight loss
  • working memory
  • emergency department
  • gene expression
  • data analysis
  • cell therapy
  • low cost
  • patient safety
  • big data