Multimodal hierarchical classification of CITE-seq data delineates immune cell states across lineages and tissues.
Daniel P CaronWilliam L SpechtDavid ChenSteven B WellsPeter A SzaboIsaac J JensenDonna L FarberPeter A SimsPublished in: bioRxiv : the preprint server for biology (2023)
Single-cell RNA sequencing (scRNA-seq) is invaluable for profiling cellular heterogeneity and dissecting transcriptional states, but transcriptomic profiles do not always delineate subsets defined by surface proteins, as in cells of the immune system. Cellular Indexing of Transcriptomes and Epitopes (CITE-seq) enables simultaneous profiling of single-cell transcriptomes and surface proteomes; however, accurate cell type annotation requires a classifier that integrates this multimodal data. Here, we describe M ulti Mo dal C lassifier Hi erarchy (MMoCHi), a marker-based approach for classification, reconciling gene and protein expression without reliance on reference atlases. We benchmark MMoCHi using sorted T lymphocyte subsets and annotate a cross-tissue human immune cell dataset. MMoCHi outperforms leading transcriptome-based classifiers and multimodal unsupervised clustering in its ability to identify immune cell subsets that are not readily resolved and to reveal novel subset markers. MMoCHi is designed for adaptability and can integrate CITE-seq annotation of cell types and developmental states across diverse lineages, tissues, or individuals.
Keyphrases
- genome wide identification
- single cell
- rna seq
- peripheral blood
- machine learning
- gene expression
- high throughput
- pain management
- deep learning
- electronic health record
- big data
- induced apoptosis
- endothelial cells
- cell cycle arrest
- high resolution
- endoplasmic reticulum stress
- genome wide
- artificial intelligence
- dna methylation
- data analysis
- mass spectrometry
- chronic pain
- pluripotent stem cells
- cell proliferation
- heat shock protein
- copy number