scSTAR reveals hidden heterogeneity with a real-virtual cell pair structure across conditions in single-cell RNA sequencing data.
Jie HaoJiawei ZouJiaqiang ZhangKe ChenDuojiao WuWei CaoGuoguo ShangJean Yee Hwa YangKongFatt Wong-LinHourong SunZhen ZhangXiangdong WangWantao ChenXin ZouPublished in: Briefings in bioinformatics (2023)
Cell-state transition can reveal additional information from single-cell ribonucleic acid (RNA)-sequencing data in time-resolved biological phenomena. However, most of the current methods are based on the time derivative of the gene expression state, which restricts them to the short-term evolution of cell states. Here, we present single-cell State Transition Across-samples of RNA-seq data (scSTAR), which overcomes this limitation by constructing a paired-cell projection between biological conditions with an arbitrary time span by maximizing the covariance between two feature spaces using partial least square and minimum squared error methods. In mouse ageing data, the response to stress in CD4+ memory T cell subtypes was found to be associated with ageing. A novel Treg subtype characterized by mTORC activation was identified to be associated with antitumour immune suppression, which was confirmed by immunofluorescence microscopy and survival analysis in 11 cancers from The Cancer Genome Atlas Program. On melanoma data, scSTAR improved immunotherapy-response prediction accuracy from 0.8 to 0.96.
Keyphrases
- single cell
- rna seq
- high throughput
- gene expression
- electronic health record
- big data
- mesenchymal stem cells
- dna methylation
- machine learning
- data analysis
- magnetic resonance imaging
- computed tomography
- high resolution
- optical coherence tomography
- artificial intelligence
- young adults
- mass spectrometry
- single molecule
- health information