Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes.
Ruli GaoShanshan BaiYing C HendersonYiyun LinAislyn SchalckYun YanTapsi KumarMin HuEmi SeiAlexander DavisFang WangSimona F ShaitelmanJennifer Rui WangKen ChenStacy MoulderStephen Y LaiNicholas E NavinPublished in: Nature biotechnology (2021)
Single-cell transcriptomic analysis is widely used to study human tumors. However, it remains challenging to distinguish normal cell types in the tumor microenvironment from malignant cells and to resolve clonal substructure within the tumor. To address these challenges, we developed an integrative Bayesian segmentation approach called copy number karyotyping of aneuploid tumors (CopyKAT) to estimate genomic copy number profiles at an average genomic resolution of 5 Mb from read depth in high-throughput single-cell RNA sequencing (scRNA-seq) data. We applied CopyKAT to analyze 46,501 single cells from 21 tumors, including triple-negative breast cancer, pancreatic ductal adenocarcinoma, anaplastic thyroid cancer, invasive ductal carcinoma and glioblastoma, to accurately (98%) distinguish cancer cells from normal cell types. In three breast tumors, CopyKAT resolved clonal subpopulations that differed in the expression of cancer genes, such as KRAS, and signatures, including epithelial-to-mesenchymal transition, DNA repair, apoptosis and hypoxia. These data show that CopyKAT can aid in the analysis of scRNA-seq data in a variety of solid human tumors.
Keyphrases
- single cell
- copy number
- mitochondrial dna
- rna seq
- high throughput
- genome wide
- endothelial cells
- dna repair
- dna methylation
- induced pluripotent stem cells
- electronic health record
- papillary thyroid
- induced apoptosis
- endoplasmic reticulum stress
- deep learning
- big data
- oxidative stress
- machine learning
- transcription factor
- single molecule
- cell therapy
- gene expression
- signaling pathway
- binding protein
- poor prognosis
- optical coherence tomography
- squamous cell carcinoma