A Bayesian method to infer copy number clones from single-cell RNA and ATAC sequencing.
Lucrezia PatrunoSalvatore MiliteRiccardo BergaminNicola CalonaciAlberto d'OnofrioFabio AnselmiMarco AntoniottiAlex GraudenziGiulio CaravagnaPublished in: PLoS computational biology (2023)
Single-cell RNA and ATAC sequencing technologies enable the examination of gene expression and chromatin accessibility in individual cells, providing insights into cellular phenotypes. In cancer research, it is important to consistently analyze these states within an evolutionary context on genetic clones. Here we present CONGAS+, a Bayesian model to map single-cell RNA and ATAC profiles onto the latent space of copy number clones. CONGAS+ clusters cells into tumour subclones with similar ploidy, rendering straightforward to compare their expression and chromatin profiles. The framework, implemented on GPU and tested on real and simulated data, scales to analyse seamlessly thousands of cells, demonstrating better performance than single-molecule models, and supporting new multi-omics assays. In prostate cancer, lymphoma and basal cell carcinoma, CONGAS+ successfully identifies complex subclonal architectures while providing a coherent mapping between ATAC and RNA, facilitating the study of genotype-phenotype maps and their connection to genomic instability.
Keyphrases
- copy number
- single cell
- genome wide
- mitochondrial dna
- gene expression
- induced apoptosis
- rna seq
- prostate cancer
- dna methylation
- single molecule
- cell cycle arrest
- high throughput
- dna damage
- endoplasmic reticulum stress
- poor prognosis
- oxidative stress
- diffuse large b cell lymphoma
- radical prostatectomy
- cell death
- high resolution
- machine learning
- electronic health record
- big data
- mass spectrometry
- long non coding rna
- high density
- artificial intelligence
- lymph node metastasis