Cardelino: computational integration of somatic clonal substructure and single-cell transcriptomes.
Davis J McCarthyRaghd RostomYuanhua HuangDaniel J KunzPetr DanecekMarc Jan BonderTzachi HagaiRuqian Lyunull nullWenyi WangDaniel J GaffneyBenjamin David SimonsOliver StegleSarah A TeichmannPublished in: Nature methods (2020)
Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computational method for inferring the clonal tree configuration and the clone of origin of individual cells assayed using single-cell RNA-seq (scRNA-seq). Cardelino flexibly integrates information from imperfect clonal trees inferred based on bulk exome-seq data, and sparse variant alleles expressed in scRNA-seq data. We apply cardelino to a published cancer dataset and to newly generated matched scRNA-seq and exome-seq data from 32 human dermal fibroblast lines, identifying hundreds of differentially expressed genes between cells from different somatic clones. These genes are frequently enriched for cell cycle and proliferation pathways, indicating a role for cell division genes in somatic evolution in healthy skin.
Keyphrases
- single cell
- rna seq
- copy number
- genome wide
- cell cycle
- induced apoptosis
- high throughput
- electronic health record
- cell cycle arrest
- big data
- dna methylation
- endothelial cells
- signaling pathway
- cell proliferation
- gene expression
- genome wide identification
- randomized controlled trial
- wound healing
- stem cells
- squamous cell carcinoma
- transcription factor
- systematic review
- deep learning
- machine learning
- pi k akt
- childhood cancer