De novo detection of somatic variants in long-read single-cell RNA sequencing data.
Arthur DondiNico BorgsmüllerPedro F FerreiraBrian J HaasFrancis JacobViola A Heinzelmann-Schwarznull nullNiko BeerenwinkelPublished in: bioRxiv : the preprint server for biology (2024)
In cancer, genetic and transcriptomic variations generate clonal heterogeneity, possibly leading to treatment resistance. Long-read single-cell RNA sequencing (LR scRNA-seq) has the potential to detect genetic and transcriptomic variations simultaneously. Here, we present LongSom, a computational workflow leveraging LR scRNA-seq data to call de novo somatic single-nucleotide variants (SNVs), copy-number alterations (CNAs), and gene fusions to reconstruct the tumor clonal heterogeneity. For SNV calling, LongSom distinguishes somatic SNVs from germline polymorphisms by reannotating marker gene expression-based cell types using called variants and applying strict filters. Applying LongSom to ovarian cancer samples, we detected clinically relevant somatic SNVs that were validated against single-cell and bulk panel DNA-seq data and could not be detected with short-read (SR) scRNA-seq. Leveraging somatic SNVs and fusions, LongSom found subclones with different predicted treatment outcomes. In summary, LongSom enables de novo SNVs, CNAs, and fusions detection, thus enabling the study of cancer evolution, clonal heterogeneity, and treatment resistance.
Keyphrases
- single cell
- copy number
- rna seq
- mitochondrial dna
- genome wide
- dna methylation
- high throughput
- gene expression
- electronic health record
- single molecule
- papillary thyroid
- big data
- stem cells
- data analysis
- real time pcr
- label free
- cell therapy
- combination therapy
- dna damage
- lymph node metastasis
- childhood cancer
- quantum dots
- circulating tumor cells
- cell free
- bone marrow