De novo detection of somatic variants in long-read single-cell RNA sequencing data.
Arthur DondiNico BorgsmüllerPedro F FerreiraBrian J HaasFrancis JacobViola A Heinzelmann-Schwarznull nullNiko BeerenwinkelPublished in: bioRxiv : the preprint server for biology (2024)
In cancer, genetic and transcriptomic variations generate clonal heterogeneity, possibly leading to treatment resistance. Long-read single-cell RNA sequencing (LR scRNA-seq) has the potential to detect genetic and transcriptomic variations simultaneously. Here, we present LongSom, a computational workflow leveraging LR scRNA-seq data to call de novo somatic single-nucleotide variants (SNVs), copy-number alterations (CNAs), and gene fusions to reconstruct the tumor clonal heterogeneity. For SNV calling, LongSom distinguishes somatic SNVs from germline polymorphisms by reannotating marker gene expression-based cell types using called variants and applying strict filters. Applying LongSom to ovarian cancer samples, we detected clinically relevant somatic SNVs that were validated against single-cell and bulk panel DNA-seq data and could not be detected with short-read (SR) scRNA-seq. Leveraging somatic SNVs and fusions, LongSom found subclones with different predicted treatment outcomes. In summary, LongSom enables de novo SNVs, CNAs, and fusions detection, thus enabling the study of cancer evolution, clonal heterogeneity, and treatment resistance.
Keyphrases
- single cell
- copy number
- rna seq
- mitochondrial dna
- genome wide
- dna methylation
- high throughput
- gene expression
- electronic health record
- single molecule
- papillary thyroid
- big data
- machine learning
- squamous cell carcinoma
- circulating tumor
- dna repair
- dna damage
- mesenchymal stem cells
- cell free
- young adults
- deep learning
- childhood cancer
- transcription factor
- nucleic acid
- replacement therapy