Marker-free characterization of full-length transcriptomes of single live circulating tumor cells.
Sarita PooniaAnurag GoelSmriti ChawlaNamrata BhattacharyaPriyadarshini RaiYi Fang LeeYoon Sim YapJay WestAli Asgar BhagatJuhi TayalAnurag MehtaGaurav AhujaAngshul MajumdarNaveen RamalingamDebarka SenguptaPublished in: Genome research (2022)
The identification and characterization of circulating tumor cells (CTCs) are important for gaining insights into the biology of metastatic cancers, monitoring disease progression, and medical management of the disease. The limiting factor in the enrichment of purified CTC populations is their sparse availability, heterogeneity, and altered phenotypes relative to the primary tumor. Intensive research both at the technical and molecular fronts led to the development of assays that ease CTC detection and identification from peripheral blood. Most CTC detection methods based on single-cell RNA sequencing (scRNA-seq) use a mix of size selection, marker-based white blood cells (WBC) depletion, and antibodies targeting tumor-associated antigens. However, the majority of these methods either miss atypical CTCs or suffer from WBC contamination. We present unCTC, an R package for unbiased identification and characterization of CTCs from single-cell transcriptomic data. unCTC features many standard and novel computational and statistical modules for various analyses. These include a novel method of scRNA-seq clustering, named Deep Dictionary Learning using k -means clustering cost (DDLK), expression-based copy number variation (CNV) inference, and combinatorial, marker-based verification of the malignant phenotypes. DDLK enables robust segregation of CTCs and WBCs in the pathway space, as opposed to the gene expression space. We validated the utility of unCTC on scRNA-seq profiles of breast CTCs from six patients, captured and profiled using an integrated ClearCell FX and Polaris workflow that works by the principles of size-based separation of CTCs and marker-based WBC depletion.
Keyphrases
- circulating tumor cells
- single cell
- rna seq
- copy number
- high throughput
- gene expression
- circulating tumor
- peripheral blood
- end stage renal disease
- mitochondrial dna
- genome wide
- dna methylation
- newly diagnosed
- healthcare
- squamous cell carcinoma
- chronic kidney disease
- electronic health record
- ejection fraction
- small cell lung cancer
- poor prognosis
- risk assessment
- peritoneal dialysis
- cell cycle arrest
- prognostic factors
- real time pcr
- cancer therapy
- climate change
- dendritic cells
- drug delivery
- cell death
- machine learning
- mass spectrometry
- long non coding rna
- patient reported
- network analysis
- binding protein