CamoTSS: analysis of alternative transcription start sites for cellular phenotypes and regulatory patterns from 5' scRNA-seq data.
Ruiyan HouChung-Chau HonYuanhua HuangPublished in: Nature communications (2023)
Five-prime single-cell RNA-seq (scRNA-seq) has been widely employed to profile cellular transcriptomes, however, its power of analysing transcription start sites (TSS) has not been fully utilised. Here, we present a computational method suite, CamoTSS, to precisely identify TSS and quantify its expression by leveraging the cDNA on read 1, which enables effective detection of alternative TSS usage. With various experimental data sets, we have demonstrated that CamoTSS can accurately identify TSS and the detected alternative TSS usages showed strong specificity in different biological processes, including cell types across human organs, the development of human thymus, and cancer conditions. As evidenced in nasopharyngeal cancer, alternative TSS usage can also reveal regulatory patterns including systematic TSS dysregulations.
Keyphrases
- single cell
- rna seq
- high throughput
- endothelial cells
- transcription factor
- papillary thyroid
- electronic health record
- squamous cell
- induced pluripotent stem cells
- poor prognosis
- big data
- squamous cell carcinoma
- genome wide
- gene expression
- mesenchymal stem cells
- childhood cancer
- long non coding rna
- young adults
- lymph node metastasis
- single molecule