Analysis of alternative polyadenylation from single-cell RNA-seq using scDaPars reveals cell subpopulations invisible to gene expression.
Yipeng GaoLei LiChristopher Ian AmosWeibo XiePublished in: Genome research (2021)
Alternative polyadenylation (APA) is a major mechanism of post-transcriptional regulation in various cellular processes including cell proliferation and differentiation, but the APA heterogeneity among single cells remains largely unknown. Single-cell RNA sequencing (scRNA-seq) has been extensively used to define cell subpopulations at the transcription level. Yet, most scRNA-seq data have not been analyzed in an "APA-aware" manner. Here, we introduce dynamic analysis of APA from single-cell RNA-seq (scDaPars), a bioinformatics algorithm to accurately quantify APA events at both single-cell and single-gene resolution using either 3'-end (10x Chromium) or full-length (Smart-seq2) scRNA-seq data. Validations in both real and simulated data indicate that scDaPars can robustly recover missing APA events caused by the low amounts of mRNA sequenced in single cells. When applied to cancer and human endoderm differentiation data, scDaPars not only revealed cell-type-specific APA regulation but also identified cell subpopulations that are otherwise invisible to conventional gene expression analysis. Thus, scDaPars will enable us to understand cellular heterogeneity at the post-transcriptional APA level.
Keyphrases
- single cell
- rna seq
- gene expression
- high throughput
- electronic health record
- induced apoptosis
- cell proliferation
- big data
- cell cycle arrest
- genome wide
- transcription factor
- genome wide identification
- copy number
- dna methylation
- machine learning
- signaling pathway
- cell cycle
- oxidative stress
- multidrug resistant
- mesenchymal stem cells
- lymph node metastasis
- squamous cell
- heat shock protein
- heat stress
- data analysis