Improving the diversity of captured full-length isoforms using a normalized single-molecule RNA-sequencing method.
Yueming HuXing-Sheng ShuJiaxian YuMing-An SunZewei ChenXianming LiuQiongfang FangWei ZhangXinjie HuiYing YingLi FuDesheng LuRakesh KumarYejun WangPublished in: Communications biology (2020)
Human genes form a large variety of isoforms after transcription, encoding distinct transcripts to exert different functions. Single-molecule RNA sequencing facilitates accurate identification of the isoforms by extending nucleotide read length significantly. However, the gene or isoform diversity is lowly represented by the mRNA molecules captured by single-molecule RNA sequencing. Here, we show that a cDNA normalization procedure before the library preparation for PacBio RS II sequencing captures 3.2-6.0 fold more full-length high-quality isoform species for different human samples, as compared to the non-normalized capture procedure. Many lowly expressed, functionally important isoforms can be detected. In addition, normalized PacBio RNA sequencing also resolves more allele-specific haplotype transcripts. Finally, we apply the cDNA normalization based long-read RNA sequencing method to profile the transcriptome of human gastric signet-ring cell carcinomas, identify new cancer-specific transcriptome signatures, and thus, bring out the utility of the improved protocols in gene expression studies.
Keyphrases
- single molecule
- single cell
- rna seq
- endothelial cells
- gene expression
- atomic force microscopy
- living cells
- genome wide
- induced pluripotent stem cells
- dna methylation
- minimally invasive
- high grade
- stem cells
- high resolution
- transcription factor
- papillary thyroid
- young adults
- lymph node metastasis
- molecularly imprinted