Full-length transcriptome reconstruction reveals a large diversity of RNA and protein isoforms in rat hippocampus.
Xi WangXintian YouJulian D LangerJingyi HouFiona RupprechtIrena VlatkovicClaudia QuedenauGeorgi TushevIrina EpsteinBernhard SchaefkeWei SunLiang FangGuipeng LiYuhui HuBeatriz Alvarez-CastelaoWei ChenPublished in: Nature communications (2019)
Gene annotation is a critical resource in genomics research. Many computational approaches have been developed to assemble transcriptomes based on high-throughput short-read sequencing, however, only with limited accuracy. Here, we combine next-generation and third-generation sequencing to reconstruct a full-length transcriptome in the rat hippocampus, which is further validated using independent 5´ and 3´-end profiling approaches. In total, we detect 28,268 full-length transcripts (FLTs), covering 6,380 RefSeq genes and 849 unannotated loci. Based on these FLTs, we discover co-occurring alternative RNA processing events. Integrating with polysome profiling and ribosome footprinting data, we predict isoform-specific translational status and reconstruct an open reading frame (ORF)-eome. Notably, a high proportion of the predicted ORFs are validated by mass spectrometry-based proteomics. Moreover, we identify isoforms with subcellular localization pattern in neurons. Collectively, our data advance our knowledge of RNA and protein isoform diversity in the rat brain and provide a rich resource for functional studies.
Keyphrases
- single cell
- rna seq
- high throughput
- mass spectrometry
- genome wide
- electronic health record
- big data
- oxidative stress
- healthcare
- protein protein
- amino acid
- dna methylation
- nucleic acid
- genome wide identification
- liquid chromatography
- binding protein
- copy number
- gene expression
- small molecule
- prefrontal cortex
- cerebral ischemia
- genome wide analysis
- gas chromatography
- artificial intelligence
- data analysis
- genome wide association study
- case control
- bioinformatics analysis
- brain injury