Monitoring transcription initiation activities in rat and dog.
Marina LizioAbdul Kadir MukarramMizuho OhnoShoko WatanabeMasayoshi ItohAkira HasegawaTimo LassmannJessica SeverinJayson HarshbargerImad AbugessaisaTakeya KasukawaChung-Chau HonPiero CarninciYoshihide HayashizakiAlistair R R ForrestHideya KawajiPublished in: Scientific data (2017)
The promoter landscape of several non-human model organisms is far from complete. As a part of FANTOM5 data collection, we generated 13 profiles of transcription initiation activities in dog and rat aortic smooth muscle cells, mesenchymal stem cells and hepatocytes by employing CAGE (Cap Analysis of Gene Expression) technology combined with single molecule sequencing. Our analyses show that the CAGE profiles recapitulate known transcription start sites (TSSs) consistently, in addition to uncover novel TSSs. Our dataset can be thus used with high confidence to support gene annotation in dog and rat species. We identified 28,497 and 23,147 CAGE peaks, or promoter regions, for rat and dog respectively, and associated them to known genes. This approach could be seen as a standard method for improvement of existing gene models, as well as discovery of novel genes. Given that the FANTOM5 data collection includes dog and rat matched cell types in human and mouse as well, this data would also be useful for cross-species studies.
Keyphrases
- gene expression
- single molecule
- transcription factor
- genome wide
- dna methylation
- oxidative stress
- mesenchymal stem cells
- endothelial cells
- genome wide identification
- single cell
- electronic health record
- big data
- aortic valve
- cell therapy
- copy number
- small molecule
- stem cells
- left ventricular
- heart failure
- machine learning
- high throughput
- genome wide analysis
- induced pluripotent stem cells
- atomic force microscopy
- atrial fibrillation
- deep learning