FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies.
Imad AbugessaisaShuhei NoguchiAkira HasegawaJayson HarshbargerAtsushi KondoMarina LizioJessica SeverinPiero CarninciHideya KawajiTakeya KasukawaPublished in: Scientific data (2017)
The FANTOM5 consortium described the promoter-level expression atlas of human and mouse by using CAGE (Cap Analysis of Gene Expression) with single molecule sequencing. In the original publications, GRCh37/hg19 and NCBI37/mm9 assemblies were used as the reference genomes of human and mouse respectively; later, the Genome Reference Consortium released newer genome assemblies GRCh38/hg38 and GRCm38/mm10. To increase the utility of the atlas in forthcoming researches, we reprocessed the data to make them available on the recent genome assemblies. The data include observed frequencies of transcription starting sites (TSSs) based on the realignment of CAGE reads, and TSS peaks that are converted from those based on the previous reference. Annotations of the peak names were also updated based on the latest public databases. The reprocessed results enable us to examine frequencies of transcription initiations on the recent genome assemblies and to refer promoters with updated information across the genome assemblies consistently.
Keyphrases
- endothelial cells
- gene expression
- single molecule
- genome wide
- dna methylation
- induced pluripotent stem cells
- single cell
- pluripotent stem cells
- big data
- healthcare
- poor prognosis
- living cells
- mental health
- machine learning
- artificial intelligence
- deep learning
- long non coding rna
- health information
- adverse drug
- high speed
- aqueous solution