Gene Expression Nebulas (GEN): a comprehensive data portal integrating transcriptomic profiles across multiple species at both bulk and single-cell levels.
Yuansheng ZhangDong ZouTongtong ZhuTianyi XuMing ChenGuangyi NiuWenting ZongRong PanWei JingJian SangChang LiuYujia XiongYubin SunShuang ZhaiHuanxin ChenWenming ZhaoJingfa XiaoYiming BaoLili HaoZhang ZhangPublished in: Nucleic acids research (2021)
Transcriptomic profiling is critical to uncovering functional elements from transcriptional and post-transcriptional aspects. Here, we present Gene Expression Nebulas (GEN, https://ngdc.cncb.ac.cn/gen/), an open-access data portal integrating transcriptomic profiles under various biological contexts. GEN features a curated collection of high-quality bulk and single-cell RNA sequencing datasets by using standardized data processing pipelines and a structured curation model. Currently, GEN houses a large number of gene expression profiles from 323 datasets (157 bulk and 166 single-cell), covering 50 500 samples and 15 540 169 cells across 30 species, which are further categorized into six biological contexts. Moreover, GEN integrates a full range of transcriptomic profiles on expression, RNA editing and alternative splicing for 10 bulk datasets, providing opportunities for users to conduct integrative analysis at both transcriptional and post-transcriptional levels. In addition, GEN provides abundant gene annotations based on value-added curation of transcriptomic profiles and delivers online services for data analysis and visualization. Collectively, GEN presents a comprehensive collection of transcriptomic profiles across multiple species, thus serving as a fundamental resource for better understanding genetic regulatory architecture and functional mechanisms from tissues to cells.
Keyphrases
- single cell
- rna seq
- gene expression
- data analysis
- high throughput
- transcription factor
- induced apoptosis
- dna methylation
- genome wide
- electronic health record
- cell cycle arrest
- healthcare
- poor prognosis
- crispr cas
- primary care
- cell death
- social media
- squamous cell carcinoma
- artificial intelligence
- genome wide identification
- binding protein
- heat stress
- genome wide analysis