Epigenome-augmented eQTL-hotspots reveal genome-wide transcriptional programs in 36 human tissues.
Huanhuan LiuQinwei ChenJintao GuoYing ZhouZhiyu YouJun RenYuanyuan ZengJing YangJialiang HuangQiyuan LiPublished in: Briefings in bioinformatics (2024)
Expression quantitative trait loci (eQTLs) are used to inform the mechanisms of transcriptional regulation in eukaryotic cells. However, the specificity of genome-wide eQTL identification is limited by stringent control for false discoveries. Here, we described a method based on the non-homogeneous Poisson process to identify 125 489 regions with highly frequent, multiple eQTL associations, or 'eQTL-hotspots', from the public database of 59 human tissues or cell types. We stratified the eQTL-hotspots into two classes with their distinct sequence and epigenomic characteristics. Based on these classifications, we developed a machine-learning model, E-SpotFinder, for augmented discovery of tissue- or cell-type-specific eQTL-hotspots. We applied this model to 36 tissues or cell types. Using augmented eQTL-hotspots, we recovered 655 402 eSNPs and reconstructed a comprehensive regulatory network of 2 725 380 cis-interactions among eQTL-hotspots. We further identified 52 012 modules representing transcriptional programs with unique functional backgrounds. In summary, our study provided a framework of epigenome-augmented eQTL analysis and thereby constructed comprehensive genome-wide networks of cis-regulations across diverse human tissues or cell types.
Keyphrases
- genome wide
- dna methylation
- gene expression
- endothelial cells
- single cell
- machine learning
- copy number
- cell therapy
- public health
- induced pluripotent stem cells
- healthcare
- transcription factor
- poor prognosis
- high throughput
- pluripotent stem cells
- high resolution
- artificial intelligence
- cell death
- bone marrow
- virtual reality
- emergency department
- wastewater treatment
- mesenchymal stem cells
- heat shock
- long non coding rna
- cell proliferation
- binding protein
- bioinformatics analysis
- genome wide association study
- adverse drug