Integrating muti-omics data to identify tissue-specific DNA methylation biomarkers for cancer risk.
Yaohua YangYaxin ChenShuai XuXingyi GuoGuochong JiaJie PingXiang ShuTianying ZhaoFangcheng YuanGang WangYufang XieHang CiHongmo LiuYawen QiYongjun LiuDan LiuWei-Min LiFei YeXiao-Ou ShuQuan LongLi LiQiuyin CaiJirong LongPublished in: Nature communications (2024)
The relationship between tissue-specific DNA methylation and cancer risk remains inadequately elucidated. Leveraging resources from the Genotype-Tissue Expression consortium, here we develop genetic models to predict DNA methylation at CpG sites across the genome for seven tissues and apply these models to genome-wide association study data of corresponding cancers, namely breast, colorectal, renal cell, lung, ovarian, prostate, and testicular germ cell cancers. At Bonferroni-corrected P < 0.05, we identify 4248 CpGs that are significantly associated with cancer risk, of which 95.4% (4052) are specific to a particular cancer type. Notably, 92 CpGs within 55 putative novel loci retain significant associations with cancer risk after conditioning on proximal signals identified by genome-wide association studies. Integrative multi-omics analyses reveal 854 CpG-gene-cancer trios, suggesting that DNA methylation at 309 distinct CpGs might influence cancer risk through regulating the expression of 205 unique cis-genes. These findings substantially advance our understanding of the interplay between genetics, epigenetics, and gene expression in cancer etiology.
Keyphrases
- dna methylation
- genome wide
- gene expression
- papillary thyroid
- copy number
- squamous cell
- single cell
- poor prognosis
- germ cell
- prostate cancer
- genome wide association
- stem cells
- big data
- electronic health record
- childhood cancer
- lymph node metastasis
- young adults
- squamous cell carcinoma
- transcription factor
- cell therapy