A machine learning approach to brain epigenetic analysis reveals kinases associated with Alzheimer's disease.
Yanting HuangXiaobo SunHuige JiangShaojun YuChloe RobinsMatthew J ArmstrongRonghua LiZhen MeiXiaochuan ShiEkaterina Sergeevna GerasimovPhilip Lawrence De JagerDavid A BennettAliza P WingoPeng JinThomas S WingoZhaohui S QinPublished in: Nature communications (2021)
Alzheimer's disease (AD) is influenced by both genetic and environmental factors; thus, brain epigenomic alterations may provide insights into AD pathogenesis. Multiple array-based Epigenome-Wide Association Studies (EWASs) have identified robust brain methylation changes in AD; however, array-based assays only test about 2% of all CpG sites in the genome. Here, we develop EWASplus, a computational method that uses a supervised machine learning strategy to extend EWAS coverage to the entire genome. Application to six AD-related traits predicts hundreds of new significant brain CpGs associated with AD, some of which are further validated experimentally. EWASplus also performs well on data collected from independent cohorts and different brain regions. Genes found near top EWASplus loci are enriched for kinases and for genes with evidence for physical interactions with known AD genes. In this work, we show that EWASplus implicates additional epigenetic loci for AD that are not found using array-based AD EWASs.
Keyphrases
- genome wide
- dna methylation
- machine learning
- resting state
- white matter
- functional connectivity
- high throughput
- gene expression
- copy number
- cerebral ischemia
- big data
- mental health
- mass spectrometry
- artificial intelligence
- multiple sclerosis
- brain injury
- blood brain barrier
- electronic health record
- high density
- bioinformatics analysis