Classification and deep-learning-based prediction of Alzheimer disease subtypes by using genomic data.
Daichi ShigemizuShintaro AkiyamaMutsumi SuganumaMotoki FurutaniAkiko YamakawaYukiko NakanoKouichi OzakiShumpei NiidaPublished in: Translational psychiatry (2023)
Late-onset Alzheimer's disease (LOAD) is the most common multifactorial neurodegenerative disease among elderly people. LOAD is heterogeneous, and the symptoms vary among patients. Genome-wide association studies (GWAS) have identified genetic risk factors for LOAD but not for LOAD subtypes. Here, we examined the genetic architecture of LOAD based on Japanese GWAS data from 1947 patients and 2192 cognitively normal controls in a discovery cohort and 847 patients and 2298 controls in an independent validation cohort. Two distinct groups of LOAD patients were identified. One was characterized by major risk genes for developing LOAD (APOC1 and APOC1P1) and immune-related genes (RELB and CBLC). The other was characterized by genes associated with kidney disorders (AXDND1, FBP1, and MIR2278). Subsequent analysis of albumin and hemoglobin values from routine blood test results suggested that impaired kidney function could lead to LOAD pathogenesis. We developed a prediction model for LOAD subtypes using a deep neural network, which achieved an accuracy of 0.694 (2870/4137) in the discovery cohort and 0.687 (2162/3145) in the validation cohort. These findings provide new insights into the pathogenic mechanisms of LOAD.
Keyphrases
- end stage renal disease
- late onset
- deep learning
- ejection fraction
- chronic kidney disease
- newly diagnosed
- peritoneal dialysis
- gene expression
- neural network
- early onset
- cell proliferation
- small molecule
- mild cognitive impairment
- patient reported outcomes
- physical activity
- electronic health record
- dna methylation
- artificial intelligence
- transcription factor
- genome wide association
- big data