Improving genetic risk modeling of dementia from real-world data in underrepresented populations.
Timothy ChangMingzhou FuLeopoldo Valiente-BanuetSatpal WadhwaBogdan PasaniucKeith VosselPublished in: Research square (2024)
BACKGROUND : Genetic risk modeling for dementia offers significant benefits, but studies based on real-world data, particularly for underrepresented populations, are limited. METHODS : We employed an Elastic Net model for dementia risk prediction using single-nucleotide polymorphisms prioritized by functional genomic data from multiple neurodegenerative disease genome-wide association studies. We compared this model with APOE and polygenic risk score models across genetic ancestry groups, using electronic health records from UCLA Health for discovery and All of Us cohort for validation. RESULTS : Our model significantly outperforms other models across multiple ancestries, improving the area-under-precision-recall curve by 21-61% and the area-under-the-receiver-operating characteristic by 10-21% compared to the APOE and the polygenic risk score models. We identified shared and ancestry-specific risk genes and biological pathways, reinforcing and adding to existing knowledge. CONCLUSIONS : Our study highlights benefits of integrating functional mapping, multiple neurodegenerative diseases, and machine learning for genetic risk models in diverse populations. Our findings hold potential for refining precision medicine strategies in dementia diagnosis.
Keyphrases
- electronic health record
- mild cognitive impairment
- genome wide
- machine learning
- healthcare
- copy number
- cognitive decline
- clinical decision support
- high fat diet
- gene expression
- high resolution
- mental health
- small molecule
- type diabetes
- metabolic syndrome
- dna methylation
- adipose tissue
- climate change
- human health
- skeletal muscle
- mass spectrometry
- genetic diversity
- case control
- insulin resistance
- breast cancer risk