Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations.
Ying WangJing GuoGuiyan NiJian YangPeter M VisscherLoic YengoPublished in: Nature communications (2020)
Polygenic scores (PGS) have been widely used to predict disease risk using variants identified from genome-wide association studies (GWAS). To date, most GWAS have been conducted in populations of European ancestry, which limits the use of GWAS-derived PGS in non-European ancestry populations. Here, we derive a theoretical model of the relative accuracy (RA) of PGS across ancestries. We show through extensive simulations that the RA of PGS based on genome-wide significant SNPs can be predicted accurately from modelling linkage disequilibrium (LD), minor allele frequencies (MAF), cross-population correlations of causal SNP effects and heritability. We find that LD and MAF differences between ancestries can explain between 70 and 80% of the loss of RA of European-based PGS in African ancestry for traits like body mass index and type 2 diabetes. Our results suggest that causal variants underlying common genetic variation identified in European ancestry GWAS are mostly shared across continents.
Keyphrases
- genome wide
- genome wide association study
- copy number
- genome wide association
- type diabetes
- dna methylation
- rheumatoid arthritis
- disease activity
- genetic diversity
- cardiovascular disease
- insulin resistance
- molecular dynamics
- adipose tissue
- metabolic syndrome
- interstitial lung disease
- hepatitis c virus
- human immunodeficiency virus
- antiretroviral therapy
- weight loss