A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response.
Yang Luo, Masahiro Kanai, Wanson Choi, Xinyi Li, Saori Sakaue, Kenichi Yamamoto, Kotaro Ogawa, Maria Gutierrez-Arcelus, Peter K Gregersen, Philip E Stuart, James T Elder, Lukas Forer, Sebastian Schönherr, Christian Fuchsberger, Albert Vernon Smith, Jacques Fellay, Mary N Carrington, David W Haas, Xiuqing Guo, Nicholette D D Allred, Yii-Der Ida Chen, Jerome I Rotter, Kent D Taylor, Stephen S Rich, Adolfo Correa, James G Wilson, Sekar Kathiresan, Michael H Cho, Andres Metspalu, Tõnu Esko, Yukinori Okada, Buhm Han, Paul J McLaren, Soumya Raychaudhuri
Published in: Nature genetics (2021)
Fine-mapping to plausible causal variation may be more effective in multi-ancestry cohorts, particularly in the MHC, which has population-specific structure. To enable such studies, we constructed a large (n = 21,546) HLA reference panel spanning five global populations based on whole-genome sequences. Despite population-specific long-range haplotypes, we demonstrated accurate imputation at G-group resolution (94.2%, 93.7%, 97.8% and 93.7% in admixed African (AA), East Asian (EAS), European (EUR) and Latino (LAT) populations, respectively). Applying HLA imputation to genome-wide association study data for HIV-1 viral load in three populations (EUR, AA and LAT), we obviated effects of previously reported associations from population-specific HIV studies and discovered a novel association at position 156 in HLA-B. We pinpointed the MHC association to three amino acid positions (97, 67 and 156) marking three consecutive pockets (C, B and D) within the HLA-B peptide-binding groove, explaining 12.9% of trait variance.
Keyphrases
- high resolution
- antiretroviral therapy
- genome wide association study
- hiv positive
- hiv infected
- human immunodeficiency virus
- hiv testing
- hepatitis c virus
- hiv aids
- men who have sex with men
- air pollution
- amino acid
- genetic diversity
- south africa
- electronic health record
- mass spectrometry
- big data
- machine learning
- case control
- binding protein
- dna binding