Mapping the drivers of within-host pathogen evolution using massive data sets.
Duncan S PalmerIsaac TurnerSarah FidlerJohn FraterDominique GoedhalsPhilip GoulderKuan-Hsiang Gary HuangAnnette OxeniusRodney PhillipsRoger ShapiroCloete van VuurenAngela R McLeanGilean McVeanPublished in: Nature communications (2019)
Differences among hosts, resulting from genetic variation in the immune system or heterogeneity in drug treatment, can impact within-host pathogen evolution. Genetic association studies can potentially identify such interactions. However, extensive and correlated genetic population structure in hosts and pathogens presents a substantial risk of confounding analyses. Moreover, the multiple testing burden of interaction scanning can potentially limit power. We present a Bayesian approach for detecting host influences on pathogen evolution that exploits vast existing data sets of pathogen diversity to improve power and control for stratification. The approach models key processes, including recombination and selection, and identifies regions of the pathogen genome affected by host factors. Our simulations and empirical analysis of drug-induced selection on the HIV-1 genome show that the method recovers known associations and has superior precision-recall characteristics compared to other approaches. We build a high-resolution map of HLA-induced selection in the HIV-1 genome, identifying novel epitope-allele combinations.
Keyphrases
- drug induced
- genome wide
- high resolution
- liver injury
- candida albicans
- antiretroviral therapy
- hiv positive
- hiv infected
- human immunodeficiency virus
- hiv testing
- dna methylation
- electronic health record
- hepatitis c virus
- hiv aids
- copy number
- men who have sex with men
- adverse drug
- diabetic rats
- gene expression
- emergency department
- dna repair
- molecular dynamics
- mass spectrometry
- single cell
- artificial intelligence
- deep learning
- data analysis
- endothelial cells
- replacement therapy
- case control