Login / Signup

Strong positive selection biases identity-by-descent-based inferences of recent demography and population structure in Plasmodium falciparum.

Bing GuoVíctor BordaRoland LaboulayeMichele D SpringMariusz WojnarskiBrian A VeselyJoana C SilvaNorman C WatersTimothy D O'ConnorShannon Takala-Harrison
Published in: Nature communications (2024)
Malaria genomic surveillance often estimates parasite genetic relatedness using metrics such as Identity-By-Decent (IBD), yet strong positive selection stemming from antimalarial drug resistance or other interventions may bias IBD-based estimates. In this study, we use simulations, a true IBD inference algorithm, and empirical data sets from different malaria transmission settings to investigate the extent of this bias and explore potential correction strategies. We analyze whole genome sequence data generated from 640 new and 3089 publicly available Plasmodium falciparum clinical isolates. We demonstrate that positive selection distorts IBD distributions, leading to underestimated effective population size and blurred population structure. Additionally, we discover that the removal of IBD peak regions partially restores the accuracy of IBD-based inferences, with this effect contingent on the population's background genetic relatedness and extent of inbreeding. Consequently, we advocate for selection correction for parasite populations undergoing strong, recent positive selection, particularly in high malaria transmission settings.
Keyphrases
  • plasmodium falciparum
  • ulcerative colitis
  • copy number
  • genome wide
  • big data
  • machine learning
  • gene expression
  • deep learning
  • dna methylation