Login / Signup

R H : a genetic metric for measuring intrahost Plasmodium falciparum relatedness and distinguishing cotransmission from superinfection.

Wesley WongSarah VolkmanRachel F DanielsStephen F SchaffnerMouhamad SyYaye Die NdiayeAida S BadianeAwa B DemeMamadou Alpha DialloJules GomisNgayo SyDaouda NdiayeDyann F WirthDaniel L Hartl
Published in: PNAS nexus (2022)
Multiple-strain (polygenomic) infections are a ubiquitous feature of Plasmodium falciparum parasite population genetics. Under simple assumptions of superinfection, polygenomic infections are hypothesized to be the result of multiple infectious bites. As a result, polygenomic infections have been used as evidence of repeat exposure and used to derive genetic metrics associated with high transmission intensity. However, not all polygenomic infections are the result of multiple infectious bites. Some result from the transmission of multiple, genetically related strains during a single infectious bite (cotransmission). Superinfection and cotransmission represent two distinct transmission processes, and distinguishing between the two could improve inferences regarding parasite transmission intensity. Here, we describe a new metric, R H , that utilizes the correlation in allelic state (heterozygosity) within polygenomic infections to estimate the likelihood that the observed complexity resulted from either superinfection or cotransmission. R H is flexible and can be applied to any type of genetic data. As a proof of concept, we used R H to quantify polygenomic relatedness and estimate cotransmission and superinfection rates from a set of 1,758 malaria infections genotyped with a 24 single nucleotide polymorphism (SNP) molecular barcode. Contrary to expectation, we found that cotransmission was responsible for a significant fraction of 43% to 53% of the polygenomic infections collected in three distinct epidemiological regions in Senegal. The prediction that polygenomic infections frequently result from cotransmission stresses the need to incorporate estimates of relatedness within polygenomic infections to ensure the accuracy of genomic epidemiology surveillance data for informing public health activities.
Keyphrases
  • plasmodium falciparum
  • public health
  • genome wide
  • machine learning
  • gene expression
  • risk factors
  • deep learning
  • single molecule