Contemporary Ne estimation using temporally spaced data with linked loci.
Tin-Yu J HuiJon Haël BrenasAustin BurtPublished in: Molecular ecology resources (2021)
The contemporary effective population size N e is important in many disciplines including population genetics, conservation science and pest management. One of the most popular methods of estimating this quantity uses temporal changes in allele frequency due to genetic drift. A significant assumption of the existing methods is the independence among loci while constructing confidence intervals (CI), which restricts the types of species or genetic data applicable to the methods. Although genetic linkage does not bias point N e estimates, applying these methods to linked loci can yield unreliable CI that are far too narrow. We extend the current methods to enable the use of many linked loci to produce precise contemporary N e estimates, while preserving the targeted CI width and coverage. This is achieved by deriving the covariance of changes in allele frequency at linked loci in the face of recombination and sampling errors, such that the extra sampling variance due to between-locus correlation is properly handled. Extensive simulations are used to verify the new method. We apply the method to two temporally spaced genomic data sets of Anopheles mosquitoes collected from a cluster of villages in Burkina Faso between 2012 and 2014. With over 33,000 linked loci considered, the N e estimate for Anopheles coluzzii is 9,242 (95% CI 5,702-24,282), and for Anopheles gambiae it is 4,826 (95% CI 3,602-7,353).
Keyphrases
- genome wide
- genome wide association study
- dna methylation
- copy number
- genome wide association
- aedes aegypti
- electronic health record
- big data
- public health
- emergency department
- patient safety
- oxidative stress
- dna damage
- data analysis
- cancer therapy
- deep learning
- drug delivery
- healthcare
- human immunodeficiency virus
- hiv testing
- men who have sex with men
- antiretroviral therapy
- affordable care act