The promise and deceit of genomic selection component analyses.

John K Kelly
Published in: Proceedings. Biological sciences (2021)
Selection component analyses (SCA) relate individual genotype to fitness components such as viability, fecundity and mating success. SCA are based on population genetic models and yield selection estimates directly in terms of predicted allele frequency change. This paper explores the statistical properties of gSCA: experiments that apply SCA to genome-wide scoring of SNPs in field-sampled individuals. Computer simulations indicate that gSCA involving a few thousand genotyped samples can detect allele frequency changes of the magnitude documented in field experiments on diverse taxa. To detect selection, imprecise genotyping from low-level sequencing of large samples of individuals provides much greater power than precise genotyping of smaller samples. The simulations also demonstrate the efficacy of 'haplotype matching', a method that combines information from a limited collection of whole-genome sequences (the reference panel) with the much larger sample of field individuals that are measured for fitness. Pooled sequencing is demonstrated as another way to increase statistical power. Finally, I discuss the interpretation of selection estimates in relation to the Beavis effect, the overestimation of selection intensities at significant loci.
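The Beavis effect mentioned at the end of the abstract can be illustrated with a toy simulation. The sketch below is not the paper's method: it assumes a simple two-group contrast (for example, survivors versus non-survivors), the same true allele frequency difference `delta` at every locus, and a normal-approximation z-test. The function name and all parameter values are hypothetical and chosen only for illustration.

```python
import math
import random


def simulate_beavis(n_loci=1000, n_per_group=250, p=0.3, delta=0.02,
                    z_crit=1.96, seed=1):
    """Toy illustration (assumed setup, not the paper's likelihood model).

    Every locus has the same true allele frequency difference `delta`
    between two groups of diploid individuals. Loci are declared
    significant with a two-sample z-test, and the estimated differences
    at significant loci are then compared with the true value.
    """
    rng = random.Random(seed)
    n_alleles = 2 * n_per_group  # diploids: two allele copies per individual
    significant_estimates = []

    for _ in range(n_loci):
        # Sample allele counts in each group by independent Bernoulli draws.
        count_a = sum(rng.random() < p + delta / 2 for _ in range(n_alleles))
        count_b = sum(rng.random() < p - delta / 2 for _ in range(n_alleles))
        freq_a = count_a / n_alleles
        freq_b = count_b / n_alleles

        # Standard error of the difference under the pooled frequency.
        pooled = (count_a + count_b) / (2 * n_alleles)
        se = math.sqrt(2 * pooled * (1 - pooled) / n_alleles)

        estimate = freq_a - freq_b
        if se > 0 and abs(estimate) / se > z_crit:
            significant_estimates.append(abs(estimate))

    n_sig = len(significant_estimates)
    mean_sig = sum(significant_estimates) / n_sig if n_sig else float("nan")
    return {
        "true_delta": delta,
        "n_significant": n_sig,
        "mean_abs_estimate_significant": mean_sig,
    }


if __name__ == "__main__":
    print(simulate_beavis())
```

With these assumed settings the test has low power, so a locus can only reach significance when sampling error pushes its estimate well beyond `delta`. The mean estimate among significant loci therefore exceeds the true difference, which is the overestimation the abstract describes.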