Login / Signup

A note on simulating null distributions for G matrix comparisons.

Michael B MorrisseySandra HangartnerKeyne Monro
Published in: Evolution; international journal of organic evolution (2019)
Genetic variances and covariances, summarized in G matrices, are key determinants of the course of adaptive evolution. Consequently, understanding how G matrices vary among populations is critical to answering a variety of questions in evolutionary biology. A method has recently been proposed for generating null distributions of statistics pertaining to differences in G matrices among populations. The general approach facilitated by this method is likely to prove to be very important in studies of the evolution of G. We have identified an issue in the method that will cause it to create null distributions of differences in G matrices that are likely to be far too narrow. The issue arises from the fact that the method as currently used generates null distributions of statistics pertaining to differences in G matrices across populations by simulating breeding value vectors based on G matrices estimated from data, randomizing these vectors across populations, and then calculating null values of statistics from G matrices that are calculated directly from the variances and covariances among randomized vectors. This calculation treats breeding values as quantities that are directly measurable, instead of predicted from G matrices that are themselves estimated from patterns of covariance among kin. The existing method thus neglects a major source of uncertainty in G matrices, which renders it anti-conservative. We first suggest a correction to the method. We then apply the original and modified methods to a very simple instructive scenario. Finally, we demonstrate the use of both methods in the analysis of a real data set.
Keyphrases
  • electronic health record
  • genome wide
  • big data
  • gene expression
  • machine learning
  • monte carlo
  • double blind
  • deep learning
  • copy number
  • artificial intelligence
  • phase iii
  • data analysis