Login / Signup

Predictor bias in genomic and phenomic selection.

Hermann Gregor DallingerFranziska LöschenbergerHerbert BistrichChristian AmetzHerbert HetzendorferLaura MoralesSebastian MichelHermann Buerstmayr
Published in: TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik (2023)
NIRS of wheat grains as phenomic predictors for grain yield show inflated prediction ability and are biased toward grain protein content. Estimating the breeding value of individuals using genome-wide marker data (genomic prediction) is currently one of the most important drivers of breeding progress in major crops. Recently, phenomic technologies, including remote sensing and aerial hyperspectral imaging of plant canopies, have made it feasible to predict the breeding value of individuals in the absence of genetic marker data. This is commonly referred to as phenomic prediction. Hyperspectral measurements in the form of near-infrared spectroscopy have been used since the 1980 s to predict compositional parameters of harvest products. Moreover, in recent studies NIRS from grains was used to predict grain yield. The same studies showed that phenomic prediction can outperform genomic prediction for grain yield. The genome is static and not environment dependent, thereby limiting genomic prediction ability. Gene expression is tissue specific and differs under environmental influences, leading to a tissue- and environment-specific phenome, potentially explaining the higher predictive ability of phenomic prediction. Here, we compare genomic prediction and phenomic prediction from hyperspectral measurements of wheat grains for the prediction of a variety of traits including grain yield. We show that phenomic predictions outperform genomic prediction for some traits. However, phenomic predictions are biased toward the information present in the predictor. Future studies on this topic should investigate whether population parameters are retained in phenomic prediction as they are in genomic prediction. Furthermore, we find that unbiased phenomic prediction abilities are considerably lower than previously reported and recommend a method to circumvent this issue.
Keyphrases
  • genome wide
  • gene expression
  • copy number
  • healthcare
  • machine learning
  • big data
  • climate change
  • amino acid
  • mass spectrometry
  • binding protein
  • artificial intelligence