
Regularized selection indices for breeding value prediction using hyper-spectral image data.

Marco Lopez-Cruz, Eric Olson, Gabriel Rovere, José Crossa, Susanne Dreisigacker, Suchismita Mondal, Ravi Prakash Singh, Gustavo de los Campos
Published in: Scientific Reports (2020)
High-throughput phenotyping (HTP) technologies can produce data on thousands of phenotypes per unit being monitored. These data can be used to breed for economically and environmentally relevant traits (e.g., drought tolerance); however, incorporating high-dimensional phenotypes in genetic analyses and in breeding schemes poses important statistical and computational challenges. To address this problem, we developed regularized selection indices; the methodology integrates techniques commonly used in high-dimensional phenotypic regressions (including penalization and rank-reduction approaches) into the selection index (SI) framework. Using extensive data from the wheat breeding program of CIMMYT (International Maize and Wheat Improvement Center), we show that regularized SIs derived from hyper-spectral data predict grain yield with consistently higher accuracy than standard SIs and than vegetation indices commonly used to predict agronomic traits. Regularized SIs offer an effective approach to leverage HTP data that are routinely generated in agriculture; the methodology can also be used to conduct genetic studies using high-dimensional phenotypes that are often collected in humans and model organisms, such as body images and whole-genome gene expression profiles.
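As a rough illustration of the penalization idea, the sketch below computes ridge-type selection index weights by solving (P + λI)β = g, where P is the phenotypic covariance matrix of the measured traits (e.g., reflectance at each waveband) and g holds the genetic covariances between those traits and the target trait. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `selection_index_weights`, the penalty value `lam`, and the simulated inputs (including the placeholder vector `g`) are all hypothetical, and the paper's own penalties and rank-reduction variants may differ in detail.

```python
import numpy as np

def selection_index_weights(P, g, lam):
    """Ridge-type index weights: solve (P + lam * I) beta = g.

    P   : (p, p) phenotypic covariance of the measured traits
    g   : (p,)   genetic covariances between measured traits and the target
    lam : ridge penalty (assumed > 0 here so the system is well posed)
    """
    p = P.shape[0]
    return np.linalg.solve(P + lam * np.eye(p), g)

# Simulated stand-in for hyper-spectral data: few plots, many wavebands.
rng = np.random.default_rng(0)
n, p = 50, 200
X = rng.normal(size=(n, p))
X -= X.mean(axis=0)                # center each waveband
P = X.T @ X / (n - 1)              # sample covariance; rank-deficient because p > n
g = 0.1 * rng.normal(size=p)       # hypothetical genetic covariances (placeholder)

# With p > n the unpenalized system P beta = g is singular,
# which is the situation the penalty is meant to handle.
rank_P = np.linalg.matrix_rank(P)

beta_weak = selection_index_weights(P, g, lam=1.0)
beta_strong = selection_index_weights(P, g, lam=10.0)

# The index itself is a weighted sum of the measured traits for each plot.
index = X @ beta_weak
```

A larger `lam` shrinks the weights toward zero, trading some bias for stability when the number of wavebands exceeds the number of observations.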
Keyphrases
  • electronic health record
  • high throughput
  • big data
  • genome wide
  • deep learning
  • optical coherence tomography
  • magnetic resonance imaging
  • copy number
  • machine learning
  • data analysis
  • convolutional neural network