Login / Signup

Genomic selection for genotype performance and stability using information on multiple traits and multiple environments.

Jon BančičBen OvendenGregor GorjancDaniel J Tolhurst
Published in: TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik (2023)
The inclusion of multiple traits and multiple environments within a partially separable factor analytic approach for genomic selection provides breeders with an informative framework to utilise genotype by environment by trait interaction for efficient selection. This paper develops a single-stage genomic selection (GS) approach which incorporates information on multiple traits and multiple environments within a partially separable factor analytic framework. The factor analytic linear mixed model is an effective method for analysing multi-environment trial (MET) datasets, but has not been extended to GS for multiple traits and multiple environments. The advantage of using all information is that breeders can utilise genotype by environment by trait interaction (GETI) to obtain more accurate predictions across correlated traits and environments. The partially separable factor analytic linear mixed model (SFA-LMM) developed in this paper is based on a three-way separable structure, which includes a factor analytic matrix between traits, a factor analytic matrix between environments and a genomic relationship matrix between genotypes. A diagonal matrix is then added to enable a different genotype by environment interaction (GEI) pattern for each trait and a different genotype by trait interaction (GTI) pattern for each environment. The results show that the SFA-LMM provides a better fit than separable approaches and a comparable fit to non-separable and partially separable approaches. The distinguishing feature of the SFA-LMM is that it will include fewer parameters than all other approaches as the number of genotypes, traits and environments increases. Lastly, a selection index is used to demonstrate simultaneous selection for overall performance and stability. This research represents an important continuation in the advancement of plant breeding analyses, particularly with the advent of high-throughput datasets involving a very large number of genotypes, traits and environments.
Keyphrases
  • genome wide
  • copy number
  • high throughput
  • dna methylation
  • clinical trial
  • machine learning
  • high resolution
  • deep learning
  • health information
  • social media
  • rna seq