Semiautomated Feature Extraction from RGB Images for Sorghum Panicle Architecture GWAS.
Yan ZhouSrikant SrinivasanSeyed Vahid MirnezamiAaron KusmecQi FuLakshmi AttigalaMaria G Salas FernandezBaskar GanapathysubramanianPatrick S SchnablePublished in: Plant physiology (2018)
Because structural variation in the inflorescence architecture of cereal crops can influence yield, it is of interest to identify the genes responsible for this variation. However, the manual collection of inflorescence phenotypes can be time consuming for the large populations needed to conduct genome-wide association studies (GWAS) and is difficult for multidimensional traits such as volume. A semiautomated phenotyping pipeline, TIM (Toolkit for Inflorescence Measurement), was developed and used to extract unidimensional and multidimensional features from images of 1,064 sorghum (Sorghum bicolor) panicles from 272 genotypes comprising a subset of the Sorghum Association Panel. GWAS detected 35 unique single-nucleotide polymorphisms associated with variation in inflorescence architecture. The accuracy of the TIM pipeline is supported by the fact that several of these trait-associated single-nucleotide polymorphisms (TASs) are located within chromosomal regions associated with similar traits in previously published quantitative trait locus and GWAS analyses of sorghum. Additionally, sorghum homologs of maize (Zea mays) and rice (Oryza sativa) genes known to affect inflorescence architecture are enriched in the vicinities of TASs. Finally, our TASs are enriched within genomic regions that exhibit high levels of divergence between converted tropical lines and cultivars, consistent with the hypothesis that these chromosomal intervals were targets of selection during modern breeding.
Keyphrases
- genome wide
- copy number
- deep learning
- dna methylation
- genome wide association
- convolutional neural network
- genome wide association study
- machine learning
- optical coherence tomography
- randomized controlled trial
- high throughput
- climate change
- bioinformatics analysis
- transcription factor
- genetic diversity
- single cell