Genome-wide structural and functional variant discovery of rice landraces using genotyping by sequencing.
Smitha Kunhiraman Vasumathy, Maya Peringottillam, Krishna T Sundaram, S Hari Krishna Kumar, Manickavelu Alagu. Published in: Molecular Biology Reports (2020)
Rice landraces are vital genetic resources for agronomic and quality traits, but the collection of Kerala landraces remains poorly delineated. To effectively conserve, manage, and use these resources, understanding the genomic structure of the germplasm is essential. Genotyping by sequencing (GBS) enabled the identification of a large number of single nucleotide polymorphisms (SNPs) and insertion-deletions (InDels) in 96 rice germplasm accessions. In the present study, a total of 16.9 × 10^7 reads were generated, of which 16.3 × 10^7 were mapped to the indica reference genome. Analysis of the GBS data revealed wide genomic variation, including 8,259,639 SNPs and 107,140 InDels. Both the neighbor-joining tree and principal coordinate analysis with InDel markers showed that the germplasm selected for this study is highly diverse in structure. We assembled the unmapped reads and used them for gene ontology analysis. These unmapped sequences, which are generally excluded from downstream GBS data analysis, may be an unexplored resource for novel and significant biological findings. SNPs discovered by haplotyping the GS3 and GIF1 genes provided insight into marker-assisted selection for grain size and yield and can be used for rice yield improvement. To our knowledge, this is the first report on structural variation analysis using the GBS platform in rice landraces collected from Kerala. The genomic information from this study provides valuable resources for understanding rice landrace structure and can also facilitate sequencing-based molecular breeding.
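
As a quick sanity check on the figures reported in the abstract, the following minimal Python sketch derives the mapping rate, the approximate number of unmapped reads, and the SNP-to-InDel ratio. The read counts are taken from the rounded values in the abstract (16.9 × 10^7 and 16.3 × 10^7), so the results are approximations; the function and constant names are illustrative assumptions and do not come from the study.

```python
# Counts as stated in the abstract; read totals are rounded there,
# so everything derived from them is approximate.
TOTAL_READS = 169_000_000   # 16.9 x 10^7 reads generated
MAPPED_READS = 163_000_000  # 16.3 x 10^7 reads mapped to the indica reference
SNP_COUNT = 8_259_639
INDEL_COUNT = 107_140


def mapping_rate(mapped: int, total: int) -> float:
    """Return the fraction of reads that mapped to the reference."""
    if total <= 0:
        raise ValueError("total read count must be positive")
    return mapped / total


def unmapped_reads(mapped: int, total: int) -> int:
    """Return the number of reads that did not map to the reference."""
    return total - mapped


def snp_to_indel_ratio(snps: int, indels: int) -> float:
    """Return how many SNPs were called per InDel."""
    if indels <= 0:
        raise ValueError("InDel count must be positive")
    return snps / indels


if __name__ == "__main__":
    rate = mapping_rate(MAPPED_READS, TOTAL_READS)
    print(f"Mapping rate: {rate:.1%}")
    print(f"Unmapped reads: {unmapped_reads(MAPPED_READS, TOTAL_READS):,}")
    print(f"SNPs per InDel: {snp_to_indel_ratio(SNP_COUNT, INDEL_COUNT):.1f}")
```

With these rounded inputs, roughly 96% of reads map to the reference, which leaves about six million unmapped reads as the pool that the study assembled for gene ontology analysis.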