ggComp enables dissection of germplasm resources and construction of a multiscale germplasm network in wheat.
Zhengzhao YangZihao WangWenxi WangXiaoming XieLingling ChaiXiaobo WangXibo FengJinghui LiHuiru PengZhenqi SuMingshan YouYingyin YaoMingming XinZhaorong HuJie LiuRongqi LiangZhongfu NiQixin SunWeilong GuoPublished in: Plant physiology (2022)
Accurate germplasm characterization is a vital step for accelerating crop genetic improvement, which remains largely infeasible for crops such as bread wheat (Triticum aestivum L.), which has a complex genome that undergoes frequent introgression and contains many structural variations. Here, we propose a genomic strategy called ggComp, which integrates resequencing data with copy number variations and stratified single-nucleotide polymorphism densities to enable unsupervised identification of pairwise germplasm resource-based Identity-By-Descent (gIBD) blocks. The reliability of ggComp was verified in wheat cultivar Nongda5181 by dissecting parental-descent patterns represented by inherited genomic blocks. With gIBD blocks identified among 212 wheat accessions, we constructed a multi-scale genomic-based germplasm network. At the whole-genome level, the network helps to clarify pedigree relationship, demonstrate genetic flow, and identify key founder lines. At the chromosome level, we were able to trace the utilization of 1RS introgression in modern wheat breeding by hitchhiked segments. At the single block scale, the dissected germplasm-based haplotypes nicely matched with previously identified alleles of "Green Revolution" genes and can guide allele mining and dissect the trajectory of beneficial alleles in wheat breeding. Our work presents a model-based framework for precisely evaluating germplasm resources with genomic data. A database, WheatCompDB (http://wheat.cau.edu.cn/WheatCompDB/), is available for researchers to exploit the identified gIBDs with a multi-scale network.