Chromosome-level genome of Entada phaseoloides provides insights into genome evolution and biosynthesis of triterpenoid saponins.
Min Lin, Jian-Bo Jian, Zhu-Qing Zhou, Chun-Hai Chen, Wen Wang, Hui Xiong, Zhi-Nan Mei
Published in: Molecular Ecology Resources (2022)
As a medicinal herbal plant, Entada phaseoloides has high levels of secondary metabolites, particularly triterpenoid saponins, which are important resources for scientific research and medical applications. However, the lack of a reference genome for this genus has limited research on its evolution and the utilization of its medicinal potential. In this study, we report a chromosome-scale genome assembly for E. phaseoloides using Illumina sequencing, Nanopore long reads and high-throughput chromosome conformation capture (Hi-C) technology. The assembled reference genome is 456.18 Mb (scaffold N50 = 30.9 Mb; contig N50 = 6.34 Mb), with 95.71% of the sequences anchored onto 14 pseudochromosomes. E. phaseoloides was estimated to have diverged from the Leguminosae lineage approximately 72.0 million years ago. By integrating transcriptomic and metabolomic data, gene expression patterns and metabolite profiles of E. phaseoloides were determined in different tissues. The gene expression pattern and metabolic profile of the kernel were distinct from those of other tissues. Furthermore, the evolution of certain gene families involved in the biosynthesis of triterpenoid saponins and terpenes was analysed, offering new insights into the formation of these two metabolites. Four CYP genes, one UGT gene and related transcription factors were identified as candidate genes contributing to the regulation of triterpenoid saponin biosynthesis. As the first high-quality reference genome assembled for the genus Entada, this assembly will not only provide new information for evolutionary studies of the genus and the conservation biology of E. phaseoloides but also lay a foundation for studying the formation and utilization of secondary metabolites in medicinal plants.