READRetro: natural product biosynthesis predicting with retrieval-augmented dual-view retrosynthesis.

Taein Kim, Seul Lee, Yejin Kwak, Min-Soo Choi, Jeongbin Park, Sung Ju Hwang, Sang-Gyu Kim
Published in: The New phytologist (2024)
Plants, as sessile organisms, produce various secondary metabolites to interact with the environment. These chemicals have fascinated the plant science community because of their ecological significance and notable biological activity. However, predicting the complete biosynthetic pathways from target molecules to metabolic building blocks remains a challenge. Here, we propose retrieval-augmented dual-view retrosynthesis (READRetro) as a practical bio-retrosynthesis tool to predict the biosynthetic pathways of plant natural products. Conventional bio-retrosynthesis models have been limited in their ability to predict biosynthetic pathways for natural products. READRetro was optimized for the prediction of complex metabolic pathways by incorporating cutting-edge deep learning architectures, an ensemble approach, and two retrievers. Evaluation of single- and multi-step retrosynthesis showed that each component of READRetro significantly improved its ability to predict biosynthetic pathways. READRetro was also able to propose the known pathways of secondary metabolites such as monoterpene indole alkaloids and the unknown pathway of menisdaurilide, demonstrating its applicability to real-world bio-retrosynthesis of plant natural products. For researchers interested in the biosynthesis and production of secondary metabolites, a user-friendly website (https://readretro.net) and the open-source code of READRetro have been made available.
Keyphrases
  • ms ms
  • deep learning
  • public health
  • healthcare
  • mental health
  • machine learning