Login / Signup

Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.

Olivia RiedlingAllison S WalkerAntonis Rokas
Published in: Microbiology spectrum (2024)
Fungi are key sources of natural products and iconic drugs, including penicillin and statins. DNA sequencing has revealed that there are likely millions of biosynthetic pathways in fungal genomes, but the chemical structures and bioactivities of >99% of natural products produced by these pathways remain unknown. We used artificial intelligence to predict the bioactivities of diverse fungal biosynthetic pathways. We found that the accuracies of our predictions were generally low, between 51% and 68%, likely because the natural products and bioactivities of only very few fungal pathways are known. With >15,000 characterized fungal natural products, millions of putative biosynthetic pathways present in fungal genomes, and increased demand for novel drugs, our study suggests that there is an urgent need for efforts that systematically identify fungal biosynthetic pathways, their natural products, and their bioactivities.
Keyphrases
  • artificial intelligence
  • cell wall
  • big data
  • machine learning
  • cardiovascular disease
  • type diabetes
  • deep learning
  • high resolution
  • copy number
  • genome wide
  • transcription factor