Login / Signup

Explaining and avoiding failure modes in goal-directed generation of small molecules.

Maxime LangevinRodolphe VuilleumierMarc Bianciotto
Published in: Journal of cheminformatics (2022)
Despite growing interest and success in automated in-silico molecular design, questions remain regarding the ability of goal-directed generation algorithms to perform unbiased exploration of novel chemical spaces. A specific phenomenon has recently been highlighted: goal-directed generation guided with machine learning models produce molecules with high scores according to the optimization model, but low scores according to control models, even when trained on the same data distribution and the same target. In this work, we show that this worrisome behavior is actually due to issues with the predictive models and not the goal-directed generation algorithms. We show that with appropriate predictive models, this issue can be resolved, and molecules generated have high scores according to both the optimization and the control models.
Keyphrases
  • machine learning
  • deep learning
  • big data
  • artificial intelligence
  • high throughput
  • molecular docking
  • electronic health record
  • body composition
  • resistance training
  • molecular dynamics simulations
  • single cell