Login / Signup

When Yield Prediction Does Not Yield Prediction: An Overview of the Current Challenges.

Varvara VoinarovskaMikhail KabeshovDmytro DudenkoSamuel GenhedenIgor V Tetko
Published in: Journal of chemical information and modeling (2023)
Machine Learning (ML) techniques face significant challenges when predicting advanced chemical properties, such as yield, feasibility of chemical synthesis, and optimal reaction conditions. These challenges stem from the high-dimensional nature of the prediction task and the myriad essential variables involved, ranging from reactants and reagents to catalysts, temperature, and purification processes. Successfully developing a reliable predictive model not only holds the potential for optimizing high-throughput experiments but can also elevate existing retrosynthetic predictive approaches and bolster a plethora of applications within the field. In this review, we systematically evaluate the efficacy of current ML methodologies in chemoinformatics, shedding light on their milestones and inherent limitations. Additionally, a detailed examination of a representative case study provides insights into the prevailing issues related to data availability and transferability in the discipline.
Keyphrases
  • machine learning
  • high throughput
  • electronic health record
  • cross sectional
  • single cell
  • highly efficient
  • human health
  • risk assessment
  • transition metal
  • data analysis