Login / Signup

Collaborative Assessment of Molecular Geometries and Energies from the Open Force Field.

Lorenzo D'AmoreDavid F HahnDavid L DotsonJoshua T HortonJamshed AnwarIan CraigThomas FoxAlberto GobbiSirish Kaushik LakkarajuXavier LucasKatharina MeierDavid L MobleyArjun NarayananChristina E M SchindlerWilliam C SwopePieter J In 't VeldJeffrey WagnerBai XueGary John Tresadern
Published in: Journal of chemical information and modeling (2022)
Force fields form the basis for classical molecular simulations, and their accuracy is crucial for the quality of, for instance, protein-ligand binding simulations in drug discovery. The huge diversity of small-molecule chemistry makes it a challenge to build and parameterize a suitable force field. The Open Force Field Initiative is a combined industry and academic consortium developing a state-of-the-art small-molecule force field. In this report, industry members of the consortium worked together to objectively evaluate the performance of the force fields (referred to here as OpenFF) produced by the initiative on a combined public and proprietary dataset of 19,653 relevant molecules selected from their internal research and compound collections. This evaluation was important because it was completely blind; at most partners, none of the molecules or data were used in force field development or testing prior to this work. We compare the Open Force Field "Sage" version 2.0.0 and "Parsley" version 1.3.0 with GAFF-2.11-AM1BCC, OPLS4, and SMIRNOFF99Frosst. We analyzed force-field-optimized geometries and conformer energies compared to reference quantum mechanical data. We show that OPLS4 performs best, and the latest Open Force Field release shows a clear improvement compared to its predecessors. The performance of established force fields such as GAFF-2.11 was generally worse. While OpenFF researchers were involved in building the benchmarking infrastructure used in this work, benchmarking was done entirely in-house within industrial organizations and the resulting assessment is reported here. This work assesses the force field performance using separate benchmarking steps, external datasets, and involving external research groups. This effort may also be unique in terms of the number of different industrial partners involved, with 10 different companies participating in the benchmark efforts.
Keyphrases
  • single molecule
  • small molecule
  • quality improvement
  • drug discovery
  • healthcare
  • machine learning
  • hepatitis c virus
  • data analysis