SQM2.20: Semiempirical quantum-mechanical scoring function yields DFT-quality protein-ligand binding affinity predictions in minutes.
Adam PecinaJindřich FanfrlíkMartin LepšíkJan ŘezáčPublished in: Nature communications (2024)
Accurate estimation of protein-ligand binding affinity is the cornerstone of computer-aided drug design. We present a universal physics-based scoring function, named SQM2.20, addressing key terms of binding free energy using semiempirical quantum-mechanical computational methods. SQM2.20 incorporates the latest methodological advances while remaining computationally efficient even for systems with thousands of atoms. To validate it rigorously, we have compiled and made available the PL-REX benchmark dataset consisting of high-resolution crystal structures and reliable experimental affinities for ten diverse protein targets. Comparative assessments demonstrate that SQM2.20 outperforms other scoring methods and reaches a level of accuracy similar to much more expensive DFT calculations. In the PL-REX dataset, it achieves excellent correlation with experimental data (average R 2 = 0.69) and exhibits consistent performance across all targets. In contrast to DFT, SQM2.20 provides affinity predictions in minutes, making it suitable for practical applications in hit identification or lead optimization.
Keyphrases
- density functional theory
- high resolution
- molecular dynamics
- protein protein
- binding protein
- molecular docking
- magnetic resonance
- emergency department
- mass spectrometry
- capillary electrophoresis
- small molecule
- machine learning
- quality improvement
- crystal structure
- monte carlo
- transcription factor
- energy transfer
- drug induced