MILCDock: Machine Learning Enhanced Consensus Docking for Virtual Screening in Drug Discovery.
Connor J MorrisJacob A SternBrenden StarkMax ChristophersonDennis Della CortePublished in: Journal of chemical information and modeling (2022)
Molecular docking tools are regularly used to computationally identify new molecules in virtual screening for drug discovery. However, docking tools suffer from inaccurate scoring functions with widely varying performance on different proteins. To enable more accurate ranking of active over inactive ligands in virtual screening, we created a machine learning consensus docking tool, MILCDock, that uses predictions from five traditional molecular docking tools to predict the probability a ligand binds to a protein. MILCDock was trained and tested on data from both the DUD-E and LIT-PCBA docking datasets and shows improved performance over traditional molecular docking tools and other consensus docking methods on the DUD-E dataset. LIT-PCBA targets proved to be difficult for all methods tested. We also find that DUD-E data, although biased, can be effective in training machine learning tools if care is taken to avoid DUD-E's biases during training.
Keyphrases
- molecular docking
- molecular dynamics simulations
- drug discovery
- machine learning
- protein protein
- molecular dynamics
- big data
- artificial intelligence
- small molecule
- healthcare
- electronic health record
- clinical practice
- palliative care
- quality improvement
- mass spectrometry
- virtual reality
- pain management
- health insurance
- data analysis