Machine-Learning Based Stacked Ensemble Model for Accurate Analysis of Molecular Dynamics Simulations.
Samrendra K SinghKarteek K BejagamYaxin AnSanket A DeshmukhPublished in: The journal of physical chemistry. A (2019)
Accurate, faster, and on-the-fly analysis of the molecular dynamics (MD) simulations trajectory becomes very critical during the discovery of new materials or while developing force-field parameters due to automated nature of these processes. Here to overcome the drawbacks of algorithm based analysis approaches, we have developed and utilized an approach that integrates machine-learning (ML) based stacked ensemble model (SEM) with MD simulations, for the first time. As a proof-of-concept, two SEMs were developed to analyze two dynamical properties of a water droplet, its contact angle, and hydrogen bonds. The two SEMs consisted of two layered networks of random forest, artificial neural network, support vector regression, Kernel ridge regression, and k-nearest neighbors ML models. The root-mean-square error values, uncertainty quantification, and sensitivity analysis of both the SEMs suggested that the final result was more accurate as compared to that of the individual ML models. This new computational framework is very general, robust, and has a huge potential in analyzing large size MD simulation trajectories as it can capture critical information very accurately.
Keyphrases
- molecular dynamics
- neural network
- machine learning
- molecular dynamics simulations
- density functional theory
- high resolution
- high throughput
- artificial intelligence
- deep learning
- big data
- molecular docking
- depressive symptoms
- small molecule
- convolutional neural network
- climate change
- single cell
- single molecule
- healthcare
- gold nanoparticles
- drosophila melanogaster