Spatially Resolved Uncertainties for Machine Learning Potentials.
Esther HeidJohannes SchörghuberRalf WanzenböckGeorg K H MadsenPublished in: Journal of chemical information and modeling (2024)
Machine learning potentials have become an essential tool for atomistic simulations, yielding results close to ab initio simulations at a fraction of computational cost. With recent improvements on the achievable accuracies, the focus has now shifted on the data set composition itself. The reliable identification of erroneously predicted configurations to extend a given data set is therefore of high priority. Yet, uncertainty estimation techniques have achieved mixed results for machine learning potentials. Consequently, a general and versatile method to correlate energy or atomic force uncertainties with the model error has remained elusive to date. In the current work, we show that epistemic uncertainty cannot correlate with model error by definition but can be aggregated over groups of atoms to yield a strong correlation. We demonstrate that our method correctly estimates prediction errors both globally per structure and locally resolved per atom. The direct correlation of local uncertainty and local error is used to design an active learning framework based on identifying local subregions of a large simulation cell and performing ab initio calculations only for the subregion subsequently. We successfully utilized this method to perform active learning in the low-data regime for liquid water.
Keyphrases
- machine learning
- big data
- molecular dynamics
- electronic health record
- artificial intelligence
- monte carlo
- molecular dynamics simulations
- single cell
- density functional theory
- stem cells
- deep learning
- data analysis
- emergency department
- patient safety
- mesenchymal stem cells
- cell therapy
- quality improvement
- virtual reality