Optimal Development of Transferable Machine Learning Interatomic Potentials using Active Learning
ORAL
Abstract
Automated methods for generating atomistic configurations make possible the creation of vast and diverse datasets where potentials that exhibit consistent accuracy across diverse configurations can be trained. However, automated generation frameworks struggle to create configurations of physical relevance to the potential development task of interest. For that reason, a training dataset that combines configurations generated using automated frameworks and domain expertise is expected to yield better performing potentials. Nevertheless, integrating configurations from these two frameworks is not a trivial task given the fact that data-driven methods often yield a very large number of configurations, which places a severe computational burden for calculating the results, and domain-expertise methods are not capable of scaling in order to generate a vast number of configurations. Consequently, there is a critical need for an enhanced training protocol that can integrate both types of configurations in an optimized and data-driven manner. This work addresses this challenge by establishing an automated protocol to train a potential using an ensemble of neural network based potentials and active learning. The developed protocol trains the potential iteratively by automatically incorporating a fixed set of configurations that maximize the information gained.
*This work was supported by the Laboratory Directed Research and Development program at Sandia National Laboratories. Sandia National Laboratories is a multi-mission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC., a wholly owned subsidiary of Honeywell International, Inc., for the U.S. Department of Energy National Nuclear Security Administration under contract DE-NA0003525. The views expressed in the article do not necessarily represent the views of the U.S. Department of Energy or the United States Government. SAND2022-14565 A
–
Presenters
-
David O Montes de Oca Zapiain
- Sandia National laboratories