Pushing the Accuracy Limit of Foundation Neural Network Models with Quantum Monte Carlo Forces and Path Integrals

Anouar Benali, Thomas Plé, Olivier Adjoua, Valay Agarawal, Thomas Applencourt, Marharyta Blazhynska, Raymond Clay, Kevin Gasperich, Khalid Hossain, Jeongnim Kim, Christopher Knight, Jaron T. Krogel, Yvon Maday, Maxime Maria, Matthieu Montes, Ye Luo, Evgeny Posenitskiy, Corentin Villot, Venkatram Vishwanath, Louis Lagardère, Jean-Philip Piquemal

arXiv:2504.07948 · physics.chem-ph · Published 2025-04-10 · Updated 2025-04-14

We propose an end-to-end integrated strategy for producing highly accurate quantum chemistry (QC) synthetic datasets (energies and forces) aimed at deriving Foundation Machine Learning models for molecular simulation. Starting from Density Functional Theory (DFT), a "Jacob's Ladder" approach leverages computationally optimized layers of massively GPU-accelerated software with increasing accuracy. Thanks to Exascale computing, this is the first time that computationally intensive Quantum Monte Carlo (QMC) forces, and the combination of multi-determinant QMC energies and forces with selected Configuration Interaction wavefunctions, have been computed at such scale at the complete basis-set limit. To bridge the gap between accurate QC and condensed-phase Molecular Dynamics, we leverage transfer learning to improve the DFT-based FeNNix-Bio1 foundation model. The resulting approach is coupled to path-integral adaptive-sampling quantum dynamics to perform nanosecond reactive simulations at unprecedented accuracy. These results demonstrate the promise of Exascale computing to deepen our understanding of the inner machinery of complex biosystems.

Topics: Quantum Chemistry & Force Fields

Tags: dft, molecular-dynamics, monte-carlo, quantum-chemistry

arXiv categories: physics.chem-ph
