MACE4IRmol: An uncertainty-aware foundation model for molecular infrared spectroscopy
Nitik Bhatia, Ondrej Krejci, Silvana Botti, Patrick Rinke, Miguel A. L. Marques
arXiv:2508.19118·physics.chem-ph·Published 2025-08-26·Updated 2026-03-09
Machine-learned interatomic potentials (MLIPs) have shown significant promise in predicting infrared spectra with high fidelity. However, the absence of general-purpose MLIPs that simultaneously span broad chemical diversity and provide reliable uncertainty estimates has limited their wider applicability. In this work, we introduce MACE4IRmol, an uncertainty-aware foundation model ensemble built on the MACE architecture. MACE4IRmol is trained on ~16 million molecular geometries and the corresponding density-functional theory (DFT) energies, forces, and dipole moments from the QCML dataset. The training data encompasses approximately 80 elements and a diverse set of molecules, including organic and inorganic compounds, and metal complexes. Importantly, MACE4IRmol is formulated as an ensemble of models to enable uncertainty quantification, which helps improve robustness in chemically diverse systems. Within this ensemble, separate models are trained with and without explicit dispersion corrections, allowing systematic assessment of van der Waals effects. In addition, MACE4IRmol delivers accurate predictions of energies, forces, dipole moments, and infrared spectra at a fraction of the computational cost of DFT, while enabling the explicit inclusion of nuclear quantum effects in infrared spectrum simulations. By combining generality, accuracy, efficiency, and uncertainty estimation, MACE4IRmol opens the door to rapid and reliable infrared spectra prediction for complex and diverse molecular systems.
TopicsUncertainty Quantification & Bayesian Methods
Tagsdft uncertainty-quantification
arXiv categoriesphysics.chem-ph
arXiv abstract pagePDF