Determining Atomic Structure from Spectroscopy via an Active Learning Framework

Ian Slagle, Faisal Alamgir, Victor Fung

arXiv:2602.20959·cond-mat.mtrl-sci·Published 2026-02-24

Determining atomic structure from spectroscopic data is central to materials science but remains restricted to a limited set of techniques and material classes, largely due to the computational cost and complexity of structural refinement. Here we introduce ActiveStructOpt, a general framework that integrates graph neural network surrogate models with active learning to efficiently determine candidate structures that reproduce target spectra with minimal computational expenditure. Benchmarking with X-ray pair distribution function data, and with the more computationally demanding simulations of X-ray absorption near-edge spectra (XANES) and extended X-ray absorption fine structure (EXAFS), demonstrate that ActiveStructOpt reliably determines structures that match closely in spectra across diverse materials classes. Under equivalent computational budgets, ActiveStructOpt outperforms existing structure determination methods. By enabling data-efficient, multi-objective structural refinement across a broad range of computable spectroscopic techniques, ActiveStructOpt provides a flexible and extensible approach to atomic structure determination in complex materials.

TopicsMolecular Representation & Learning, Quantum Chemistry & Force Fields

Tagsactive-learning gnn materials-science

arXiv categoriescond-mat.mtrl-sci

arXiv abstract pagePDF