Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI

Turab Lookman, YuJie Liu, Zhibin Gao

arXiv:2601.00742 · physics.comp-ph · Published 2026-01-02 · Updated 2026-01-09

This perspective explores the evolution of materials informatics, from its foundational roots in physics and information theory to its maturation through artificial intelligence (AI). We trace the field's trajectory from early milestones to the transformative impact of the Materials Genome Initiative and the recent advent of large language models (LLMs). Rather than a mere toolkit, we present materials informatics as an evolving ecosystem, reviewing key methodologies such as Bayesian optimization, reinforcement learning, and Transformers that drive inverse design and autonomous (self-driving) laboratories. We specifically address the practical challenges of LLM integration, comparing specialist and generalist models and discussing solutions for uncertainty quantification. Looking forward, we assess the transition of AI from a predictive tool to a collaborative research partner. By leveraging active learning and retrieval-augmented generation (RAG), the field is moving toward a new era of autonomous materials science, increasingly characterized by "human-out-of-the-loop" discovery processes.

Topics: Large Language Models & Materials, Quantum Chemistry & Force Fields

Tags: active-learning, bayesian-optimization, materials-discovery, materials-science, reinforcement-learning

arXiv categories: physics.comp-ph
