Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning

Hyunsoo Park, Aron Walsh

arXiv:2511.07158·cs.LG·Published 2025-11-10

Discovering functional crystalline materials entails navigating an immense combinatorial design space. While recent advances in generative artificial intelligence have enabled the sampling of chemically plausible compositions and structures, a fundamental challenge remains: the objective misalignment between likelihood-based sampling in generative modelling and targeted focus on underexplored regions where novel compounds reside. Here, we introduce a reinforcement learning framework that guides latent denoising diffusion models toward diverse and novel, yet thermodynamically viable crystalline compounds. Our approach integrates group relative policy optimisation with verifiable, multi-objective rewards that jointly balance creativity, stability, and diversity. Beyond de novo generation, we demonstrate enhanced property-guided design that preserves chemical validity, while targeting desired functional properties. This approach establishes a modular foundation for controllable AI-driven inverse design that addresses the novelty-validity trade-off across scientific discovery applications of generative models.

TopicsGenerative Models & Discovery

Tagsdiffusion-models scientific-discovery

arXiv categoriescs.LG, physics.comp-ph

arXiv abstract pagePDF