Mol-CADiff: Causality-Aware Autoregressive Diffusion for Molecule Generation

Md Atik Ahamed, Qiang Ye, Qiang Cheng

arXiv:2503.05499·cs.LG·Published 2025-03-07

The design of novel molecules with desired properties is a key challenge in drug discovery and materials science. Traditional methods rely on trial-and-error, while recent deep learning approaches have accelerated molecular generation. However, existing models struggle with generating molecules based on specific textual descriptions. We introduce Mol-CADiff, a novel diffusion-based framework that uses causal attention mechanisms for text-conditional molecular generation. Our approach explicitly models the causal relationship between textual prompts and molecular structures, overcoming key limitations in existing methods. We enhance dependency modeling both within and across modalities, enabling precise control over the generation process. Our extensive experiments demonstrate that Mol-CADiff outperforms state-of-the-art methods in generating diverse, novel, and chemically valid molecules, with better alignment to specified properties, enabling more intuitive language-driven molecular design.

TopicsGenerative Design & Molecule Optimization, Large Language Models & Materials

Tagsdrug-discovery materials-science molecular-generation

arXiv categoriescs.LG

arXiv abstract pagePDF