Guiding Generative Protein Language Models with Reinforcement Learning

Filippo Stocco, Maria Artigues-Lleixa, Andrea Hunklinger, Talal Widatalla, Marc Guell, Noelia Ferruz

arXiv:2412.12979·q-bio.BM·Published 2024-12-17·Updated 2025-11-27

Protein language models (pLMs) have demonstrated success at generating functional proteins across vast sequence spaces but lack the ability to design high-fitness variants on demand. Here, we iteratively guide pLMs toward user-defined objectives by applying reinforcement learning (RL). We demonstrate that RL can steer pLMs toward various protein properties, such as topologies or binding affinities, in a few iterations through long evolutionary trajectories. We apply our framework to the design of epidermal growth factor receptor (EGFR) binders, achieving a 26-fold increase in binding affinity in two iterations.

TopicsProtein & Biomolecules

Tagsprotein-ligand protein-llm reinforcement-learning

arXiv categoriesq-bio.BM

arXiv abstract page PDF