Guiding Generative Protein Language Models with Reinforcement Learning
Filippo Stocco, Maria Artigues-Lleixa, Andrea Hunklinger, Talal Widatalla, Marc Guell, Noelia Ferruz
arXiv:2412.12979 · q-bio.BM · Published 2024-12-17 · Updated 2025-11-27
Protein language models (pLMs) have demonstrated success at generating functional proteins across vast sequence spaces but lack the ability to design high-fitness variants on demand. Here, we iteratively guide pLMs toward user-defined objectives by applying reinforcement learning (RL). We demonstrate that RL can steer pLMs toward various protein properties, such as topologies or binding affinities, in a few iterations through long evolutionary trajectories. We apply our framework to the design of epidermal growth factor receptor (EGFR) binders, achieving a 26-fold increase in binding affinity in two iterations.
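The iterative loop the abstract describes (sample sequences, score them against a user-defined objective, update the generator, repeat) can be illustrated with a deliberately tiny sketch. Everything here is an assumption made for illustration: the "model" is an independent categorical distribution per position rather than a real pLM, `toy_reward` is a hypothetical oracle that scores similarity to an arbitrary target string, and the update is plain REINFORCE with a mean-reward baseline, a generic policy-gradient method that is not claimed to be the authors' algorithm.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_reward(seq, target):
    """Hypothetical oracle: fraction of positions matching `target`."""
    return sum(a == b for a, b in zip(seq, target)) / len(target)


def rl_iteration(logits, target, rng, batch_size, lr):
    """One round: sample a batch, score it, apply a REINFORCE update in place."""
    n = len(AMINO_ACIDS)
    probs = [softmax(row) for row in logits]
    samples = [
        [rng.choices(range(n), weights=p)[0] for p in probs]
        for _ in range(batch_size)
    ]
    rewards = [toy_reward(s, target) for s in samples]
    baseline = sum(rewards) / batch_size  # mean reward reduces variance
    for seq, reward in zip(samples, rewards):
        advantage = reward - baseline
        for pos, residue in enumerate(seq):
            for k in range(n):
                # Gradient of log-softmax w.r.t. logit k: onehot(residue) - p_k
                grad_log_prob = (1.0 if k == residue else 0.0) - probs[pos][k]
                logits[pos][k] += lr * advantage * grad_log_prob / batch_size
    return baseline


def run(target="MKTAYIAK", iterations=60, batch_size=256, lr=20.0, seed=0):
    """Train the toy policy; return mean reward per round and final target probs."""
    rng = random.Random(seed)
    target_idx = [AMINO_ACIDS.index(c) for c in target]
    logits = [[0.0] * len(AMINO_ACIDS) for _ in target_idx]  # uniform start
    history = [
        rl_iteration(logits, target_idx, rng, batch_size, lr)
        for _ in range(iterations)
    ]
    target_probs = [softmax(row)[t] for row, t in zip(logits, target_idx)]
    return history, target_probs
```

Calling `run()` returns the mean reward per round and the final probability the toy policy assigns to each target residue. In a realistic setting the per-position logits would be replaced by a pretrained pLM and the oracle by a structure, topology, or affinity predictor; the target string, batch size, and learning rate above are arbitrary illustrative values.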
Topics: Protein & Biomolecules
Tags: protein-ligand, protein-llm, reinforcement-learning
arXiv categories: q-bio.BM