Challenges and Guidelines in Deep Generative Protein Design: Four Case Studies
Tianyuan Zheng, Alessandro Rondina, Gos Micklem, Pietro Liò
arXiv:2411.18568·q-bio.BM·Published 2024-11-27·Updated 2025-07-13
Deep generative models show promise for $\textit{de novo}$ protein design, yet reliably producing designs that are geometrically plausible, evolutionarily consistent, functionally relevant, and dynamically stable remains challenging. We present a deep generative modeling pipeline for early $\textit{de novo}$ design of monomeric proteins, based on Score Matching and Flow Matching. We apply this pipeline to four diverse protein families with an adaptable evaluation protocol. Generated structures display realistic, clash-free conformations enriched with family-specific features, and the designed sequences preserve essential functional residues while retaining variability. Molecular dynamics and binding simulations show dynamic stability, with wild-type-like binding pockets that interact favorably with family-specific ligands. These results provide practical guidelines for integrating generative models into protein design workflows.
Topics: Generative Design & Molecule Optimization, Large Language Models & Materials, Protein & Biomolecules, Quantum Chemistry & Force Fields
Tags: drug-discovery, generative-model, molecular-dynamics, protein-structure
arXiv categories: q-bio.BM