PepALD: Macrocyclic Peptide Generation via Autoregressive Latent Diffusion

Junming Zhang, Siyu Yi, Wei Ju, Zhonghui Gu

arXiv:2606.14510 · cs.LG · Published 2026-06-12

Macrocyclic peptides are promising therapeutic candidates for intracellular targets, but their design requires simultaneous control over non-natural monomer chemistry, ring topology, membrane permeability, and target binding. Existing SMILES- or HELM-string generative models either operate in long atom-level sequence spaces or treat monomers as symbolic tokens with limited chemical grounding. We introduce PepALD, an Autoregressive Latent Diffusion (ALD) foundation model for de novo macrocyclic peptide generation. The model represents HELM monomers with structured chemical embeddings, generates each residue through context-conditioned diffusion in a chemically informed latent space, predicts R-group-aware ring closures during autoregressive generation, and aligns the denoiser to affinity rewards using winner-protected diffusion-adapted preference optimization. In silico experiments demonstrate PepALD's generation quality and reward-optimization performance against representative peptide generation baselines.
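
As a rough illustration of what "autoregressive latent diffusion" can mean in general, the toy sketch below generates a sequence one position at a time, running a short deterministic (DDIM-style) denoising loop in a small latent space for each position and then snapping the result to the nearest entry of a codebook. It is not PepALD's implementation. The 2-D `codebook`, the `alpha_bars` noise schedule, the dot-product context prior, and all function names are assumptions made for this example, and the learned denoiser is replaced by a closed-form posterior mean over the codebook. Ring-closure prediction and preference optimization are not modeled.

```python
import math
import random

def predict_x0(z_t, ab_t, codebook, prior_logits):
    """Closed-form stand-in for a learned denoiser (assumption): the posterior
    mean of the clean latent when clean latents are exactly codebook rows."""
    logits = []
    for emb, prior in zip(codebook, prior_logits):
        sq = sum((zi - math.sqrt(ab_t) * ei) ** 2 for zi, ei in zip(z_t, emb))
        logits.append(prior - sq / (2.0 * (1.0 - ab_t)))
    top = max(logits)
    weights = [math.exp(l - top) for l in logits]  # stable softmax numerators
    total = sum(weights)
    dim = len(z_t)
    return [sum(w * emb[j] for w, emb in zip(weights, codebook)) / total
            for j in range(dim)]

def sample_residue(context, codebook, alpha_bars, rng):
    """Denoise one latent from Gaussian noise, conditioned on a context vector."""
    # Placeholder conditioning (assumption): prior logit = context . embedding.
    prior_logits = [sum(c * e for c, e in zip(context, emb)) for emb in codebook]
    z = [rng.gauss(0.0, 1.0) for _ in context]
    for i in range(len(alpha_bars) - 1, -1, -1):
        ab_t = alpha_bars[i]
        ab_prev = alpha_bars[i - 1] if i > 0 else 1.0  # final step is noise-free
        x0 = predict_x0(z, ab_t, codebook, prior_logits)
        eps = [(zi - math.sqrt(ab_t) * xi) / math.sqrt(1.0 - ab_t)
               for zi, xi in zip(z, x0)]
        z = [math.sqrt(ab_prev) * xi + math.sqrt(1.0 - ab_prev) * ei
             for xi, ei in zip(x0, eps)]
    return z

def generate(length, codebook, alpha_bars, seed=0):
    """Autoregressive loop: each position is sampled given the earlier ones."""
    rng = random.Random(seed)
    dim = len(codebook[0])
    chosen, tokens = [], []
    for _ in range(length):
        if chosen:
            # Context is the mean of previously chosen embeddings (assumption).
            context = [sum(v[j] for v in chosen) / len(chosen) for j in range(dim)]
        else:
            context = [0.0] * dim
        z = sample_residue(context, codebook, alpha_bars, rng)
        # Decode by nearest codebook row in squared Euclidean distance.
        k = min(range(len(codebook)),
                key=lambda idx: sum((a - b) ** 2 for a, b in zip(codebook[idx], z)))
        tokens.append(k)
        chosen.append(codebook[k])
    return tokens

# Toy setup: four hypothetical "monomer" embeddings and a 10-step schedule
# whose cumulative signal level falls from 0.99 to 0.01.
codebook = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
alpha_bars = [0.99 - i * (0.98 / 9) for i in range(10)]
tokens = generate(6, codebook, alpha_bars, seed=0)
print(tokens)
```

The point of the sketch is the control flow: diffusion runs inside each autoregressive step, so the sequence length stays at the residue level while each residue is still drawn from a continuous latent space.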

Topics: Generative Design & Molecule Optimization

Tags: drug-discovery, generative-model

arXiv categories: cs.LG
