Calibrating Generative Models to Distributional Constraints

Henry D. Smith, Nathaniel L. Diamant, Brian L. Trippe

arXiv:2510.10020·stat.ML·Published 2025-10-11·Updated 2026-01-17

Generative models frequently suffer miscalibration, wherein statistics of the sampling distribution such as class probabilities deviate from desired values. We frame calibration as a constrained optimization problem and seek the closest model in Kullback-Leibler divergence satisfying calibration constraints. To address the intractability of imposing these constraints exactly, we introduce two surrogate objectives for fine-tuning: (1) the relax loss, which replaces the constraint with a miscalibration penalty, and (2) the reward loss, which converts calibration into a reward fine-tuning problem. We demonstrate that these approaches substantially reduce calibration error across hundreds of simultaneous constraints and models with up to one billion parameters, spanning applications in protein design, image generation, and language modeling.

TopicsGenerative Design & Molecule Optimization, Protein & Biomolecules

Tagsgenerative-model protein-structure

arXiv categoriesstat.ML, cs.LG, q-bio.BM

arXiv abstract pagePDF