Tokenised Flow Matching for Hierarchical Simulation Based Inference
Giovanni Charles, Cosmo Santoni, Seth Flaxman, Elizaveta Semenova
arXiv:2604.20723 · cs.LG · Published 2026-04-22
The cost of simulator evaluations is a key practical bottleneck for simulation-based inference (SBI). In hierarchical settings with shared global parameters and exchangeable site-level parameters and observations, this structure can be exploited to improve simulation efficiency. Existing hierarchical SBI approaches factorise the posterior yet still simulate across multiple sites per training sample; we instead explore likelihood factorisation (LF) to train from single-site simulations. In LF sampling, we learn a per-site neural surrogate of the simulator and then assemble synthetic multi-site observations to amortise inference for the full hierarchical posterior. Building on this, we propose Tokenised Flow Matching for Posterior Estimation (TFMPE), a tokenised flow matching approach that supports function-valued observations through likelihood factorisation. To enable systematic evaluation, we introduce a benchmark for hierarchical SBI. We validate TFMPE on this benchmark and on realistic infectious disease and computational fluid dynamics models, finding well-calibrated posteriors while reducing computational cost.
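The assembly step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian priors, the function names, and the toy `per_site_surrogate` (which stands in for a learned neural surrogate of the single-site simulator) are all assumptions made purely for illustration. It shows only how one draw of the global parameter and several exchangeable site parameters could be combined with a per-site model to form a synthetic multi-site observation.

```python
import numpy as np

def sample_global(rng):
    # Hypothetical prior over the shared global parameter: theta ~ N(0, 1).
    return rng.normal(0.0, 1.0)

def sample_sites(rng, theta, n_sites):
    # Hypothetical exchangeable site-level prior: phi_i | theta ~ N(theta, 0.5^2).
    return rng.normal(theta, 0.5, size=n_sites)

def per_site_surrogate(rng, theta, phi_i, n_obs):
    # Stand-in for a learned surrogate of p(y_i | theta, phi_i).
    # A toy Gaussian is used here so the sketch runs without any training.
    return rng.normal(phi_i, 1.0, size=n_obs)

def assemble_multi_site(rng, n_sites, n_obs):
    # Draw parameters once, then call the per-site model independently per site.
    theta = sample_global(rng)
    phi = sample_sites(rng, theta, n_sites)
    y = np.stack([per_site_surrogate(rng, theta, p, n_obs) for p in phi])
    return theta, phi, y

rng = np.random.default_rng(0)
theta, phi, y = assemble_multi_site(rng, n_sites=4, n_obs=5)
print(phi.shape, y.shape)
```

Because each site is generated by an independent call to the per-site model, the joint likelihood of the assembled observation factorises over sites, which is the property that allows training data to come from single-site simulations in this kind of setup.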
Topics: Fluid Dynamics & Plasma Physics
Tags: computational-fluid-dynamics
arXiv categories: cs.LG, cs.AI