Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models

Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E. Jones, Nikolaos Bouklas

arXiv:2407.00761·cs.LG·Published 2024-06-30

Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters, and hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_0$ sparsification prior to Stein variational gradient descent ($L_0$+SVGD) is a more robust and efficient means of uncertainty quantification, in terms of computational cost and performance than the direct application of SGVD or projected SGVD methods. Specifically, $L_0$+SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.

TopicsScientific Machine Learning & PINNs

Tagsscientific-machine-learning sciml uncertainty-quantification

arXiv categoriescs.LG, cs.CE

arXiv abstract page PDF