Improving the performance of Stein variational inference through extreme sparsification of physically-constrained neural network models
Govinda Anantha Padmanabha, Jan Niklas Fuhg, Cosmin Safta, Reese E. Jones, Nikolaos Bouklas
arXiv:2407.00761 · cs.LG · Published 2024-06-30
Most scientific machine learning (SciML) applications of neural networks involve hundreds to thousands of parameters; hence, uncertainty quantification for such models is plagued by the curse of dimensionality. Using physical applications, we show that $L_0$ sparsification prior to Stein variational gradient descent ($L_0$+SVGD) is a more robust and efficient means of uncertainty quantification, in terms of both computational cost and performance, than the direct application of SVGD or projected SVGD methods. Specifically, $L_0$+SVGD demonstrates superior resilience to noise, the ability to perform well in extrapolated regions, and a faster convergence rate to an optimal solution.
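The two-stage idea in the abstract (sparsify first, then run SVGD only over the parameters that survive) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the boolean `mask` is a hand-picked stand-in for the outcome of an $L_0$ sparsification step (which is not implemented), the target is a toy Gaussian posterior with mean `mu` over the retained parameters, and the RBF kernel with a median-heuristic bandwidth is one common choice for SVGD rather than necessarily the authors' setting.

```python
import numpy as np

def svgd_step(x, grad_logp, step=0.1):
    """One SVGD update for particles x of shape (n, d) with an RBF kernel."""
    n = x.shape[0]
    diff = x[:, None, :] - x[None, :, :]           # diff[i, j] = x_i - x_j
    sq = np.sum(diff ** 2, axis=-1)                # squared pairwise distances
    h = np.median(sq) / np.log(n + 1) + 1e-8       # median-heuristic bandwidth
    k = np.exp(-sq / h)                            # kernel matrix k(x_i, x_j)
    attract = k @ grad_logp(x)                     # kernel-weighted score term
    # Repulsive term: sum_j grad_{x_j} k(x_j, x_i) = (2/h) sum_j k_ij (x_i - x_j)
    repulse = (2.0 / h) * np.sum(k[:, :, None] * diff, axis=1)
    return x + step * (attract + repulse) / n

# Hypothetical sparsity pattern: 2 of 6 parameters survive pruning (assumption).
mask = np.array([True, False, False, True, False, False])
mu = np.array([1.0, -2.0])                         # toy posterior mean (assumption)

def grad_logp(x):
    # Score of the toy target N(mu, I), defined over retained parameters only.
    return -(x - mu)

rng = np.random.default_rng(0)
particles = rng.normal(size=(50, int(mask.sum())))  # SVGD in the reduced space
for _ in range(1000):
    particles = svgd_step(particles, grad_logp)

# Embed the particles back into the full parameter vector; pruned entries stay 0.
theta_full = np.zeros((particles.shape[0], mask.size))
theta_full[:, mask] = particles
```

In this sketch the particle ensemble lives in a 2-dimensional space rather than the full 6-dimensional one, so the kernel distances and the repulsive term are computed only over retained parameters; that dimension reduction is the mechanism by which sparsification is expected to ease the curse of dimensionality for SVGD.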
Topics: Scientific Machine Learning & PINNs
Tags: scientific-machine-learning, sciml, uncertainty-quantification
arXiv categories: cs.LG, cs.CE