Infinity-norm-based Input-to-State-Stable Long Short-Term Memory networks: a thermal systems perspective
Stefano De Carli, Davide Previtali, Leandro Pitturelli, Mirko Mazzoleni, Antonio Ferramosca, Fabio Previdi
arXiv:2503.11553·math.OC·Published 2025-03-14·Updated 2025-04-10
Recurrent Neural Networks (RNNs) have shown remarkable performance in system identification, particularly for nonlinear dynamical systems such as thermal processes. However, stability remains a critical challenge in practical applications: although the underlying process may be intrinsically stable, there may be no guarantee that the resulting RNN model captures this behavior. This paper addresses the stability issue by deriving a sufficient condition for Input-to-State Stability based on the infinity-norm (ISS$_{\infty}$) for Long Short-Term Memory (LSTM) networks. The obtained condition depends on fewer network parameters compared to prior works. An ISS$_{\infty}$-promoted training strategy is developed, incorporating a penalty term in the loss function that encourages stability and an ad hoc early stopping approach. The quality of LSTM models trained via the proposed approach is validated on a thermal system case study, where the ISS$_{\infty}$-promoted LSTM outperforms both a physics-based model and an ISS$_{\infty}$-promoted Gated Recurrent Unit (GRU) network while also surpassing non-ISS$_{\infty}$-promoted LSTM and GRU RNNs.
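The training strategy summarized above combines a stability-encouraging penalty with a tailored early stopping rule. The abstract does not state the ISS$_{\infty}$ condition itself, so the following is only a minimal sketch of the general pattern, not the authors' method. It assumes the sufficient condition can be written as a set of inequalities `r < 0` on hypothetical residuals computed from the network weights; the names `penalized_loss`, `should_stop`, `rho`, `margin`, and `patience` are illustrative assumptions.

```python
def penalized_loss(mse, residuals, rho=1.0, margin=0.0):
    """Fit error plus a hinge penalty on violated stability constraints.

    Assumption: each residual r encodes one inequality of the form r < 0.
    Satisfied constraints (r + margin <= 0) contribute nothing.
    """
    penalty = sum(max(0.0, r + margin) for r in residuals)
    return mse + rho * penalty


def should_stop(val_history, residuals, patience=3):
    """Stop only when every constraint holds and validation error has stalled.

    Assumption: val_history is a list of validation errors, one per epoch.
    """
    if any(r >= 0.0 for r in residuals):
        return False  # stability condition not yet satisfied: keep training
    if len(val_history) <= patience:
        return False  # not enough epochs to judge a stall
    best_before = min(val_history[:-patience])
    # Stalled if none of the last `patience` epochs beat the earlier best.
    return min(val_history[-patience:]) >= best_before
```

In this sketch the penalty vanishes once all residuals are negative, so the optimizer is then driven by the fit error alone, and the stopping rule refuses to terminate on a model that still violates the assumed condition.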
Topics: Dynamical Systems & PDE Learning
Tags: dynamical-systems
arXiv categories: math.OC, stat.ML