MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model

Haoyu Qin, Yungang Chen, Qianchuan Jiang, Pengchao Sun, Xiancai Ye, Chao Lin

arXiv:2408.06400·physics.ao-ph·Published 2024-08-12·Updated 2024-08-14

Deep Learning based Weather Prediction (DLWP) models have been improving rapidly over the last few years, surpassing state of the art numerical weather forecasts by significant margins. While much of the optimization effort is focused on training curriculum to extend forecast range in the global context, two aspects remains less explored: limited area modeling and better backbones for weather forecasting. We show in this paper that MetMamba, a DLWP model built on a state-of-the-art state-space model, Mamba, offers notable performance gains and unique advantages over other popular backbones using traditional attention mechanisms and neural operators. We also demonstrate the feasibility of deep learning based limited area modeling via coupled training with a global host model.

TopicsScientific Machine Learning & PINNs

Tagsneural-operators weather-forecasting

arXiv categoriesphysics.ao-ph, cs.LG

arXiv abstract pagePDF