Abstract
Machine learning (ML) methods have become an important tool for modelling and forecasting complex high-dimensional spatiotemporal datasets such as those found in environmental and climate modelling applications. ML approaches can offer a fast, low-cost alternative to short-term forecasting than expensive numerical simulation while addressing a significant outstanding limitation of numerical modelling by being able to robustly and dynamically quantify predictive uncertainty. Low-cost and near-instantaneous forecasting of high-level climate variables has clear applications in early warning systems, nowcasting, and parameterising small-scale locally relevant simulations. This paper presents a novel approach for multi-task spatiotemporal regression by combining data-driven autoencoders with Gaussian Processes (GP) to produce a probabilistic tensor-based regression model. The proposed method is demonstrated for forecasting one-step-ahead temperature and pressure on a global scale simultaneously. By conducting probabilistic regression in the learned latent space, samples can be propagated back to the original feature space to produce uncertainty estimates at a vastly reduced computational cost. The composite GP-autoencoder model was able to simultaneously forecast global temperature and pressure values with average errors of 3.82 °C and 638 hPa, respectively. Further, on average the true values were within the proposed posterior distribution 95.6% of the time illustrating that the model produces a well-calibrated predictive posterior distribution.
Original language | English |
---|---|
Article number | 107536 |
Number of pages | 12 |
Journal | Engineering Applications of Artificial Intelligence |
Volume | 128 |
Early online date | 21 Nov 2023 |
DOIs | |
Publication status | Published - 1 Feb 2024 |
Bibliographical note
© 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).Funder
The authors would like to thank The Scientific Computing Research Technology Platform at the University of Warwick for the computational resources allocated to this study.Keywords
- uncertainty quantification
- Multi-task learning
- Climate forecast
- Gaussian process emulator
- Autoencoder