AtmoRep

A stochastic model of atmosphere dynamics using large scale representation learning

Representation Learning in Earth System Science

"AtmoRep uses large-scale representation learning from artificial intelligence to determine a general description of the highly complex, stochastic dynamics of the atmosphere."

The atmosphere affects humans in a multitude of ways, from loss of life due to adverse weather effects to long-term social and economic impacts on societies. Computer simulations of atmospheric dynamics are, therefore, of great importance for the well-being of our and future generations. Here, we propose AtmoRep, a novel, task-independent stochastic computer model of atmospheric dynamics that can provide skillful results for a wide range of applications. This is enabled by a novel self-supervised learning objective and a unique ensemble that samples from the stochastic model with a variability informed by the one in the historical record. Our work establishes that large-scale neural networks can provide skillful, task-independent models of atmospheric dynamics. With this, they provide a novel means to make the large record of atmospheric observations accessible for applications and for scientific inquiry, complementing existing simulations based on first principles.

Short-term weather forecasting

The short-term forecasting skill of the pre-trained AtmoRep model is being tested against the numerical weather prediction model from ECMWF (IFS), as well as the main machine-learning-based forecasting algorithms now available.

Downscaling

Downscaling, also known as super-resolution in computer vision, aims to increase the resolution of a coarse spatial input. In particular, we use the COSMO REA6 reanalysis dataset to obtain downscaled AtmoRep output.
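To make the task concrete, the following is a minimal sketch of the simplest possible downscaling baseline, nearest-neighbour upsampling of a coarse grid. It is not the AtmoRep method, which uses a learned model; the function name `upsample_nearest`, the `Grid` alias, and the toy values are hypothetical and chosen only for illustration.

```python
from typing import List

Grid = List[List[float]]  # hypothetical alias: rows of grid-point values


def upsample_nearest(coarse: Grid, factor: int) -> Grid:
    """Repeat every grid cell `factor` times along both axes."""
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    fine: Grid = []
    for row in coarse:
        # Repeat each value along the longitude (column) axis.
        fine_row = [value for value in row for _ in range(factor)]
        # Repeat the widened row along the latitude (row) axis.
        for _ in range(factor):
            fine.append(list(fine_row))
    return fine


# Toy 2 x 2 coarse field (illustrative values only).
coarse = [[1.0, 2.0], [3.0, 4.0]]
fine = upsample_nearest(coarse, 2)  # 4 x 4 field
```

A baseline like this adds no new fine-scale information; a learned downscaling model is expected to improve on it by inferring realistic small-scale structure from high-resolution training data.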

Bias corrections

The presence of biases in models can cause shifts and trends in reanalyses that can lead to sub-optimal forecasts. In AtmoRep, we correct for biases in the total precipitation rates by fine-tuning the pre-trained model on radar data from the RADKLIM dataset.

Counterfactuals

Counterfactuals are a methodology to answer "what if" questions, e.g. for the attribution of human impacts on extreme weather events. In AtmoRep, we modify the auxiliary information given to the network to generate counterfactual scenarios.

Attention maps

Attention maps have been used before in natural language processing and computer vision to understand what a transformer neural network has learned and to visualize and exemplify its generalization abilities. Link to AtmoRep attention maps.
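As background, an attention map is the matrix of weights that a transformer assigns between query and key tokens. The sketch below computes these weights with the standard scaled dot-product formula, softmax(Q K^T / sqrt(d)). It is a generic illustration, not the AtmoRep implementation; the function name `attention_map` and the toy vectors are hypothetical.

```python
import math
from typing import List, Sequence


def attention_map(
    queries: Sequence[Sequence[float]], keys: Sequence[Sequence[float]]
) -> List[List[float]]:
    """Return softmax(Q K^T / sqrt(d)): one row of weights per query."""
    d = len(keys[0])  # embedding dimension
    weights: List[List[float]] = []
    for q in queries:
        # Scaled dot product between this query and every key.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys
        ]
        # Numerically stable softmax over the keys.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


# Two toy tokens with 2-dimensional embeddings (illustrative values only).
tokens = [[1.0, 0.0], [0.0, 1.0]]
amap = attention_map(tokens, tokens)
```

Each row of `amap` sums to one and can be plotted as a heat map to see which tokens a given query attends to.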

Temporal interpolation

Temporal interpolation refers to the task of (re-)creating atmospheric state data with a higher temporal resolution than the input. The AtmoRep model shows substantially better skill than linear interpolation for 3-hour time windows.
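For reference, the linear-interpolation baseline mentioned above can be sketched as follows. The function name `linear_interpolate` and the toy temperature values are hypothetical; the sketch assumes two states given on the same grid points, 3 hours apart, and reconstructs the two intermediate hourly states.

```python
from typing import List, Sequence


def linear_interpolate(
    state_start: Sequence[float],
    state_end: Sequence[float],
    n_intermediate: int,
) -> List[List[float]]:
    """Return evenly spaced states strictly between the two input states."""
    if len(state_start) != len(state_end):
        raise ValueError("both states must cover the same grid points")
    if n_intermediate < 1:
        raise ValueError("n_intermediate must be at least 1")
    frames: List[List[float]] = []
    for k in range(1, n_intermediate + 1):
        w = k / (n_intermediate + 1)  # fraction of the time window elapsed
        frames.append(
            [(1.0 - w) * a + w * b for a, b in zip(state_start, state_end)]
        )
    return frames


# Toy temperatures in kelvin at two grid points, 3 hours apart.
t0 = [280.0, 290.0]
t3 = [283.0, 284.0]
hourly = linear_interpolate(t0, t3, 2)  # states at +1 h and +2 h
```

Linear interpolation treats every grid point independently and cannot represent moving features such as fronts, which is one reason a learned model can be expected to do better.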

History

July 2024: Kick-off of the related HClimRep project funded by the HelmholtzAI initiative!
June 2024: I. Luise, AI4Science Forum, invited seminar.
June 2024: I. Luise, University of Clermont-Ferrand, invited seminar.
June 2024: I. Luise, Helmholtz AI conference, satellite event, invited talk.
June 2024: I. Luise and C. Lessig, PASC 2024, invited talk and organisers of a mini-symposium on “Foundation models in Earth System Science”.
May 2024: I. Luise, ENES HPC Workshop on High resolution modeling in Earth system science, invited talk.
May 2024: I. Luise, ESA-ECMWF Machine Learning Workshop, talk and panelist.
May 2024: C. Lessig, Karlsruhe Institute of Technology, invited seminar.
April 2024: C. Lessig, EGU24, solicited talk.
April 2024: I. Luise, CERN OpenLab technical workshop.
November 2023: I. Luise, NeurIPS 2023, poster.
November 2023: The AtmoRep code is now open source! Check our GitHub.
August 2023: Pre-print on arXiv.
August 2023: Christian presented AtmoRep at Google Air Quality Journal Club (see below for slides).
July 2023: Christian presented AtmoRep at NVIDIA research (see below for slides).
June 2023: AtmoRep at the SIAM Conference on Mathematical & Computational Issues in the Geosciences 2023.
May 2023: Christian presented AtmoRep at GAMM 2023.
May 2023: Christian presented AtmoRep at ECMWF.
April 2023: Christian presented AtmoRep at EGU 2023.
March 2023: Ilaria presented AtmoRep and EMP2 at the CERN OpenLab Technical workshop in Geneva.
March 2023: Christian presented AtmoRep at the IntelliAQ workshop in Cologne. We had some lively discussions on where and how machine learning will be best used in atmospheric science in the future.
February 2023: Christian gave a talk at the NSF AI Institute for Research on Trustworthy AI in Weather, Climate, and Coastal Oceanography.
January 2023: AtmoRep has become part of the newly established WestAI service centre, which aims at enhancing and accelerating large-scale deep learning models for language, image and weather data processing.
December 2022: Christian presented an online poster at AGU on Tuesday, Dec. 13th.
November 2022: Christian presented a poster at the ECMWF Workshop on Machine Learning in Reading, UK.
October 2022: Christian presented online at Google Research!
October 2022: Ilaria presented online at the KI forum of the German Meteorological Service (DWD).
September 2022: Ilaria presented at the annual conference of the Italian society for Climate Science in Rome.
December 2021: The AtmoRep project was conceived by the Climate21 collaboration during the Machine Learning and the Physics of Climate seminar at the Kavli Institute in Santa Barbara, California.

Presentations

Journal publications:

S. Hoffmann and C. Lessig. AtmoDist: Self-supervised representation learning for atmospheric dynamics. Environmental Data Science, 2:e6, 2023. Link
The AtmoRep Collaboration. AtmoRep: A stochastic model of atmosphere dynamics using large scale representation learning. arXiv preprint, 2023. Link

Talks:

I. Luise, talk at AI+Environment Summit, November 2023.
C. Lessig, talk at Google Air Quality Journal Club, May 2023; Slides.
C. Lessig, NVIDIA Research, invited talk, May 2023; Slides.
C. Lessig, ECMWF, invited talk, May 2023; Slides.
C. Lessig, EGU, April 2023; Slides.
I. Luise, CERN OpenLab technical workshop, March 2023; Slides.
C. Lessig, IntelliAQ workshop, March 2023; Slides.
C. Lessig, NSF AI Institute for Research on Trustworthy AI in Weather, Climate, and Coastal Oceanography, February 2023; Slides.
C. Lessig, Google Research, October 2022; Slides.
I. Luise, DWD KI Forum, October 2022; Slides.

Posters:

I. Luise et al., NeurIPS, Dec. 2023; Posters.
C. Lessig et al., American Geophysical Union Annual Meeting 2022; Posters.
C. Lessig et al., ECMWF Workshop on Machine Learning, Nov. 2022; Posters.
I. Luise et al., Italian Society for Climate Sciences Annual Conference 2022.

Who we are

AtmoRep is a multidisciplinary collaboration among computer scientists from ECMWF, Earth scientists from the Jülich Supercomputing Centre and physicists from CERN.

Christian Lessig

Christian Lessig is a machine learning expert at ECMWF, the European Centre for Medium-Range Weather Forecasts. His background is in computer science, but he now also works in scientific computing and numerical analysis. In recent years, his research has moved towards addressing climate change, in particular by developing hybrid weather and climate simulation models that combine classical discretizations of the governing partial differential equations with neural networks that account for phenomena that are too expensive to simulate or whose physics is not well understood.

Ilaria Luise

Ilaria Luise is a Senior Research Fellow at CERN, the European Organization for Nuclear Research in Geneva. She works as a physicist within the Innovation Division of the CERN IT Department. Her background is in high energy physics and big data management. She is Co-PI of the EMP2 project at CERN, which is part of the CERN Innovation Programme on Environmental Applications (CIPEA). The EMP2 project aims at implementing the AtmoRep model into a digital twin engine. This is performed in collaboration with the EU-funded InterTwin project and the Digital Twin initiative at CERN.

Martin Schultz

Martin Schultz is the group leader of the Earth System Data Exploration research group at the Jülich Supercomputing Centre. He has more than 30 years of experience in working with atmospheric data and numerical modeling of atmospheric composition and climate. He has authored and co-authored more than 130 publications and was listed as a highly cited researcher in the field of environmental sciences in 2017 and 2020. He holds an ERC Advanced Grant (IntelliAQ), in which he explores the potential of machine learning for the analysis of air quality data.

Bing Gong

Bing Gong has been a postdoctoral researcher at the Jülich Supercomputing Centre since 2019. Her current work in the group focuses on developing state-of-the-art scalable deep learning neural networks for time series prediction and video frame prediction in weather and air quality applications. She obtained her Ph.D. in artificial intelligence applied to environmental science and energy from the Technical University of Madrid, Spain, in July 2017.

Michael Langguth

Michael Langguth holds a Master's degree in Physics of the Earth and Atmosphere from the Rheinische Friedrich-Wilhelms-University of Bonn. During his PhD, he implemented a hybrid parametrization scheme for deep convection in the ICOsahedral Non-hydrostatic (ICON) model developed by the DWD and the MPI-M. His current research interests focus on machine learning for the atmospheric Earth system, combined with expertise from numerical modelling.

Scarlet Stadtler

Scarlet Stadtler is a postdoctoral associate at the Jülich Supercomputing Centre (JSC). Her research focuses on explainable machine learning and uncertainty quantification. A trained meteorologist and atmospheric chemist, she applies data-driven techniques in air quality research. As PI of the KISTE project (AI strategy for Earth System data), she leads the construction of an Earth-AI software platform and an Earth-AI e-learning platform.