Skip to content

Evaluating and improving the performance of ECMWF’s current land surface Machine Learning model prototype.

Notifications You must be signed in to change notification settings

ECMWFCode4Earth/ML-BEES

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ml-bees_logo

ML-BEES

Developing Machine Learning-based Emulation of the Earth's Surface

UniMP ECLand

Layer 1 soil moisture inference from UniMP and ECLand model

About the project

Machine Learning (ML) is becoming increasingly important for numerical weather prediction (NWP), and ML-based models have reached similar or improved forecast scores than state-of-the-art physical models. ECMWF has intensified its activities in the application of ML models for atmospheric forecasting and developed the Artificial Intelligence/Integrated Forecasting System (AIFS). To harness the potential of ML for land modelling and data assimilation activities at ECMWF, a first ML emulator prototype has been developed (Pinnington et al. AMS Annual Meeting 2024). The ML model was trained on the "offline" ECMWF Land Surface Modelling System (ECLand) using a preselected ML training database. The current prototype is based on the information of model increments without introducing further temporal constraints and provides a cheap alternative to physical models.

By developing and evaluating four types of state-of-art machine learning emulators (XGBoost, MLP, UniMP (GNN), and MAMBA), our project demonstrated that machine learning (ML) models can effectively and efficiently emulate land surface models in terms of spatial-temporal variability, uncertainty, and physical consistency. Among the models tested, XGBoost delivered the best overall performance across all 17 target variables, while the MLP model, though slightly less accurate, was the most efficient, providing inference six times faster than XGBoost. These ML models can serve as reliable alternatives to physically-based land surface models for computationally intensive experiments, with the choice of model depending on the specific task.

Project structure

ML-BEES/
├── AR5_region_mask/         # .shp for masking AR5 regions for spatial condition analysis
├── ML-BEES-eval/            # functions and Jupyter notebooks for ML emulator evaluation; more details please refers to readme.md
├── ML-BEES-train/           # Source code for ML models and utility functions for data loading and inference
├── media/                   # Images and videos used in README or documentation
├── ai_land_original/        # Example scripts developed by Pinnington et al.
├── README.md                
├── LICENSE                  # License for the project
└── .gitignore               # Files to ignore in git

Documentation

For more details about our project workflow and analysis, please find them in the following slides and project report.

Authors

Participants:

  • Till Fohrmann
  • Johannes Leonhardt
  • Hakam Shams
  • Yikui Zhang

Acknowledgments

Big thanks for all of our mentors from European Centre for Medium-Range Weather Forecasts (ECMWF) in Reading and Bonn, as well as the resources and funding from ECMWF Code For Earth 2024 Event!

About

Evaluating and improving the performance of ECMWF’s current land surface Machine Learning model prototype.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •