ML-BEES

Developing Machine Learning-based Emulation of the Earth's Surface

Layer 1 soil moisture inference from UniMP and ECLand model

About the project

Machine Learning (ML) is becoming increasingly important for numerical weather prediction (NWP), and ML-based models have reached forecast scores similar to or better than those of state-of-the-art physical models. ECMWF has intensified its activities in applying ML models to atmospheric forecasting and has developed the Artificial Intelligence/Integrated Forecasting System (AIFS). To harness the potential of ML for land modelling and data assimilation activities at ECMWF, a first ML emulator prototype has been developed (Pinnington et al. AMS Annual Meeting 2024). The ML model was trained on the "offline" ECMWF Land Surface Modelling System (ECLand) using a preselected ML training database. The current prototype predicts model increments without introducing further temporal constraints, and provides a computationally cheap alternative to physical models.
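To make the increment-based design concrete, the following is a minimal sketch of how such an emulator could be rolled forward in time. It is an illustration under stated assumptions, not the code used in this repository: `rollout`, `predict_increment` and `toy_increment` are hypothetical names, and the toy increment function merely stands in for a trained ML model.

```python
from typing import Callable, List, Sequence

State = List[float]


def rollout(
    initial_state: Sequence[float],
    forcings: Sequence[Sequence[float]],
    predict_increment: Callable[[Sequence[float], Sequence[float]], Sequence[float]],
) -> List[State]:
    """Roll an increment-based emulator forward, one step per forcing record.

    Each step depends only on the current state and the current forcing;
    no additional temporal constraint is imposed.
    """
    state = list(initial_state)
    trajectory = [state]
    for forcing in forcings:
        increment = predict_increment(state, forcing)
        # New state = previous state + predicted increment.
        state = [s + d for s, d in zip(state, increment)]
        trajectory.append(state)
    return trajectory


def toy_increment(state: Sequence[float], forcing: Sequence[float]) -> List[float]:
    """Hypothetical stand-in for a trained model: relax the state towards the forcing."""
    return [0.5 * (forcing[0] - state[0])]


# Three time steps with a constant (made-up) forcing value.
trajectory = rollout([0.2], [[0.4], [0.4], [0.4]], toy_increment)
print(trajectory)
```

In a real setup, `predict_increment` would wrap the inference call of a trained emulator, and the state and forcing vectors would hold the land surface variables and meteorological inputs for each grid point.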

By developing and evaluating four types of state-of-the-art machine learning emulators (XGBoost, MLP, UniMP (GNN), and MAMBA), our project demonstrated that ML models can effectively and efficiently emulate land surface models in terms of spatio-temporal variability, uncertainty, and physical consistency. Among the models tested, XGBoost delivered the best overall performance across all 17 target variables, while the MLP model, though slightly less accurate, was the most efficient, providing inference six times faster than XGBoost. These ML models can serve as reliable alternatives to physically based land surface models for computationally intensive experiments, with the choice of model depending on the specific task.

Project structure

ML-BEES/
├── AR5_region_mask/         # Shapefiles (.shp) for masking AR5 regions for spatial condition analysis
├── ML-BEES-eval/            # Functions and Jupyter notebooks for ML emulator evaluation; see its readme.md for more details
├── ML-BEES-train/           # Source code for ML models and utility functions for data loading and inference
├── media/                   # Images and videos used in README or documentation
├── ai_land_original/        # Example scripts developed by Pinnington et al.
├── README.md                
├── LICENSE                  # License for the project
└── .gitignore               # Files to ignore in git

Documentation

For more details about our project workflow and analysis, please see the slides and project report.

Authors

Participants:

Till Fohrmann
Johannes Leonhardt
Hakam Shams
Yikui Zhang

Acknowledgments

Big thanks to all of our mentors from the European Centre for Medium-Range Weather Forecasts (ECMWF) in Reading and Bonn, and for the resources and funding provided by the ECMWF Code for Earth 2024 event!
