This Project Pythia Cookbook covers an extremely basic precipitation classification project. This notebook will introduce learners to the scikit-learn API, basic exploratory data analysis (EDA), and evaluations. It is meant to be a very early and basic introduction to these concepts, it is not meant to be an in-depth intorduction to machine learning. It could be the first introduction to machine learning for learners familiar with weather data.
This cookbook is meant to be a companion to Unidata's CyberTraining project.
Ana Castaneda Montoya, Thomas Martin
This notebook has a few sections, from inital data loading to a end to end machine learning workflow.
This section gives some nice examples of pair plots in Seaborn, and Correlation Matricies.
For machine learning, we need a testing, training, and validation dataset. This section covers how to do that, and gives some great refrences on the why.
For (most) machine learning models, scaling is necessary. This sections covers how to do that.
The part where we actually train a model! We also see how good it is.
You can either run the notebook using Binder or on your local machine.
The simplest way to interact with a Jupyter Notebook is through
Binder, which enables the execution of a
Jupyter Book in the cloud. The details of how this works are not
important for now. All you need to know is how to launch a Pythia
Cookbooks chapter via Binder. Simply navigate your mouse to
the top right corner of the book chapter you are viewing and click
on the rocket ship icon, (see figure below), and be sure to select
“launch Binder”. After a moment you should be presented with a
notebook that you can interact with. I.e. you’ll be able to execute
and even change the example programs. You’ll see that the code cells
have no output at first, until you execute them by pressing
{kbd}Shift
+{kbd}Enter
. Complete details on how to interact with
a live Jupyter notebook are described in Getting Started with
Jupyter.
If you are interested in running this material locally on your computer, you will need to follow this workflow:
(Replace "cookbook-example" with the title of your cookbooks)
-
Clone the
https://github.com/ThomasMGeo/ptype-ml-cookbook
repository:git clone https://github.com/ThomasMGeo/ptype-ml-cookbook.git
-
Create and activate your conda environment from the
environment.yml
fileconda env create -f environment.yml conda activate cookbook-example
-
Move into the
notebooks
directory and start up Jupyterlabcd notebooks/ jupyter lab