This repository contains the code for the course Applied Machine Learning in Genomic Data Science (AMLG).
This course was held at:
- Winter Semester 2023/24, Leibniz University Hannover
- Winter Semester 2024/25, Leibniz University Hannover
Happy coding! 👩💻👨💻
In this course, all exercises are provided as Jupyter notebooks.
A Jupyter notebook is a JSON file, following a versioned schema, usually ending with the
.ipynb
extension. The main part of a Jupyter notebook is a list of cells. List of cells are different types of cells for Markdown, code, and output of the code type cells.
The notebooks are organized in demos, exercises, and projects.
In each exercises folder, you will find two versions of each notebook: one named, e.g., hic_analysis.ipynb
, and another one named hic_analysis_exercise.ipynb
.
Please work in the exercise version.
If you get stuck, feel free to take a look at the corresponding solution.
Note: The exercise versions will be uploaded according to the current status of the course.
You can simply clone the repository over HTTP via the command line:
git clone https://github.com/voges/amlg.git
Git is probably already installed on every Linux distribution. On Windows systems, we recommend using the Windows Subsystem for Linux along with any long-term support (LTS) Ubuntu distribution. Please refer to the Ubuntu documentation for installation instructions. On Mac systems, we recommend installing Git using the Xcode command line tools (via
xcode-select --install
) or via Homebrew.
We recommend using Visual Studio Code and the provided development container.
We recommend using the GWDG Jupyter Cloud. Here, in addition to the terminal, you can use the graphical user interface to clone the repository.
Alternatively, you can use any other online Jupyter server, such as Google Colab.
The data used are available via the Harvard Dataverse under the DOI 10.7910/DVN/ZSVS5X. A copy of the data is also hosted here via Seafile at Leibniz University Hannover. Note: It is not necessary to download the data beforehand. The individual notebooks already contain the code to download the necessary data.