Skip to content

Latest commit

 

History

History
20 lines (14 loc) · 1.64 KB

README.md

File metadata and controls

20 lines (14 loc) · 1.64 KB

Neural Networks For GWAS

A final project for DS 4440 - Neural Networks

Using the python library Selene to enable applications of neural networks to Genome Wide Association Studies.

Instructions for running training:

You will need to upload this directory to Google Drive, where you can train the models by opening run_training.ipynb in Colab and running each of the cells.

  • Data can be downloaded and processed by following the instructions in getting_started_with_selene.ipynb.
  • To select which model is used, simply update the two relevant lines in training_configs.yml.
  • Information about the training (loss and precision) will be logged periodically in colab. To get the raw data from the logs, copy them into their own .log file and use scripts/scrape_log_values.py to extract the raw list of values. These values can then be used to visualize the results.

Important files

  • run_training.ipynb loads data files and runs training in colab
  • training_configs.yml contains all of the configurations for training, which can be updated to train a different model, change the length of training, or change the way that logs are generated.
  • getting_started_with_selene.ipynb contains instructions for preparing data and installing Selene. This file was copied from a tutorial in the Selene GitHub repo.
  • visualize_results.ipynb contains visualizations to compare the performance of each model.

View the colab files and data in Google Drive.