This Deep Q-Network maps the 37-dimensional state space of the Unity environment to a policy over the 4 possible actions. More detail is given in the report. The majority of the project code is adapted from this Udacity repository: https://github.com/udacity/Value-based-methods
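For orientation, here is a minimal sketch of that kind of state-to-Q-value network, assuming the usual PyTorch layout from the Udacity repo; the class name, layer names, and hidden width are illustrative and may differ from the actual `model.py`.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class QNetwork(nn.Module):
    """Maps a 37-dimensional state vector to Q-values for the 4 actions."""

    def __init__(self, state_size=37, action_size=4, hidden=64):
        super().__init__()
        self.fc1 = nn.Linear(state_size, hidden)
        self.fc2 = nn.Linear(hidden, hidden)
        self.fc3 = nn.Linear(hidden, action_size)

    def forward(self, state):
        x = F.relu(self.fc1(state))
        x = F.relu(self.fc2(x))
        return self.fc3(x)  # one Q-value per action

# Greedy action selection: take the action with the highest estimated Q-value.
# q_net = QNetwork()
# action = q_net(torch.from_numpy(state).float().unsqueeze(0)).argmax(dim=1).item()
```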
- Clone the repo:

git clone https://github.com/npretor/npretor_udacity_deepRL
cd npretor_udacity_deepRL
- Install conda if not already installed
- Install dependencies: installing the Unity dependencies was a pain (running OS X Big Sur); that can be another blog post. Install the non-Unity requirements with:

conda create --name deepqn --file package-list.txt
If using JupyterLab:

python3 -m ipykernel install --user --name=deepqn
source activate deepqn
python3 -m jupyterlab
If using bash:
source activate deepqn
The scripts below run without the Weights and Biases module. Use the Jupyter notebook if you want Weights and Biases tracking, but you will need to log in and configure wandb yourself.
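If you do want tracking, a minimal wandb setup looks roughly like the following; the project name and logged metrics here are placeholders, not the ones used in the notebook.

```python
import wandb

wandb.login()                                  # prompts for your API key on first use
run = wandb.init(project="deepqn-navigation")  # placeholder project name
run.log({"episode": 100, "score": 13.0})       # placeholder metrics
run.finish()
```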
Train the agent, then run the demo:

python3 Navigation_Training.py
python3 Navigation_Demo.py