path-finding

This is a repository for a class project in reinforcement learning.

Contributors:
Zoe Kanavas ([email protected])
Erin Musabandesu ([email protected])
Liam Lynch ([email protected])

Data can be found here:

UC Davis Google Drive Data Access

Run the file RL_testbed_final.py from the same directory as the following:

Sample_A (data folder)
heuristic_info_all_samples.csv

Creates a folder for results (pickled dictionary) and figs based on a trial number. The trial number is also taken as the random seed.

Two algorithms implemented:

Episodic semigradient SARSA (Sutton and Barto, pg. 244) - with linear approximation function
Continuous semigradient SARSA (Sutton and Barto, pg. 251) - with linear approximation function