Spectral DQN

Adapting to Reward Progressivity via Spectral Reinforcement Learning, ICLR'21.

Abstract

We consider reinforcement learning tasks with progressive rewards; that is, tasks where the rewards tend to increase in magnitude over time. We hypothesise that this property may be problematic for value-based deep reinforcement learning agents, particularly if the agent must first succeed in relatively unrewarding regions of the task in order to reach more rewarding regions. To address this issue, we propose Spectral DQN, which decomposes the reward into frequencies such that the high frequencies only activate when large rewards are found. This allows the training loss to be balanced so that it gives more even weighting across small and large reward regions. In two domains with extreme reward progressivity, where standard value-based methods struggle significantly, Spectral DQN is able to make much farther progress. Moreover, when evaluated on a set of six standard Atari games that do not overtly favour the approach, Spectral DQN remains more than competitive: While it underperforms one of the benchmarks in a single game, it comfortably surpasses the benchmarks in three games. These results demonstrate that the approach is not overfit to its target problem, and suggest that Spectral DQN may have advantages beyond addressing reward progressivity.
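To make the idea of splitting a reward into frequencies concrete, here is a minimal sketch of one decomposition with the property described above: higher components stay at zero until the reward is large enough. The function names, the base of 2, the number of components, and the clipping formula are assumptions made for illustration, and the sketch only handles non-negative rewards. See the paper and the code in /agent for the method actually used.

```python
def spectral_decompose(reward, base=2.0, num_freqs=8):
    """Split a non-negative reward into components that each lie in [0, 1].

    Component i carries weight base**i and only becomes non-zero once the
    reward exceeds the combined weight of all lower components.
    """
    components = []
    for i in range(num_freqs):
        # Combined weight of components 0 .. i-1 (geometric series).
        offset = (base ** i - 1.0) / (base - 1.0)
        value = (reward - offset) / base ** i
        components.append(min(1.0, max(0.0, value)))
    return components


def spectral_recompose(components, base=2.0):
    """Invert spectral_decompose by weighting component i with base**i."""
    return sum(c * base ** i for i, c in enumerate(components))
```

With these assumptions, a reward of 3 fills only the two lowest components, while a reward of 100 also reaches the seventh, and rewards up to base**num_freqs - 1 (255 here) are recovered exactly by spectral_recompose.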

Requirements

The installation instructions below assume that you have Anaconda installed. You will also need to source some Atari ROMs and place them in the /roms folder.

Installing

First, install the Arcade Learning Environment (ALE) by following the instructions in README_ALE.md.

Next, create a Conda environment with the necessary packages installed by running:

conda env create --file environment.yml

Running

source activate spectral_dqn
cd agent
python2 ./python_agent.py game_title

The agent's configuration settings are stored in /agent/python_agent.py.

To see the agent playing the game, set USE_SDL = True, then select the SDL window that appears and press 'd' to toggle the display.

To see performance and debugging graphs, set agent_params["show_graphs"] = True.

If you don't have a CUDA enabled GPU, you can set agent_params["gpu"] = -1 to train on the CPU.
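As a rough illustration, the two agent_params settings above might look like this in /agent/python_agent.py. The real file defines more entries. The empty starting dictionary is an assumption, included only so that the snippet runs on its own.

```python
# Hypothetical excerpt: only the two keys below are named in this README.
agent_params = {}
agent_params["show_graphs"] = True  # show performance and debugging graphs
agent_params["gpu"] = -1            # -1 trains on the CPU instead of a CUDA GPU
```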

