Implementation of Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model (MuZero) by DeepMind,
for the CartPole-v0 environment.
- MuZero + naive tree search is working.
- MuZero + Monte Carlo tree search (MCTS) is now working.
(The search policy has to be approximately uniform during the first episode; if it is not, restart training.)
- Improvements: more tricks/hacks for better MCTS training.
Naive tree search: fully expand the tree to depth n and score each leaf by the discounted sum of rewards along its path plus the discounted value estimate at the leaf.
Take the action that is the first action on the path from the root to the maximum-scoring leaf.
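
A minimal sketch of this naive tree search, assuming a hypothetical model interface `model.dynamics(state, action) -> (next_state, reward)` and `model.value(state) -> float` standing in for the learned dynamics and prediction networks:

```python
import itertools

def naive_tree_search(model, root_state, num_actions, depth, discount=0.997):
    """Fully expand the tree to `depth`, score each leaf by its discounted
    return (rewards along the path plus the discounted leaf value), and
    return the first action on the path to the best leaf.

    `model.dynamics` and `model.value` are assumed names, not the exact
    interface of this repo.
    """
    best_return, best_first_action = float("-inf"), None
    # Enumerate every action sequence of length `depth` (full expansion).
    for actions in itertools.product(range(num_actions), repeat=depth):
        state, ret, scale = root_state, 0.0, 1.0
        for a in actions:
            state, reward = model.dynamics(state, a)
            ret += scale * reward
            scale *= discount
        ret += scale * model.value(state)  # bootstrap with the learned value
        if ret > best_return:
            best_return, best_first_action = ret, actions[0]
    return best_first_action
```

For CartPole-v0 (2 actions) a small depth such as n = 4 already enumerates only 16 leaves, which is why full expansion is feasible before switching to MCTS.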