GitHub

The AlphaGo (paper), AlphZero (paper), and Muzero (paper) algorithms created by Deepmind were made to solve complex tasks comparable to those met in the real world. It so happens that games are an accurate representation of some of those tasks, and so, games are a great way to test the learning capabilities as well as the generality of these models. The Muzero algorithm is the compilement of its predecessors (AlphaGo and AlphaZero) which are inferior to it in both accuracy and robusticity. The template was used here to reproduce the Muzero algorithm usng the multiprocessing module, which I recommend be swapped according to one's needs (TPU).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
muzero_repro.py		muzero_repro.py
networks.py		networks.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

AzerAfram/Muzero_reproduction

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages