The AlphaGo (paper), AlphZero (paper), and Muzero (paper) algorithms created by Deepmind were made to solve complex tasks comparable to those met in the real world. It so happens that games are an accurate representation of some of those tasks, and so, games are a great way to test the learning capabilities as well as the generality of these models. The Muzero algorithm is the compilement of its predecessors (AlphaGo and AlphaZero) which are inferior to it in both accuracy and robusticity. The template was used here to reproduce the Muzero algorithm usng the multiprocessing module, which I recommend be swapped according to one's needs (TPU).
-
Notifications
You must be signed in to change notification settings - Fork 0
AzerAfram/Muzero_reproduction
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published