Releases: DHDev0/Muzero
V1.1
v1
- improve MCTS stability of priors when running multiple simulations
- multi-processor and GPU support
- support for all torch float types
- expand compatibility to Atari and MuJoCo for 1D and 2D observations
- add LSTM, Conv-LSTM, and Transformer models
- fix bug in previous backprop
- fix weight initialization for linear layers
- add priorities so the replay buffer is updated from training
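
The replay buffer item in the list above can be illustrated with a minimal sketch. It assumes a simple proportional prioritization scheme in which each sampled item's priority is overwritten with the magnitude of its latest training error. The class `PrioritizedReplayBuffer`, its methods, and the `alpha` and `epsilon` parameters are hypothetical names chosen for illustration. They are not taken from this repository, whose actual implementation may differ.

```python
import random


class PrioritizedReplayBuffer:
    """Toy prioritized replay buffer (hypothetical, not the repository's class)."""

    def __init__(self, alpha=1.0, epsilon=1e-6):
        self.alpha = alpha      # how strongly priorities skew sampling
        self.epsilon = epsilon  # keeps every priority strictly positive
        self.items = []
        self.priorities = []

    def add(self, item, priority=1.0):
        # New items get a default priority until training provides an error.
        self.items.append(item)
        self.priorities.append(priority)

    def sample(self, batch_size, rng=random):
        # Draw indices with probability proportional to priority ** alpha.
        weights = [p ** self.alpha for p in self.priorities]
        return rng.choices(range(len(self.items)), weights=weights, k=batch_size)

    def update_priorities(self, indices, errors):
        # After a training step, overwrite each sampled item's priority
        # with the magnitude of its latest training error.
        for i, err in zip(indices, errors):
            self.priorities[i] = abs(err) + self.epsilon


buffer = PrioritizedReplayBuffer()
for step in range(4):
    buffer.add({"step": step})

indices = buffer.sample(batch_size=2, rng=random.Random(0))
errors = [0.5 for _ in indices]  # placeholder for per-sample training errors
buffer.update_priorities(indices, errors)
```

In this sketch, items with larger training errors are sampled more often on later steps, which is the usual motivation for feeding priorities back from training.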