v0.4.3
Env
- add rule-based gomoku expert (#465)
Algorithm
- fix a2c policy batch size bug (#481)
- enable activation option in collaq attention and mixer
- minor fix about IBC (#477)
Enhancement
- add IGM support (#486)
- add tb logger middleware and demo
Fix
- the type conversion in ding_env_wrapper (#483)
- di-orchestrator version bug in unittest (#479)
- data collection errors caused by shallow copies (#475)
- gym==0.26.0 seed args bug
Style
- add readme tutorial link(environment & algorithm) (#490) (#493)
- adjust location of the default_model method in policy (#453)
New Repo
- DI-sheep: Deep Reinforcement Learning + 3 Tiles Game
Contributors: @PaParaZz1 @nighood @norman26625 @ZHZisZZ @cpwan @mahuangxu