v0.4.3

PaParaZz1 released this 22 Sep 17:02

· 299 commits to main since this release

f8a596f

Env

add rule-based gomoku expert (#465)

Algorithm

fix a2c policy batch size bug (#481)
enable activation option in collaq attention and mixer
minor fix about IBC (#477)

Enhancement

add IGM support (#486)
add tb logger middleware and demo

Fix

the type conversion in ding_env_wrapper (#483)
di-orchestrator version bug in unittest (#479)
data collection errors caused by shallow copies (#475)
gym==0.26.0 seed args bug

Style

add readme tutorial link(environment & algorithm) (#490) (#493)
adjust location of the default_model method in policy (#453)

New Repo

DI-sheep: Deep Reinforcement Learning + 3 Tiles Game

Contributors: @PaParaZz1 @nighood @norman26625 @ZHZisZZ @cpwan @mahuangxu

Contributors

cpwan, PaParaZz1, and 4 other contributors

Assets 2