v0.4.3

@PaParaZz1 released this 22 Sep 17:02

Env

  1. add rule-based gomoku expert (#465)

Algorithm

  1. fix a2c policy batch size bug (#481)
  2. enable activation option in collaq attention and mixer
  3. minor fix for IBC (#477)

Enhancement

  1. add IGM support (#486)
  2. add tb logger middleware and demo

Fix

  1. the type conversion in ding_env_wrapper (#483)
  2. di-orchestrator version bug in unittest (#479)
  3. data collection errors caused by shallow copies (#475)
  4. gym==0.26.0 seed args bug
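
For readers unfamiliar with the class of bug behind the shallow-copy fix (#475), here is a minimal sketch of how a shallow copy can corrupt collected data. It is an illustration only and does not reproduce the project's actual collector code: the `collect` function, the `obs` dictionary, and the buffer layout are all hypothetical names chosen for this example.

```python
import copy


def collect(n_steps, use_deepcopy):
    """Store one snapshot of a reused, mutable observation per step.

    Hypothetical helper for illustration; not part of any library API.
    """
    obs = {"state": [0, 0]}  # the "environment" reuses this object every step
    buffer = []
    for step in range(n_steps):
        obs["state"][0] = step  # in-place mutation of the nested list
        # copy.copy creates a new dict but still shares the inner list;
        # copy.deepcopy also copies the inner list.
        snapshot = copy.deepcopy(obs) if use_deepcopy else copy.copy(obs)
        buffer.append(snapshot)
    return buffer


shallow = collect(3, use_deepcopy=False)
deep = collect(3, use_deepcopy=True)

print([t["state"][0] for t in shallow])  # [2, 2, 2]: every entry aliases one list
print([t["state"][0] for t in deep])     # [0, 1, 2]: each step kept its own data
```

With the shallow copy, every stored entry points at the same inner list, so earlier steps are silently overwritten by the latest one. A deep copy (or copying at the point where the data is stored) is one common way to avoid this, assuming the copied objects are small enough for the extra cost to be acceptable.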

Style

  1. add readme tutorial links (environment & algorithm) (#490) (#493)
  2. adjust location of the default_model method in policy (#453)

New Repo

  • DI-sheep: Deep Reinforcement Learning + 3 Tiles Game

Contributors: @PaParaZz1 @nighood @norman26625 @ZHZisZZ @cpwan @mahuangxu