Releases: keraJLi/rejax
Releases · keraJLi/rejax
🌴 Jumanji compatability
Train rejax agents on Jumanji environments!
- Added
rejax.compat.jumanji2gymnax
, create Jumanji environments viarejax.compat.create("jumanji/...")
- Dropped support for Python 3.9 (it might still work but we now want flax 0.9, which requires 3.10)
- Updated configs and added tests to make sure they are valid
- Changed build system to use pdm
- Changed package layout to use source layout and renamed test folder to tests
Full Changelog: v0.1.0...v0.1.1
v0.1.0
This update changes the algorithm interface:
Before (v0.0.x) | After (v0.1.0) |
---|---|
|
|
Rationale:
- It's simpler and more intuitive
- Parameters and algorithm subroutines depend on each other (e.g. an algorithm that samples from a replay buffer also has the buffer's size as an HP). It makes sense to collect them in the same class to modularize the algorithm architecture.
- We can eliminate a lot of boilerplate code by inheriting from mixins that have both parameters and subroutines.
What's Changed
- Merged config and algorithm
- New algorithm: Implicit Quantile Networks by Dabney et al, 2018
- New algorithm: Parallelised Q Networks by Gallici, Fellows et al, 2024
- Removed DDPG, as it is now a special case of TD3
- Added support for more than two critics to SAC and TD3
- Changed default hyperparameters (to be powers of 2 mostly)
- Change the name of hyperparameters: (
gradient_steps
->num_epochs
,tau
->polyak
) - Removed
rejax.evaluate.make_evaluate
, userejax.evaluate.evaluate
instead - Moved
rejax.algos.networks
andrejax.algos.buffers
torejax
- New module:
rejax.compat
implements loading environments from different packages. Currently supports gymnax, brax, and navix - Removed
rejax.brax2gymnax
(use the newrejax.compat
instead)
Full Changelog: v0.0.1...v0.1.0
🎉Pypi release!
All basic features are here!
I've kept this as a pre-release since there will likely be some breaking changes in the future.