[BugFix] More flexible episode_reward computation in logger #136
This PR fixes the way episode rewards are computed in BenchMARL.

Here is an overview:

BenchMARL looks at the global `done` (always assumed to be set), which can usually be computed using `any` or `all` over the single-agent dones. In all cases, the global done is what is used to compute the episode reward.

We log `episode_reward` min, mean, and max over episodes at three different levels.

Requirement: when agents are done and the global done is not set, those agents should receive a reward of 0 (if you are not using global rewards).
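The logic above can be sketched as follows. This is a hypothetical illustration, not BenchMARL's actual code: the function names (`global_done`, `masked_rewards`) and the plain-list representation are assumptions made for clarity.

```python
from typing import List


def global_done(agent_dones: List[bool], mode: str = "any") -> bool:
    # The global done is derived with `any` or `all` over single-agent dones.
    return any(agent_dones) if mode == "any" else all(agent_dones)


def masked_rewards(
    rewards: List[float], agent_dones: List[bool], mode: str = "all"
) -> List[float]:
    # If an agent is done but the global done is not set, that agent
    # should receive a reward of 0 (when not using a global reward).
    if global_done(agent_dones, mode):
        return rewards
    return [0.0 if done else r for r, done in zip(rewards, agent_dones)]


# Agent 0 is done but the global done (computed with `all`) is not set,
# so agent 0's reward is zeroed while agent 1 keeps accumulating reward.
print(masked_rewards([1.0, 2.0], [True, False]))  # → [0.0, 2.0]
```

With `mode="all"`, a single agent finishing early no longer contributes spurious reward to the episode total; once all agents are done, rewards pass through unchanged.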
Fixes #135