Fix tests for mps support #2005

deathcoder · 2024-09-14T16:15:12Z

Description

closes #914

When I started on the base branch feat/mps-support there were 45 failing tests that I now consider fixed. A few things to note:

In most cases I added a check: if the mps device is available, I apply casts to make sure tensors are float32 and remain float32. I'm not sure this approach is correct, but I'm happy to change it to something else that also works.
I decided to skip the test_float64_action_space tests entirely, since float64 is not supported on mps.
The test test_save_load[True-SAC] only fails when running the full suite or all test_save_load tests (make pytest or python3 -m pytest -v -k 'test_save_load'). If I instead run the single failing test (python3 -m pytest -v -k 'test_save_load[True-SAC]'), it passes 🤷‍♂️. I also ran the test file in PyCharm and it passes there too, so I'm not sure what the issue is. I can add the stack trace of the failing test in a comment if needed.
I'm not sure about a few things regarding this template. I think these are not breaking changes, but for example I force a cast in vec_normalize:normalize_reward, which may be considered breaking?
I also looked into the changelog but couldn't figure out how to edit it.
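For reviewers, the "only cast float64 to float32" idea mentioned above could look roughly like the sketch below. It is not the code from this PR: the helper name cast_reward_for_mps is hypothetical, and it uses plain NumPy arrays as a stand-in for what normalize_reward returns. The point it illustrates is that only float64 input is downcast, so other dtypes pass through unchanged.

```python
import numpy as np


def cast_reward_for_mps(reward: np.ndarray) -> np.ndarray:
    """Downcast float64 rewards to float32; leave every other dtype untouched.

    Hypothetical helper for illustration only (not part of stable-baselines3).
    """
    if reward.dtype == np.float64:
        return reward.astype(np.float32)
    return reward


# float64 is NumPy's default float dtype, so rewards often arrive in it.
rewards64 = np.array([0.5, -1.25, 3.0], dtype=np.float64)
rewards32 = cast_reward_for_mps(rewards64)

# Integer rewards are returned as-is, which keeps the change narrow.
int_rewards = cast_reward_for_mps(np.array([1, 2], dtype=np.int64))
```

Restricting the cast to float64 input limits the behaviour change for callers that already pass float32 or integer arrays, which is relevant to the "is this breaking?" question above.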

Here is the full list of fixed tests

Unsupported tests fixed by skipping
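A skip of this kind could be sketched as follows. This is an assumption about the shape of the change, not the PR's actual diff: the test name and body are hypothetical placeholders, and the sketch relies only on torch.backends.mps.is_available() and pytest.mark.skipif.

```python
import pytest
import torch

# True only when this PyTorch build can actually use the MPS backend.
MPS_AVAILABLE = torch.backends.mps.is_available()


@pytest.mark.skipif(MPS_AVAILABLE, reason="MPS does not support float64 tensors")
def test_float64_example():
    # Hypothetical stand-in for a test that depends on float64.
    x = torch.zeros(3, dtype=torch.float64)
    assert x.dtype == torch.float64
```

On machines without MPS the test still runs, so float64 coverage is kept on CPU and CUDA.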

Motivation and Context

I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

Attempt fix ci: only cast reward from float64 to float32

araffin · 2024-09-18T12:31:18Z

Hello,
thanks for having a look at that.
Apart from some tests failing, do the algorithms work under normal conditions? (for instance PPO("MlpPolicy", "Pendulum-v1", device="mps").learn(10_000))

(In theory, if pytorch supports MPS properly, you would only need to specify the device)
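The check suggested here can be wrapped in a small script, sketched below. The helper names pick_device and smoke_test are hypothetical; the PPO call is the one quoted in the comment, with the device chosen at runtime so the same script also runs on machines without MPS.

```python
import torch


def pick_device() -> str:
    """Return "mps" when PyTorch reports a usable MPS backend, else "cpu"."""
    mps = getattr(torch.backends, "mps", None)  # absent on old PyTorch builds
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


def smoke_test(total_timesteps: int = 10_000) -> None:
    """Train PPO briefly on Pendulum-v1 on the selected device."""
    # Imported lazily so pick_device() stays usable without SB3 installed.
    from stable_baselines3 import PPO

    model = PPO("MlpPolicy", "Pendulum-v1", device=pick_device())
    model.learn(total_timesteps)
```

Calling smoke_test() on an Apple-silicon machine exercises the same path as the one-liner above. It only shows that training runs without errors, not that learning quality matches CPU.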

deathcoder · 2024-09-18T15:09:42Z

Hey 👋 yes, that works. I have also tested A2C, both on this branch. I'm still a beginner in this, so I can't really say whether all advanced use cases also work, but I think having the tests passing is a good indicator.

Fix tests

1c25053

deathcoder mentioned this pull request Sep 14, 2024

Use MPS device when available #951


deathcoder and others added 3 commits September 17, 2024 17:41

Attempt fix ci: only cast reward from float64 to float32

f822ef5

allow running workflows from ui

1ac4a60

Merge pull request #2 from deathcoder/attempt-fix-ci

9970f51
