Continuous CQL loss logging and aligning with discrete logging #317
Conversation
Codecov Report
@@            Coverage Diff             @@
##    refactor_loss     #317      +/-  ##
=========================================
+ Coverage       92.72%   92.91%   +0.18%
=========================================
  Files             108      108
  Lines            7353     7109     -244
=========================================
- Hits             6818     6605     -213
+ Misses            535      504      -31
Thank you for your PR! As I commented below, I'll take the first pass at refactoring the method signatures. Then, I'll let you know when it's done.
@@ -14,7 +14,7 @@
 docs/d3rlpy*.rst
 docs/modules.rst
 docs/references/generated
 coverage.xml
-.coverage
+.coverage*
Is this change necessary?
When running tests, I seem to get .coverage... files with references to my local system. Maybe it's the way I'm running the test?
return loss + conservative_loss, conservative_loss

@train_api
def update_critic(self, batch: TorchMiniBatch) -> np.array:
This is altering the signature of `update_critic`, which is `def update_critic(self, batch: TorchMiniBatch) -> float:`. Thus, we actually need to refactor these methods first. I'll take the first pass to return `Dict[str, float]`. Then, you can make changes on top of it. Sorry for the inconvenience. I'll let you know when it's done.
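The proposed refactor can be sketched as follows. This is a simplified, hypothetical illustration of the signature change being discussed (the names and placeholder values are not d3rlpy's actual implementation): instead of returning a single float, `update_critic` returns every named loss component so each one can be logged.

```python
from typing import Dict

def update_critic(batch) -> Dict[str, float]:
    # Hypothetical sketch: placeholder values stand in for the computed
    # TD loss and the CQL conservative penalty.
    loss = 1.5
    conservative_loss = 0.25
    # Return all named components so the logger can record each metric,
    # rather than collapsing everything into one float.
    return {
        "critic_loss": loss + conservative_loss,
        "conservative_loss": conservative_loss,
    }

print(update_critic(None))  # {'critic_loss': 1.75, 'conservative_loss': 0.25}
```

With this shape, the training loop can iterate over the returned dict and log every component uniformly across algorithms.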
The interface has been updated in this commit: 67723be . Please check.
@takuseno - I'm sorry for the delay! All looks good to me except my one comment on the AWAC d3rlpy/algos/qlearning/awac.py file. Cheers
@takuseno is there anything else you need from me at all for the PR? :)
@joshuaspear Thanks for the contribution. I've changed the target branch to `refactor_loss` because I've made some new changes to the master branch. I'm seeing conflicts between your PR and the `refactor_loss` branch. Could you resolve them? Here is an example instruction for merging the `refactor_loss` branch into your branch.
$ git fetch upstream
$ git checkout master
$ git merge upstream/refactor_loss
@takuseno no probs - will do
@takuseno have merged the branches - I included a couple more data classes for the loss outputs FYI
Fixed a small typo. Many thanks again!
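The loss-output dataclasses mentioned above could look something like this. This is a minimal sketch under assumptions: the class and field names are illustrative, not the actual identifiers added in the PR.

```python
import dataclasses
from typing import Dict

@dataclasses.dataclass(frozen=True)
class CriticLossRecord:
    # Hypothetical container for critic loss components; frozen so a
    # training step cannot mutate the recorded values after the fact.
    critic_loss: float
    conservative_loss: float

    def to_dict(self) -> Dict[str, float]:
        # Flatten into the Dict[str, float] shape the logger expects.
        return dataclasses.asdict(self)

record = CriticLossRecord(critic_loss=1.75, conservative_loss=0.25)
print(record.to_dict())  # {'critic_loss': 1.75, 'conservative_loss': 0.25}
```

A dataclass keeps the per-algorithm loss components named and type-checked, while `to_dict` preserves the uniform logging interface.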
…impl. Also: 1. Updated the conservative loss of discrete CQL to be captured including the alpha multiplication, to align with continuous CQL. 2. Updated the critic loss of DDPG and continuous CQL to use dataclasses, aligning with DQN and discrete CQL.
@joshuaspear Thank you for continuing this, but I'm seeing a weirdly large number of changes in your diff now. It could be easier to close this PR and make a new one based on the latest master to resolve this.
@takuseno makes sense :) Will have a crack at it next week
Implementing logging for the conservative loss component of continuous CQL. Also altered the logging of the loss in the discrete model to include the value of $\alpha$.
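The alignment described above can be sketched as follows. This is a hedged illustration, not the PR's actual code: the function name and metric keys are hypothetical, and the point is simply that the logged conservative term is the $\alpha$-scaled value, so discrete and continuous CQL report the same quantity.

```python
from typing import Dict, Tuple

def cql_critic_loss(
    td_loss: float, conservative_term: float, alpha: float
) -> Tuple[float, Dict[str, float]]:
    # Scale the conservative penalty by alpha BEFORE logging, so the
    # recorded metric matches the value actually added to the total loss.
    conservative_loss = alpha * conservative_term
    total = td_loss + conservative_loss
    metrics = {"loss": total, "conservative_loss": conservative_loss}
    return total, metrics

total, metrics = cql_critic_loss(td_loss=1.0, conservative_term=0.5, alpha=2.0)
print(total, metrics)  # 2.0 {'loss': 2.0, 'conservative_loss': 1.0}
```

Logging the unscaled term instead would make the discrete and continuous variants report incomparable numbers whenever $\alpha \neq 1$.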