[BugFix] Patch SAC to allow state_dict manipulation before exec #1607
Conversation
qvalue_network=value,
action_spec=UnboundedContinuousTensorSpec(shape=(2,)),
)
state = loss.state_dict()
Maybe let's add a forward call, or an access to the entropy, before saving.
Then we're sure it is instantiated, and the test loses its value, no?
What we want to test is the case where the buffer has been instantiated in the saved loss and must then be loaded into a new loss that has just been initialized (see the sketch below).
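A minimal sketch of that scenario, reusing the toy modules from the repro later in this thread (the make_loss helper and the module shapes are illustrative assumptions, not the actual test code):

import torch
from tensordict.nn import TensorDictModule
from torchrl.data import UnboundedContinuousTensorSpec
from torchrl.modules import ProbabilisticActor, TanhDelta, ValueOperator
from torchrl.objectives import SACLoss

def make_loss():
    actor_module = TensorDictModule(
        torch.nn.Linear(3, 4), in_keys=["obs"], out_keys=["logits"]
    )
    policy = ProbabilisticActor(
        module=actor_module,
        in_keys=["logits"],
        out_keys=["action"],
        distribution_class=TanhDelta,
    )
    value = ValueOperator(
        module=torch.nn.Linear(3, 1), in_keys=["obs"], out_keys=["value"]
    )
    return SACLoss(
        actor_network=policy,
        qvalue_network=value,
        action_spec=UnboundedContinuousTensorSpec(shape=(2,)),
    )

loss = make_loss()
_ = loss.target_entropy            # instantiate the lazy buffer in the saved loss
state = loss.state_dict()

fresh_loss = make_loss()           # freshly init, buffer not instantiated yet
fresh_loss.load_state_dict(state)  # must succeed: this is what the patch enables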
raise RuntimeError(
    "Cannot infer the dimensionality of the action. Consider providing "
    "the target entropy explicitly or providing the spec of the "
    "action tensor in the actor network."
There is an interesting tangential bug related to this. To see it, try:
import torch
from tensordict.nn import TensorDictModule
from torchrl.modules import ProbabilisticActor, TanhDelta, ValueOperator
from torchrl.objectives import SACLoss

if __name__ == "__main__":
    model = torch.nn.Linear(1, 1)
    actor_module = TensorDictModule(
        torch.nn.Linear(3, 4), in_keys=["obs"], out_keys=["logits"]
    )
    policy = ProbabilisticActor(
        module=actor_module,
        in_keys=["logits"],
        out_keys=["action"],
        distribution_class=TanhDelta,
    )
    value = ValueOperator(module=model, in_keys=["obs"], out_keys=["value"])
    loss = SACLoss(
        actor_network=policy,
        qvalue_network=value,
        # deliberately not passing the spec:
        # action_spec=UnboundedContinuousTensorSpec(shape=(2,)),
    )
    _ = loss.target_entropy
Running this will trigger the error above, but the user will instead see:
File "/Users/matbet/PycharmProjects/rl/prova.py", line 29, in <module>
loss.target_entropy
File "/Users/matbet/PycharmProjects/rl/torchrl/objectives/common.py", line 348, in __getattr__
return super().__getattr__(item)
File "/Users/matbet/miniconda3/envs/torchrl/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1614, in __getattr__
raise AttributeError("'{}' object has no attribute '{}'".format(
AttributeError: 'SACLoss' object has no attribute 'target_entropy'
So the message here is never shown. This is valid for all @property methods. Isn't this curious?
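A minimal sketch of the likely masking mechanism, simplified to a plain torch.nn.Module (torchrl's LossModule.__getattr__ delegates to it, per the traceback above); the Demo class and missing_spec attribute are hypothetical:

import torch

class Demo(torch.nn.Module):
    @property
    def target_entropy(self):
        # this inner lookup fails with AttributeError ...
        return self.missing_spec

demo = Demo()
demo.target_entropy
# ... but an AttributeError escaping a property getter makes Python fall
# back to __getattr__, so the user only sees the generic message:
# AttributeError: 'Demo' object has no attribute 'target_entropy'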
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The __getattr__ of torch.nn.Module isn't its greatest feature! Let me think of a fix...
The same also extends to EnvBase and other components, so the scope of this might be outside this PR.
LGTM, thanks a lot.
The example in #1594 still fails; to test that, you can either use that script or do a forward pass before saving the loss.
Co-authored-by: Matteo Bettini <55539777+matteobettini@users.noreply.github.com>
Good catch, I had forgotten an "_" in the delezify.
Good to go for me; let's maybe add the example in #1594 to the tests.
Already done :)
Description
Closes #1594