[BugFix] Fix missing ("next", "observation") key in dispatch of losses #1235
Conversation
LGTM
see my minor comment
torchrl/objectives/a2c.py
Outdated
  >>> loss_val
- (tensor(1.7593, grad_fn=<MeanBackward0>), tensor(0.2344, grad_fn=<MeanBackward0>), tensor(1.5480), tensor(-0.0155, grad_fn=<MulBackward0>))
+ (tensor(4.3483, grad_fn=<MeanBackward0>), tensor(1.4114, grad_fn=<MeanBackward0>), tensor(2.5165), tensor(-0.0252, grad_fn=<MulBackward0>))
maybe we don't want to print these values (such that we don't pollute PRs with diffs that aren't relevant?)
Good point, I will just remove it.
torchrl/objectives/iql.py
Outdated
  ...     next_reward=torch.randn(*batch, 1))
  >>> loss_val
- (tensor(1.4535, grad_fn=<MeanBackward0>), tensor(0.8389, grad_fn=<MeanBackward0>), tensor(0.3406, grad_fn=<MeanBackward0>), tensor(3.3441))
+ (tensor(1.4535, grad_fn=<MeanBackward0>), tensor(0.7506, grad_fn=<MeanBackward0>), tensor(0.3406, grad_fn=<MeanBackward0>), tensor(3.3441))
Ditto
torchrl/objectives/a2c.py
Outdated
  >>> loss_val
- (tensor(4.3483, grad_fn=<MeanBackward0>), tensor(1.4114, grad_fn=<MeanBackward0>), tensor(2.5165), tensor(-0.0252, grad_fn=<MulBackward0>))
We can still print something :)
You could do
loss_actor, loss_val, *etc = fun_call(**kwargs)
loss_actor.backward() # for instance
anything that makes it feel like "this is a tensor"
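The suggested docstring pattern can be sketched as follows, assuming torch is available; `toy_loss_call` is a hypothetical stand-in for the loss module call in the docstring, not a TorchRL API:

```python
import torch

# Sketch of the reviewer's suggestion: unpack the returned losses and
# call .backward() on one of them, rather than printing raw tensor
# values that would churn in PR diffs. "toy_loss_call" is a
# hypothetical stand-in, not TorchRL's actual loss module.

def toy_loss_call():
    params = torch.ones(2, requires_grad=True)
    loss_actor = (params * 2).mean()   # a differentiable scalar
    loss_value = (params ** 2).sum()   # another differentiable scalar
    return loss_actor, loss_value, params

loss_actor, loss_value, params = toy_loss_call()
loss_actor.backward()  # demonstrates "this is a tensor" without printing values
assert params.grad is not None
```

Calling `.backward()` conveys that the outputs are differentiable tensors while keeping the docstring output stable across code changes.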
LGTM thanks!
Description
Fix missing ("next", "observation") key in the dispatch in_keys of the IQL, DDPG, and A2C loss modules.
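A minimal sketch of what the fix amounts to; the key layout and helper below are illustrative assumptions, not TorchRL's actual code. Dispatch maps keyword arguments onto a module's in_keys, with a nested key like ("next", "observation") corresponding to the keyword `next_observation`, so a missing entry means that keyword can never be routed in:

```python
# Hypothetical sketch of the bug and fix -- not TorchRL's real code.

def kwarg_name(key):
    # Flatten a nested key tuple into a dispatch keyword-argument name,
    # e.g. ("next", "observation") -> "next_observation".
    return "_".join(key) if isinstance(key, tuple) else key

# Before the fix: ("next", "observation") was absent, so the
# "next_observation" kwarg could not be dispatched.
in_keys_before = ["observation", "action", ("next", "reward"), ("next", "done")]
# After the fix: the key is present.
in_keys_after = in_keys_before + [("next", "observation")]

assert "next_observation" not in [kwarg_name(k) for k in in_keys_before]
assert "next_observation" in [kwarg_name(k) for k in in_keys_after]
```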
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an `x` in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!