[BugFix] DQN loss dispatch respect configured tensordict keys #1285

Blonck · 2023-06-15T06:57:04Z

Description

Dispatch of .forward of the DQN loss module respects configured tensordict keys.

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide (required)
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

vmoens

LGTM thanks for this
Do we want to cover the action key too in the tests?

vmoens · 2023-06-15T13:43:56Z

test/test_cost.py

-    def test_dqn_notensordict(self):
+    @pytest.mark.parametrize("observation_key", ["observation", "observation2"])
+    @pytest.mark.parametrize("reward_key", ["reward", "reward2"])
+    @pytest.mark.parametrize("done_key", ["done", "done2"])


do we skip the action key by design?

No it is intended.
Atm, the action key cannot really be configured since it is used in the constructor of the DQN loss via _find_action_space(action_space).

Either I need to remove the action key from the configurable keys or the following part of the constructor must be moved until .set_keys() is called:

if action_space is None: # infer from value net try: action_space = value_network.spec except AttributeError: # let's try with action_space then try: action_space = value_network.action_space except AttributeError: raise ValueError(self.ACTION_SPEC_ERROR) if action_space is None: warnings.warn( "action_space was not specified. DQNLoss will default to 'one-hot'." "This behaviour will be deprecated soon and a space will have to be passed." "Check the DQNLoss documentation to see how to pass the action space. " ) action_space = "one-hot" self.action_space = _find_action_space(action_space)

Got it thanks!

[BugFix] DQN loss dispatch respect configured tensordict keys

53035e5

Blonck added the bug Something isn't working label Jun 15, 2023

Blonck requested a review from vmoens June 15, 2023 06:57

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 15, 2023

vmoens approved these changes Jun 15, 2023

View reviewed changes

vmoens merged commit 2dbdec9 into pytorch:main Jun 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] DQN loss dispatch respect configured tensordict keys #1285

[BugFix] DQN loss dispatch respect configured tensordict keys #1285

Blonck commented Jun 15, 2023

vmoens left a comment

vmoens Jun 15, 2023

Blonck Jun 15, 2023

vmoens Jun 15, 2023

[BugFix] DQN loss dispatch respect configured tensordict keys #1285

[BugFix] DQN loss dispatch respect configured tensordict keys #1285

Conversation

Blonck commented Jun 15, 2023

Description

Types of changes

Checklist

vmoens left a comment

Choose a reason for hiding this comment

vmoens Jun 15, 2023

Choose a reason for hiding this comment

Blonck Jun 15, 2023

Choose a reason for hiding this comment

vmoens Jun 15, 2023

Choose a reason for hiding this comment