[BugFix] step_mdp nested keys #1339

Conversation
The tests look good, but I suspect this implementation of step_mdp will be drastically slower than the one we have now.
Given the number of issues users have opened lately about environment overhead compared to gym, I'm reluctant to accept any change to step_mdp that makes it slower than it already is.
Let's wait until we merge the two benchmark PRs (one adding a step_mdp benchmark, the other running benchmarks on PRs); it'll then be easier to iterate on this.
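For reference, a quick local sanity check of the overhead concern could look like this (a hedged sketch; the tensordict contents and sizes are made up, and the CI benchmark PRs mentioned above would be the authoritative measure):

```python
import timeit

import torch
from tensordict import TensorDict
from torchrl.envs.utils import step_mdp

# A toy rollout-style tensordict with the usual root/"next" layout.
td = TensorDict(
    {
        "action": torch.zeros(4, 1),
        "observation": torch.zeros(4, 3),
        "next": TensorDict(
            {
                "observation": torch.zeros(4, 3),
                "reward": torch.zeros(4, 1),
                "done": torch.zeros(4, 1, dtype=torch.bool),
            },
            batch_size=[4],
        ),
    },
    batch_size=[4],
)

# Time many calls to amortize noise; compare this branch against main.
print(timeit.timeit(lambda: step_mdp(td), number=10_000))
```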
torchrl/envs/utils.py
Outdated
```python
if isinstance(done_key, tuple) and len(done_key) == 1:
    done_key = done_key[0]
if isinstance(reward_key, tuple) and len(reward_key) == 1:
    reward_key = reward_key[0]
if isinstance(action_key, tuple) and len(action_key) == 1:
    action_key = action_key[0]
```
This is expensive; let's not do it if we can avoid it.
We could blend this into unravel_keys in tensordict (which is coded in C++), since it exists there for efficiency purposes.
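In pure Python, the unravelling being folded into tensordict would behave roughly like this (an illustrative sketch; `_unravel_key` is a hypothetical name, and the real implementation lives in tensordict):

```python
def _unravel_key(key):
    """Flatten nested key tuples; unwrap length-1 tuples to plain strings."""
    if isinstance(key, str):
        return key
    flat = []
    for part in key:
        part = _unravel_key(part)
        if isinstance(part, str):
            flat.append(part)
        else:
            flat.extend(part)
    return flat[0] if len(flat) == 1 else tuple(flat)


assert _unravel_key(("done",)) == "done"
assert _unravel_key(("next", ("agents", "reward"))) == ("next", "agents", "reward")
```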
torchrl/envs/utils.py
Outdated
```python
if not exclude_action:
    out._set(action_key, tensordict.get(action_key))
```
Here, if the action has to be kept, we're removing it and adding it back, which is expensive; let's avoid that.
We are not removing it; the earlier exclusion is on "next".
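A small sketch of why this holds (hedged; the tensordict contents are made up): `out` is built from the "next" sub-tensordict, so the root-level action was never in it, and setting it is a plain insertion rather than a remove-and-re-add:

```python
import torch
from tensordict import TensorDict

td = TensorDict(
    {
        "action": torch.zeros(3, 1),
        "next": TensorDict(
            {
                "observation": torch.zeros(3, 4),
                "reward": torch.zeros(3, 1),
                "done": torch.zeros(3, 1, dtype=torch.bool),
            },
            batch_size=[3],
        ),
    },
    batch_size=[3],
)

out = td.get("next").exclude("reward", "done")  # exclusion touches "next" only
assert "action" not in out.keys()               # the action was never there
out.set("action", td.get("action"))             # single insertion, no removal
```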
torchrl/envs/utils.py
Outdated
```diff
 out = tensordict.get("next").clone(False)
-excluded = set()
+excluded = {action_key}
 if exclude_done:
     excluded.add(done_key)
 if exclude_reward:
     excluded.add(reward_key)
 if len(excluded):
```
That condition will never be false anymore, so we'll always call exclude, which is expensive. Let's avoid that.
ok
torchrl/envs/utils.py
Outdated
```python
# out.update(tensordict.select(*td_keys))
for key in td_keys:
    out._set(key, tensordict.get(key))
excluded = set.union(excluded, set(out.keys(True, True)))
```
Building a set over all the keys is super expensive; I'd avoid it if we can.
torchrl/envs/utils.py
Outdated
```python
td_next = tensordict.get("next")

td_keys = td.keys(True, True)
td_next_keys = td_next.keys(True, True)
```
This is the only time we traverse.
Basically we just visit every key in the tensordict.
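As an illustration (a hedged sketch with made-up contents), `keys(True, True)` is `keys(include_nested=True, leaves_only=True)`: a single traversal that yields every leaf key, with nested ones as tuples:

```python
import torch
from tensordict import TensorDict

td = TensorDict(
    {"a": torch.zeros(2), "b": TensorDict({"c": torch.zeros(2)}, batch_size=[2])},
    batch_size=[2],
)
print(list(td.keys(True, True)))  # e.g. ['a', ('b', 'c')]
```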
torchrl/envs/utils.py
Outdated
```python
# Set the keys from root
if not exclude_action:
    _set_key(dest=out, source=tensordict, key=action_key)
```
Here we handle the action separately just to avoid another if later; performance is the same.
I can see a version of that working, but the way we set the batch size requires multiple accesses to nested tensordicts, which will be time consuming.
If you merge main into this branch you'll get the timing measurements for your solution.
torchrl/envs/utils.py
Outdated
```python
td = tensordict.exclude("next")
td_next = tensordict.get("next")
```
use pop
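That is, something along these lines (a hedged sketch with made-up contents; note that unlike exclude, pop mutates the input tensordict in place):

```python
import torch
from tensordict import TensorDict

tensordict = TensorDict(
    {
        "action": torch.zeros(3, 1),
        "next": TensorDict({"observation": torch.zeros(3, 4)}, batch_size=[3]),
    },
    batch_size=[3],
)

# One traversal instead of two: pop removes "next" and returns it.
td_next = tensordict.pop("next")
td = tensordict
assert "next" not in td.keys()
```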
torchrl/envs/utils.py
Outdated
```python
excluded = {
    done_key if exclude_done else None,
    reward_key if exclude_reward else None,
}
```
Isn't it weird to have None in the set?
Should we not build it iteratively?
I can do it. I thought performance was similar and this was more readable, but I'll change it.
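The iterative construction being suggested would look something like this (a sketch; the key values and flags are stand-ins):

```python
done_key, reward_key = "done", "reward"     # stand-in key values
exclude_done, exclude_reward = True, False  # stand-in flags

excluded = set()
if exclude_done:
    excluded.add(done_key)
if exclude_reward:
    excluded.add(reward_key)

assert None not in excluded  # no placeholder entries to filter out later
```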
torchrl/envs/utils.py
Outdated
```python
if isinstance(key, tuple) and len(key) > 1:  # Setting the batch_sizes
    for k in range(1, len(key)):
        dest[key[:k]].batch_size = source[key[:k]].batch_size
```
If two keys share the same root, we'll set the batch size multiple times, which is time consuming.
Yeah, I know; I'll improve it.
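One way to avoid the repeated assignments (a hedged sketch with a hypothetical helper, not the final fix) is to collect each distinct prefix once before setting any batch size:

```python
def _nested_prefixes(keys):
    """Collect every distinct nested-tensordict prefix exactly once."""
    prefixes = set()
    for key in keys:
        if isinstance(key, tuple) and len(key) > 1:
            for k in range(1, len(key)):
                prefixes.add(key[:k])
    return prefixes


keys = [("agents", "action"), ("agents", "info", "mask"), "done"]
print(_nested_prefixes(keys))
# {('agents',), ('agents', 'info')} -- ('agents',) is visited once, not twice
```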
LGTM