[BugFix] Support for tensor collection in the `PPOLoss` #2543

priba · 2024-11-06T11:44:18Z

Description

The function `get_entropy_bonus` did not unpack the entropy when it was returned as a TensorDict.

Motivation and Context

When using a `CompositeDistribution`, the entropy can be returned as either a Tensor or a TensorDict, depending on `aggregate_probabilities`. This breaks `get_entropy_bonus`, which expects a Tensor. We can unpack it by accessing the entropy key.
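The unpacking step can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the helper name `unpack_entropy`, the default key `"entropy"`, and the coefficient are hypothetical, and a plain `dict` and `float` stand in for a TensorDict and a Tensor so the snippet runs without extra dependencies.

```python
from collections.abc import Mapping


def unpack_entropy(entropy, entropy_key="entropy"):
    """Return a bare entropy value from either a plain value or a keyed collection.

    Hypothetical helper: a dict stands in for a TensorDict here, and
    `entropy_key` is an assumed name for the entry holding the entropy.
    """
    if isinstance(entropy, Mapping):
        # Collection case: pick the entropy entry out by its key.
        return entropy[entropy_key]
    # Plain case: the value is already usable as-is.
    return entropy


def entropy_bonus(entropy, coef=0.5):
    """Scale the unpacked entropy by an (assumed) coefficient."""
    return coef * unpack_entropy(entropy)


# Both return shapes now yield the same bonus.
plain = 2.0
collection = {"entropy": 2.0, "sample_log_prob": -0.25}
print(entropy_bonus(plain), entropy_bonus(collection))
```

With this shape, the caller no longer needs to know which of the two return types the distribution produced.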

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

pytorch-bot · 2024-11-06T11:44:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2543

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 14 Unrelated Failures

As of commit 1f33881 with merge base 997d90e:

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
##[error]Workflow failed! Resource not accessible by integration
Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
##[error]Workflow failed! Resource not accessible by integration
Habitat Tests on Linux / tests (3.9, 12.1) / linux-job (gh)
RuntimeError: Command docker exec -t 7f5bd5831ffda8e0217d405a3418e02cc08f1f0a6a5c48af42c4fba46440c559 /exec failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cpu (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda11_8 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_1 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_4 (gh) (trunk failure)
##[error]Unable to find any artifacts for the associated workflow
Unit-tests on Linux / tests-cpu (3.10) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-cpu (3.11) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-cpu (3.12) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-cpu (3.9) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-cpu-oldget (3.12) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-gpu (3.11, 12.1) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_spec_shape_inplace_correction
Unit-tests on Linux / tests-optdeps (3.11, 12.1) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_spec_shape_inplace_correction
Unit-tests on Linux / tests-stable-gpu (3.10, 11.8) / linux-job (gh) (trunk failure)
test/test_transforms.py::TestTensorDictPrimer::test_tensordictprimer_batching[False-SerialEnv]
Unit-tests on Windows / unittests-cpu / windows-job (gh) (trunk failure)
##[error]Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens

LGTM thanks for this
@albertbou92

Pau Riba added 3 commits November 6, 2024 11:20

Add test and entropy key for TensorDict distributions

4cc9ab0

Use the build in function

ef794c7

No need to unsqueeze

1f33881

facebook-github-bot added the CLA Signed label Nov 6, 2024

vmoens added the bug label Nov 6, 2024

vmoens approved these changes Nov 6, 2024

vmoens merged commit 0eabb78 into pytorch:main Nov 6, 2024
62 of 75 checks passed

vmoens pushed a commit that referenced this pull request Nov 14, 2024

[BugFix] Support for tensor collection in the PPOLoss (#2543)

605c0fe

Co-authored-by: Pau Riba <pau.riba@helsing.ai> (cherry picked from commit 0eabb78)
