[Algorithm] QMixer loss and multiagent models #1378

matteobettini · 2023-07-11T10:08:31Z

No description provided.

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini · 2023-07-11T12:54:55Z

@Acciorocketships it would be nice to add a QGNN mixer in modules/models/multiagent.py.
implementing the mixer interface

Acciorocketships · 2023-07-11T13:16:36Z

What about the QGNN model? If that's not already implemented, I would prioritise that. It has a much larger impact on performance.

matteobettini · 2023-07-11T13:21:36Z

Yeah that is just a normal gnn right? I am thinking to make a MultiagentGNN as i made the MLP.
So that it can be used also in MADDPG and so on.

I can take care of that but the mixer is up to you

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini · 2023-07-11T14:50:56Z

Ready for review @vmoens

torchrl/objectives/__init__.py

torchrl/objectives/multiagent/qmixer.py

torchrl/modules/models/multiagent.py

matteobettini · 2023-07-11T18:39:16Z

Thanks a lot! Will fix.

Most of the comments on the new loss are on parts that have been copy-pasted from DQN.
Should i modify that too when I fix those?

vmoens · 2023-07-11T18:41:30Z

Thanks a lot! Will fix.

Most of the comments on the new loss are on parts that have been copy-pasted from DQN. Should i modify that too when I fix those?

You mean the loss_function default? Yeah sure makes sense

matteobettini · 2023-07-11T18:43:21Z

Not only the default, basically all of the comments.
I am just saying that most of the code of qmixerloss comes from dqnloss.

So your review comments apply to dqnloss too.
I'll modify that too when i fix them

vmoens · 2023-07-11T18:46:15Z

Oh i see what you mean
Better to fix them but not mandatory. I guess my point was that it's better to fix things before merging than after, but that does not mean that you need to take care of everything in this PR.
Those changes should not be too hard to implement though. Careful with bc-breaking changes, eg the device of DQN should be discouraged but not taken away all at once

Signed-off-by: Matteo Bettini <matbet@meta.com>

torchrl/objectives/dqn.py

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini · 2023-07-13T08:07:07Z

torchrl/objectives/dqn.py

-    @in_keys.setter
-    def in_keys(self, values):
-        self._in_keys = values
-


this change you requested makes it bc-breaking

do you want me to still allow it but warn?

no if tests pass without needing it we're good without it

i thin they pass

vmoens · 2023-07-13T15:48:01Z

Tests are failing on GPU tests:

test/test_cost.py::TestQMixer::test_qmix_batcher[categorical-device0-True-2] FAILED [  4%]
___________ TestQMixer.test_qmix_batcher[categorical-device0-True-2] ___________
Traceback (most recent call last):
  File "/work/env/lib/python3.9/site-packages/tensordict/nn/common.py", line 1178, in forward
    raise err
  File "/work/env/lib/python3.9/site-packages/tensordict/nn/common.py", line 1164, in forward
    tensors = self._call_module(tensors, **kwargs)
  File "/work/env/lib/python3.9/site-packages/tensordict/nn/common.py", line 1121, in _call_module
    out = self.module(*tensors, **kwargs)
  File "/work/env/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1522, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "/work/env/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1531, in _call_impl
    return forward_call(*args, **kwargs)
  File "/work/env/lib/python3.9/site-packages/tensordict/nn/functional_modules.py", line 567, in new_fun
    return getattr(type(self), fun_name)(self, *args, **kwargs)
  File "/work/env/lib/python3.9/site-packages/torch/nn/modules/linear.py", line 114, in forward
    return F.linear(input, self.weight, self.bias)
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument mat2 in method wrapper_CUDA_mm)

matteobettini · 2023-07-13T15:48:17Z

oh i am seeing now that some are failing

Signed-off-by: Matteo Bettini <matbet@meta.com>

vmoens · 2023-07-13T21:07:52Z

There is still this to be fixed

test/test_cost.py::TestQMixer::test_qmix_batcher[categorical-device0-False-3] - assert not True

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini · 2023-07-14T08:04:21Z

There is still this to be fixed

Thanks a lot and sorry for not noticing that.

For the future, can you explain to me where do you see such failures?
I was opening the gpu tests pipelines but I did not see that

Signed-off-by: Matteo Bettini <matbet@meta.com>

vmoens

Great work! Woohoo!

init

ee77a11

Signed-off-by: Matteo Bettini <matbet@meta.com>

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 11, 2023

matteobettini mentioned this pull request Jul 11, 2023

[Example] Multiagent examples: MAPPO-IPPO-MADDPG-IDDPG-IQL-QMIX-VDN #1027

Merged

tests

42e6bb2

Signed-off-by: Matteo Bettini <matbet@meta.com>

vmoens added enhancement New feature or request new algo New algorithm request or PR labels Jul 11, 2023

vmoens changed the title ~~[MARL] QMixer loss and multiagent models~~ [Algorithm] QMixer loss and multiagent models Jul 11, 2023

docs

2df23a4

Signed-off-by: Matteo Bettini <matbet@meta.com>

Merge branch 'main' into marl_losses_modules

e4773ad

vmoens reviewed Jul 11, 2023

View reviewed changes

matteobettini added 2 commits July 12, 2023 08:26

import

010bccf

Signed-off-by: Matteo Bettini <matbet@meta.com>

amend

e678547

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini commented Jul 12, 2023

View reviewed changes

torchrl/objectives/dqn.py Show resolved Hide resolved

matteobettini added 7 commits July 12, 2023 10:01

docs

f6047bf

Signed-off-by: Matteo Bettini <matbet@meta.com>

docs

b0b4d22

Signed-off-by: Matteo Bettini <matbet@meta.com>

tests mlp

073a44d

Signed-off-by: Matteo Bettini <matbet@meta.com>

tests mlp

fb4c059

Signed-off-by: Matteo Bettini <matbet@meta.com>

tests mixers

d8b28c5

Signed-off-by: Matteo Bettini <matbet@meta.com>

device dqn

9a7b216

Signed-off-by: Matteo Bettini <matbet@meta.com>

remove setter

ed7e5c5

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini commented Jul 13, 2023

View reviewed changes

fix test gpu

01f57a7

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini added 9 commits July 14, 2023 08:34

fix

3f6a35d

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

3931f7f

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

41129de

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

b8fd015

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

66db371

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

eb2e441

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

c61bd07

Signed-off-by: Matteo Bettini <matbet@meta.com>

temp

b94d233

Signed-off-by: Matteo Bettini <matbet@meta.com>

fix

e4705db

Signed-off-by: Matteo Bettini <matbet@meta.com>

matteobettini added 5 commits July 14, 2023 10:08

amend

9b53b93

Signed-off-by: Matteo Bettini <matbet@meta.com>

amend

addc611

Signed-off-by: Matteo Bettini <matbet@meta.com>

amend

137aa0d

Signed-off-by: Matteo Bettini <matbet@meta.com>

docs

b15a517

Signed-off-by: Matteo Bettini <matbet@meta.com>

docs

fd21478

Signed-off-by: Matteo Bettini <matbet@meta.com>

vmoens approved these changes Jul 14, 2023

View reviewed changes

vmoens merged commit 574dbf1 into pytorch:main Jul 14, 2023

matteobettini deleted the marl_losses_modules branch July 14, 2023 12:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Algorithm] QMixer loss and multiagent models #1378

[Algorithm] QMixer loss and multiagent models #1378

matteobettini commented Jul 11, 2023 •

edited

Loading

matteobettini commented Jul 11, 2023

Acciorocketships commented Jul 11, 2023

matteobettini commented Jul 11, 2023 •

edited

Loading

matteobettini commented Jul 11, 2023

matteobettini commented Jul 11, 2023

vmoens commented Jul 11, 2023

matteobettini commented Jul 11, 2023 •

edited

Loading

vmoens commented Jul 11, 2023

matteobettini Jul 13, 2023

matteobettini Jul 13, 2023

vmoens Jul 13, 2023

matteobettini Jul 13, 2023

vmoens commented Jul 13, 2023

matteobettini commented Jul 13, 2023

vmoens commented Jul 13, 2023

matteobettini commented Jul 14, 2023

vmoens left a comment

[Algorithm] QMixer loss and multiagent models #1378

[Algorithm] QMixer loss and multiagent models #1378

Conversation

matteobettini commented Jul 11, 2023 • edited Loading

matteobettini commented Jul 11, 2023

Acciorocketships commented Jul 11, 2023

matteobettini commented Jul 11, 2023 • edited Loading

matteobettini commented Jul 11, 2023

matteobettini commented Jul 11, 2023

vmoens commented Jul 11, 2023

matteobettini commented Jul 11, 2023 • edited Loading

vmoens commented Jul 11, 2023

matteobettini Jul 13, 2023

Choose a reason for hiding this comment

matteobettini Jul 13, 2023

Choose a reason for hiding this comment

vmoens Jul 13, 2023

Choose a reason for hiding this comment

matteobettini Jul 13, 2023

Choose a reason for hiding this comment

vmoens commented Jul 13, 2023

matteobettini commented Jul 13, 2023

vmoens commented Jul 13, 2023

matteobettini commented Jul 14, 2023

vmoens left a comment

Choose a reason for hiding this comment

matteobettini commented Jul 11, 2023 •

edited

Loading

matteobettini commented Jul 11, 2023 •

edited

Loading

matteobettini commented Jul 11, 2023 •

edited

Loading