Skip to content

Commit

Permalink
[Algorithm] QMixer loss and multiagent models (pytorch#1378)
Browse files Browse the repository at this point in the history
Signed-off-by: Matteo Bettini <matbet@meta.com>
  • Loading branch information
matteobettini authored Jul 14, 2023
1 parent 9c95e1d commit 574dbf1
Show file tree
Hide file tree
Showing 15 changed files with 1,691 additions and 40 deletions.
14 changes: 14 additions & 0 deletions docs/source/reference/modules.rst
Original file line number Diff line number Diff line change
Expand Up @@ -335,6 +335,20 @@ algorithms, such as DQN, DDPG or Dreamer.
RSSMPrior
RSSMPosterior

Multi-agent-specific modules
~~~~~~~~~~~~~~~~~~~~~~~~~~~

These networks implement models that can be used in
multi-agent contexts.

.. autosummary::
:toctree: generated/
:template: rl_template_noinherit.rst

MultiAgentMLP
QMixer
VDNMixer


Exploration
-----------
Expand Down
15 changes: 15 additions & 0 deletions docs/source/reference/objectives.rst
Original file line number Diff line number Diff line change
Expand Up @@ -185,6 +185,21 @@ Dreamer
DreamerModelLoss
DreamerValueLoss

Multi-agent objectives
----------------------
.. currentmodule:: torchrl.objectives.multiagent

These objectives are specific to multi-agent algorithms.

QMixer
~~~~~~

.. autosummary::
:toctree: generated/
:template: rl_template_noinherit.rst

QMixerLoss


Returns
-------
Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -235,6 +235,7 @@ def _main(argv):
"checkpointing": [
"torchsnapshot",
],
"marl": ["vmas"],
},
zip_safe=False,
classifiers=[
Expand All @@ -254,5 +255,4 @@ def _main(argv):


if __name__ == "__main__":

_main(sys.argv[1:])
Loading

0 comments on commit 574dbf1

Please sign in to comment.