We include a non-exhaustive number of examples, showing common use-cases for Mava. We also have a Quickstart notebook that can be used to quickly create and train your first Multi-Agent System.
We include a number of systems running on continuous control tasks.
-
MADDPG: a MADDPG system running on the continuous action space simple_spread MPE environment.
-
Feedforward:
-
Decentralised
- decentralised
- decentralised record agents (recording agents acting in the environment)
- decentralised executor scaling (scaling to 4 executors)
- decentralised multiple trainers(using multiple trainers)
- decentralised custom loggers (using custom logging)
- decentralised lr scheduling(using lr schedule)
- decentralised evaluator intervals(running the evaluation loop at intervals)
-
centralised , networked (using a fully-connected, networked architecture), networked with custom architecture (using a custom, sparse, networked architecture) and state_based.
-
-
Recurrent
- decentralised and state_based.
-
-
MAD4PG: a MAD4PG system running on the continuous action space simple_spread MPE environment.
- Feedforward
- Recurrent
-
MADDPG: a MADDPG system running on the Multiwalker environment.
- Feedforward
- decentralised and centralised.
- Recurrent
- Feedforward
-
MAD4PG: a MAD4PG system running on the Multiwalker environment.
- Feedforward
- decentralised
- decentralised record agents (recording agents acting in the environment).
- Feedforward
-
MAPPO
- Feedforward
- MAD4PG:
a MAD4PG system running on the RoboCup environment.
- Recurrent
We also include a number of systems running on discrete action space environments.
-
MAPPO: a MAPPO system running on the discrete action space simple_spread MPE environment.
- Feedforward
- decentralised and centralised.
- Feedforward
-
MADQN: a MADQN system running on the discrete action space simple_spread MPE environment.
- Feedforward
- Decentralised
- decentralised
- decentralised lr scheduling (using lr schedule)
- decentralised custom lr scheduling (using custom lr schedule)
- decentralised custom epsilon decay scheduling (using configurable epsilon scheduling).
- Decentralised
- Recurrent
- Feedforward
-
VDN: a VDN system running on the discrete action space simple_spread MPE environment.
- Recurrent
-
MADQN: a MADQN system running on the two-player competitive Atari Pong environment.
- Recurrent
-
MAPPO: a MAPPO system running on two-player cooperative Atari Pong.
- feedforward
-
MADDPG: a MADDPG system running on the Simple Speaker Listener environment.
- Feedforward
-
MADDPG: a MADDPG system running on the Simple Spread environment.
- Feedforward
-
MADQN: a MADQN system running on the SMAC environment.
- Feedforward
- Recurrent
-
QMIX: a QMIX system running on the SMAC environment.
- Recurrent
-
VDN: a VDN system running on the SMAC environment.
- Recurrent
- MADQN:
a MADQN system running on the OpenSpiel environment.
- Feedforward