Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge develop into main. #232

Merged
merged 255 commits into from
Jun 9, 2021
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
255 commits
Select commit Hold shift + click to select a range
e148db4
documentation: add instructions for SCII env.
sgrimbly May 20, 2021
8287e30
feat: update dial trainer
KevinEloff May 20, 2021
af0f46f
fix: target network
KevinEloff May 20, 2021
5d9e0b0
feature: Save progress.
DriesSmit May 21, 2021
c81910c
Merge branch 'main' into feature/starcraft-wrapper
sgrimbly May 21, 2021
518b5be
feature: Save changes.
DriesSmit May 21, 2021
c1275df
Merge branch 'main' into feature/starcraft-wrapper
sgrimbly May 21, 2021
9113abf
fix: add SCII envs to tests.
sgrimbly May 21, 2021
abaafc4
Merge branch 'main' into feature/starcraft-wrapper
sgrimbly May 22, 2021
3379871
update ci to run on dev branch
arnupretorius May 22, 2021
54241e2
feature: Add working discrete action space examples of MA-DDPG and MA…
DriesSmit May 22, 2021
787f571
feature: Add comment.
DriesSmit May 22, 2021
4d8fdf9
meage: Merge main.
DriesSmit May 22, 2021
5a27a67
feat: comm madqn
KevinEloff May 22, 2021
6a624de
fix: comm module args
KevinEloff May 22, 2021
13bea90
feat: update dial trainer
KevinEloff May 22, 2021
cb9cd02
Merge branch 'main' into feature/dial-training
KevinEloff May 22, 2021
ffa59f0
fix: update fingerprint
KevinEloff May 22, 2021
957fa9a
feat: general comm network
KevinEloff May 22, 2021
5fc632b
update dockerfile to install sc2
arnupretorius May 22, 2021
77cb037
add 3rd party libraries to be ignored
arnupretorius May 22, 2021
70b1e91
remove smac from setup
arnupretorius May 22, 2021
4555a5c
add mac stuff to be ignored
arnupretorius May 22, 2021
0aeceba
Minor changes to pz wrapper
arnupretorius May 22, 2021
0902e39
feat: environment wrapper for sc2
arnupretorius May 22, 2021
894320d
env factory for sc2
arnupretorius May 22, 2021
8714f33
have mypy ignore smace
arnupretorius May 22, 2021
97b6e28
add install script for sc2
arnupretorius May 22, 2021
14c5c33
feature: Update discrete and continuous MA-DDPG and MA-D4PG to each u…
DriesSmit May 22, 2021
0bcd332
feat: dial prison learning
KevinEloff May 22, 2021
ad37b1d
fix mypy issues
arnupretorius May 22, 2021
47fed0a
change name of wrapper to smac
arnupretorius May 22, 2021
8bc9295
feat: use dial executor
KevinEloff May 22, 2021
2c7af20
feat: update env
KevinEloff May 22, 2021
fb91570
feat: message masking
KevinEloff May 22, 2021
6cb2ede
fix: mypy.
DriesSmit May 22, 2021
2b3e1c4
fix: Implement recurrent discrete/continuous MA-DDPG and MA-D4PG.
DriesSmit May 23, 2021
8eea971
example of running madqn on starcraft environment
arnupretorius May 23, 2021
1353dc6
Merge branch 'main' of https://github.com/instadeepai/Mava into featu…
arnupretorius May 23, 2021
15ae063
Merge branch 'feature/starcraft-wrapper' of https://github.com/instad…
arnupretorius May 23, 2021
8b299af
fix mypy issues
arnupretorius May 23, 2021
e0ef40d
update readme and workflow for sc2
arnupretorius May 23, 2021
7651e55
update yaml for smac install
arnupretorius May 23, 2021
df38243
change versions of python to test to only include 3.7 and 3.8
arnupretorius May 23, 2021
cc99ecf
fix: Small fixes.
DriesSmit May 23, 2021
1901a56
Minor changes
arnupretorius May 23, 2021
2d5edae
Merge branch 'feature/starcraft-wrapper' of https://github.com/instad…
arnupretorius May 23, 2021
461f810
test ci without setuptools
arnupretorius May 23, 2021
d00a0e9
:
arnupretorius May 23, 2021
3e70ef2
small updates to ci and dockerfile
arnupretorius May 23, 2021
2ee7b8c
small update to ci
arnupretorius May 23, 2021
897d2c3
fix: Added baseline cost weighting param to critic loss calc.
KaleabTessera May 23, 2021
a83281d
Merge branch 'develop' into feature/starcraft-wrapper
arnupretorius May 23, 2021
9a3e2b9
Merge branch 'develop' into feature/discrete_maddpg_mad4pg
arnupretorius May 23, 2021
6260a2c
fix: small update to ci to fix checks failing
arnupretorius May 23, 2021
df6fd9f
Merge pull request #203 from instadeepai/fix/ci-error-with-enum
arnupretorius May 23, 2021
0729a24
Merge branch 'develop' into bugfix/use-baseline-cost-critic-loss
arnupretorius May 23, 2021
457dedf
Merge pull request #202 from instadeepai/bugfix/use-baseline-cost-cri…
arnupretorius May 23, 2021
4ef83a1
Merge branch 'develop' into feature/discrete_maddpg_mad4pg
arnupretorius May 23, 2021
cf68fb5
Merge pull request #199 from instadeepai/feature/discrete_maddpg_mad4pg
arnupretorius May 23, 2021
15367a2
fix: Fix legal_actions dtype.
DriesSmit May 23, 2021
4a59987
merge: Merge develop.
DriesSmit May 23, 2021
739a546
fix: Passing none values to target update rate and passing these para…
KaleabTessera May 23, 2021
c1d16c7
fix: made type explicit.
KaleabTessera May 23, 2021
1301f35
Merge pull request #204 from instadeepai/bugfix/target_update_rate-pa…
KaleabTessera May 23, 2021
89d4c0b
Merge branch 'develop' into feature/starcraft-wrapper
arnupretorius May 24, 2021
3df2ab0
Update install_sc2.sh
arnupretorius May 24, 2021
39bd994
Remove commented lines of code in wrapper
arnupretorius May 24, 2021
659f08b
Remove chmod in instructions for sc2 install
arnupretorius May 24, 2021
b836b59
Merge pull request #194 from instadeepai/feature/starcraft-wrapper
arnupretorius May 24, 2021
3c9be70
feature: Add state-based maddpg example.
DriesSmit May 24, 2021
b64968f
Merge remote-tracking branch 'origin/develop' into feature/state-base…
DriesSmit May 24, 2021
8b955c0
feat: loss masking
KevinEloff May 24, 2021
e243276
refactoring
KevinEloff May 24, 2021
24968da
Merge branch 'develop' into feature/dial-training
KevinEloff May 24, 2021
f2f035c
fix: mypy
KevinEloff May 24, 2021
3e9875c
Update trainers.
DriesSmit May 24, 2021
e777089
feat: loss masking
KevinEloff May 24, 2021
6fec483
feat: channel capacity 1
KevinEloff May 24, 2021
2b9b0a6
feat: Split optims by agent type.
KaleabTessera May 24, 2021
6a3e16e
feat: Added discrete maddpg PZ example and set mappo to use diff optims.
KaleabTessera May 24, 2021
0e77534
chore: Updated example env.
KaleabTessera May 24, 2021
4cc601e
VDN works
jcformanek May 24, 2021
5ba25ff
fix: Work with shared weights false.
KaleabTessera May 24, 2021
01e3ff2
feat: Updated dqn, qmix and vdn trainers.
KaleabTessera May 24, 2021
fb6c50a
fix: Updated types.
KaleabTessera May 24, 2021
aaaca0f
fix: Nothing.
DriesSmit May 24, 2021
6749a61
debug/small changes
arnupretorius May 24, 2021
8cb8e45
Merge pull request #207 from instadeepai/feature/optims-by-agent-type
KaleabTessera May 24, 2021
0765401
fix: Small changes.
DriesSmit May 24, 2021
585763d
Merge branch 'develop' into feature/state-based-recurrent-maddpg
DriesSmit May 25, 2021
381490e
fix: Fix decentralised maddpg example.
DriesSmit May 25, 2021
744b46e
Merge branch 'feature/state-based-recurrent-maddpg' of github.com:ins…
DriesSmit May 25, 2021
6bc7bde
fix: mypy.
DriesSmit May 25, 2021
3ad1cf0
fix: Small fixes.
DriesSmit May 25, 2021
79addbe
make q net smaller
arnupretorius May 25, 2021
3df0196
cleanup
KevinEloff May 25, 2021
70ec527
Merge branch 'develop' into feature/dial-training
KevinEloff May 25, 2021
a7adc13
pr comments
KevinEloff May 25, 2021
cc69a31
fix: Update trainers and examples.
DriesSmit May 25, 2021
b008db5
feat: rename trainer functions
KevinEloff May 25, 2021
00fd8d5
Merge pull request #206 from instadeepai/feature/dial-training
arnupretorius May 25, 2021
f8ea27d
Merge remote-tracking branch 'origin/develop' into feature/fix-qmix
sgrimbly May 25, 2021
4489bb9
Merge branch 'develop' into feature/state-based-recurrent-maddpg
DriesSmit May 25, 2021
7965de2
fix: VDN and update files.
sgrimbly May 25, 2021
cbcae80
fix: VDN and update files.
sgrimbly May 25, 2021
edeeefd
fix: bug in VDN.
sgrimbly May 25, 2021
97ded76
feat: Made grad norm configurable for madqn and derivatives.
KaleabTessera May 25, 2021
75ea04f
Merge branch 'develop' into feature/maxgrad-norm-configure-madqn
KaleabTessera May 25, 2021
a1c0e7c
feat: Updated dial.
KaleabTessera May 25, 2021
61b893c
Merge pull request #210 from instadeepai/feature/maxgrad-norm-configu…
arnupretorius May 25, 2021
92863f3
fix: Fix centralised training bug.
DriesSmit May 25, 2021
9fa9579
Merge remote-tracking branch 'origin/develop' into feature/state-base…
DriesSmit May 25, 2021
56e0d99
fix: Small fixes.
DriesSmit May 25, 2021
2933b74
fix: Fix state based and recurrent versions.
DriesSmit May 25, 2021
dc4acf8
fix: get Qmix trainer loops running again.
sgrimbly May 25, 2021
43c67cb
fix: Small space fix.
DriesSmit May 25, 2021
e41baf0
fix: Fixes .keys() and .values() code in MA-DDPG trainers.
DriesSmit May 26, 2021
5722a7b
fix: Sort specs and keys inside Mava specs.
DriesSmit May 26, 2021
45397a8
fix: Remove comments.
DriesSmit May 26, 2021
733a7d5
fix: Rename MADDPG and MAD4PG trainers.
DriesSmit May 26, 2021
afb194a
fix: Re-added SMACEnvWrapper wrapper.
KaleabTessera May 26, 2021
e8693b8
feat: Catch exceptions thrown by rendering.
KaleabTessera May 26, 2021
358c9b7
Merge pull request #208 from instadeepai/feature/state-based-recurren…
arnupretorius May 26, 2021
c4cffac
Merge branch 'develop' into feature/fix-qmix
sgrimbly May 26, 2021
14eeba1
fix: restore new clipping method.
sgrimbly May 26, 2021
6c167d8
fix: Epsilon value in MADQNFeedForwardExecutor.
DriesSmit May 26, 2021
6822b0d
fix: Remove print.
DriesSmit May 26, 2021
17d2d9d
fix: Small fix.
DriesSmit May 26, 2021
87d1743
Merge pull request #213 from instadeepai/feature/feedforward-dqn-smac
DriesSmit May 26, 2021
ecfe63a
Merge branch 'develop' into feature/fix-qmix
sgrimbly May 26, 2021
f64ccef
fix: New version of acme breaks adders. Hardcoded acme version for now.
KaleabTessera May 26, 2021
dd27dd5
Merge branch 'develop' into bufgix/fix-acme-version
KaleabTessera May 26, 2021
f6db849
Merge pull request #215 from instadeepai/bufgix/fix-acme-version
DriesSmit May 27, 2021
071742c
feat: Example custom network spec.
KaleabTessera May 27, 2021
a0fd629
fix: Fix _get_dpg_feed inside MADDPGNetworkedTrainer.
DriesSmit May 27, 2021
5123782
fix: Fix _get_dpg_feed inside MADDPGNetworkedTrainer.
DriesSmit May 27, 2021
8af2d45
feature: Add assert to test if networked architecture is setup correc…
DriesSmit May 27, 2021
d9cd5db
Merge pull request #216 from instadeepai/feature/custom-network-example
arnupretorius May 27, 2021
fba8c91
fix: change trainer from VDN to QMIX
sgrimbly May 27, 2021
5693911
Merge branch 'develop' into feature/fix-qmix
sgrimbly May 27, 2021
2cb1346
feature: Add robocup docker setup to Mava code.
DriesSmit May 31, 2021
f9061d6
fix: In all trainers set Tensorflow to dynamically grow the gpu memor…
DriesSmit Jun 1, 2021
c38cb60
fix: Set render_game to False.
DriesSmit Jun 1, 2021
7b50ae1
Fix: Updated fixed agent.
DriesSmit Jun 1, 2021
ed61496
fix: Update set_growing_gpu_memory to loop over all GPU devices.
DriesSmit Jun 1, 2021
26a4853
Merge pull request #220 from instadeepai/fix/virtual_env_gpu_memory
arnupretorius Jun 1, 2021
6342c9a
add new logo image
arnupretorius Jun 1, 2021
f988a4b
readme overview updated
arnupretorius Jun 1, 2021
5683831
Merge branch 'develop' into feature/fix-qmix
sgrimbly Jun 1, 2021
4284061
add simpe spread debug env image
arnupretorius Jun 2, 2021
d5db55e
feat: try out live polling in readme.
arnupretorius Jun 2, 2021
1c34217
Update README.md
arnupretorius Jun 2, 2021
10683da
added progress images
arnupretorius Jun 2, 2021
5f34043
Merge branch 'feature/update-readme' of https://github.com/instadeepa…
arnupretorius Jun 2, 2021
7371e60
Update README.md
arnupretorius Jun 2, 2021
514aa9e
minor edits to readme
arnupretorius Jun 2, 2021
e91e02c
resolve merge conflicts
arnupretorius Jun 2, 2021
211a654
add debugging env results for discrete and cont. using ff nets.
arnupretorius Jun 2, 2021
fb75743
add pz multi-walker gif
arnupretorius Jun 2, 2021
00b77f5
Update README.md
arnupretorius Jun 3, 2021
cc5b0e1
Add table of contents to readme
arnupretorius Jun 3, 2021
a59e4be
Add link to launchpad and minor edits
arnupretorius Jun 3, 2021
eb0997f
debugging section and resize mava logo
arnupretorius Jun 3, 2021
35b0835
cropped version of mava logo
arnupretorius Jun 3, 2021
6b0c453
resize mava logo
arnupretorius Jun 3, 2021
e94d5f1
Add link to debug examples
arnupretorius Jun 3, 2021
9487c07
Add roadmap section with poll
arnupretorius Jun 3, 2021
9749fcc
add tf logo
arnupretorius Jun 3, 2021
260a550
update system readme
arnupretorius Jun 3, 2021
00e48fe
Minor edits to system readme
arnupretorius Jun 3, 2021
a3bdc92
update robocup link
arnupretorius Jun 3, 2021
89d3b67
fix broken debug env link
arnupretorius Jun 3, 2021
a5ba9b5
Fix broken link in examples readme
arnupretorius Jun 3, 2021
b1c27a8
fix spelling mistakes and remove unused links
arnupretorius Jun 3, 2021
7f2890d
Merge pull request #221 from instadeepai/feature/update-readme
arnupretorius Jun 3, 2021
87dd47d
update discrete results on readme
arnupretorius Jun 3, 2021
ba948a9
Merge pull request #222 from instadeepai/fix/readme
arnupretorius Jun 3, 2021
aeea578
Update license to apache to match code stubs
arnupretorius Jun 3, 2021
d9488c4
removed qmix v2
arnupretorius Jun 3, 2021
1d78da6
change additive network back to how it was
arnupretorius Jun 3, 2021
3253bad
update system and building to use mixing module instead of network
arnupretorius Jun 3, 2021
a9d3e0c
fix minor bug in qmix trainer
arnupretorius Jun 3, 2021
ee5236c
fix mypy issues.
arnupretorius Jun 3, 2021
ab31c0f
Merge branch 'develop' of https://github.com/instadeepai/Mava into fe…
arnupretorius Jun 3, 2021
a3fec9f
add vdn to discrete debug plot
arnupretorius Jun 3, 2021
e4b4ff0
Update VDN status to yellow
arnupretorius Jun 3, 2021
0ae79bd
resolve merge conflict
arnupretorius Jun 3, 2021
70d3ed7
Merge pull request #225 from instadeepai/feature/update-readme-vdn
arnupretorius Jun 4, 2021
6820f47
Update year and add placeholder
arnupretorius Jun 4, 2021
3f8754a
Merge branch 'develop' into fix/update-license
arnupretorius Jun 4, 2021
2fc5d87
Merge pull request #224 from instadeepai/fix/update-license
arnupretorius Jun 4, 2021
5224ada
Merge branch 'develop' of https://github.com/instadeepai/Mava into fe…
arnupretorius Jun 4, 2021
7ab51d0
clean up make file
arnupretorius Jun 4, 2021
a17c42d
Refactor qmix to use generic hypernetwork again
arnupretorius Jun 4, 2021
1a63a82
Remove communication module from qmix
arnupretorius Jun 4, 2021
3634524
minor mypy changes
arnupretorius Jun 4, 2021
cd9b6e0
Merge pull request #209 from instadeepai/feature/fix-qmix
arnupretorius Jun 4, 2021
be31129
Merge branch 'develop' into feature/robocup_environment
arnupretorius Jun 4, 2021
bd703fd
Merge branch 'develop' of https://github.com/instadeepai/Mava into fe…
arnupretorius Jun 4, 2021
f4b6cea
remove commented line in make file
arnupretorius Jun 4, 2021
704ba36
rename example
arnupretorius Jun 4, 2021
78fc3f4
clean agents file
arnupretorius Jun 4, 2021
d518364
cleaned fixed_agent file
arnupretorius Jun 4, 2021
5d700bf
cleaned nn_agents file
arnupretorius Jun 4, 2021
67c5597
update custom executor
arnupretorius Jun 4, 2021
478f39d
update copyright stubs
arnupretorius Jun 4, 2021
8735ba5
clean message parser
arnupretorius Jun 4, 2021
c640226
clean player world model
arnupretorius Jun 4, 2021
a3de0dc
clean trainer world model
arnupretorius Jun 4, 2021
2c6b506
clean trainer world model
arnupretorius Jun 4, 2021
230ac7e
clean robocup env
arnupretorius Jun 4, 2021
21db40d
add todo to move wrapper to wrapper folder
arnupretorius Jun 4, 2021
959043c
Merge branch 'feature/robocup_environment' of https://github.com/inst…
arnupretorius Jun 4, 2021
4890d44
fix mypy issues - todo need to check typing
arnupretorius Jun 4, 2021
4e11825
fix: Moved RoboCup files to their appropriate places in Mava.
DriesSmit Jun 4, 2021
0dbf259
feature: Update the main README.md with the RoboCup installation proc…
DriesSmit Jun 4, 2021
b67749b
correct copyright
arnupretorius Jun 4, 2021
a8d4219
fix: Update the readme file.
DriesSmit Jun 4, 2021
be2c61e
Small changes.
DriesSmit Jun 4, 2021
0f2633c
add inits and update typing
arnupretorius Jun 4, 2021
a260f03
Merge branch 'feature/robocup_environment' of https://github.com/inst…
arnupretorius Jun 4, 2021
0e2b586
copy stubs to inits
arnupretorius Jun 4, 2021
b9f960c
Fix mypy.
DriesSmit Jun 4, 2021
ae1027a
Pull changes.
DriesSmit Jun 4, 2021
cfdbc48
Automatically reformat code.
DriesSmit Jun 4, 2021
6312f8d
Merge pull request #219 from instadeepai/feature/robocup_environment
arnupretorius Jun 4, 2021
886d902
update stub
arnupretorius Jun 4, 2021
d9f5e3c
Merge pull request #227 from instadeepai/fix/stubs
arnupretorius Jun 4, 2021
8e0556c
feature: Make checkpointing optional.
KaleabTessera Jun 7, 2021
a50fb8f
feat(training): Allow different policy optims per agent.
KaleabTessera Jun 7, 2021
bd3a8dc
feat(wrapper): Cleaned up PZ wrapper.
KaleabTessera Jun 7, 2021
ed88f32
merge: Merge remote-tracking branch 'origin/develop' into feature/cod…
KaleabTessera Jun 7, 2021
362b93e
bugfix: Reverted optim change in qmix.
KaleabTessera Jun 7, 2021
5fa28ef
chore: Cleaned up comments.
KaleabTessera Jun 7, 2021
a970bd3
bugfix: Only checkpoint if checkpointer is not none.
KaleabTessera Jun 8, 2021
4f66fd3
Merge pull request #228 from instadeepai/feature/code-cleanup
arnupretorius Jun 8, 2021
0a47cbd
update copyright to InstaDeep Ltd
arnupretorius Jun 8, 2021
47e650a
fix setup file
arnupretorius Jun 8, 2021
97f097c
Merge pull request #229 from instadeepai/fix/copyright-stubs
arnupretorius Jun 8, 2021
b363448
fix(launchpad): Fixed stopping of launchpad program.
KaleabTessera Jun 8, 2021
44109e8
Merge remote-tracking branch 'origin/develop' into bugfix/stop-launch…
KaleabTessera Jun 8, 2021
7581e66
chore: Removed unused var.
KaleabTessera Jun 8, 2021
cc18b2f
debug: Debugging github actions.
KaleabTessera Jun 8, 2021
27ce633
fix(ci): Updated tf and reverb versions.
KaleabTessera Jun 8, 2021
d6c0e09
Merge pull request #230 from instadeepai/bugfix/stop-launchpad-run
KaleabTessera Jun 8, 2021
e8b2428
chore(release): Minor changes for release.
KaleabTessera Jun 9, 2021
a71af7a
docs: Updated readme to split core dependencies and non-core dependen…
KaleabTessera Jun 9, 2021
4b36832
docs: Updated docs to include package information.
KaleabTessera Jun 9, 2021
b737828
Merge pull request #231 from instadeepai/feature/release-prep
KaleabTessera Jun 9, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
fix: add SCII envs to tests.
  • Loading branch information
sgrimbly committed May 21, 2021
commit 9113abf19297eac1858a424c9f7319e8a3d04dec
40 changes: 17 additions & 23 deletions mava/wrappers/starcraft.py
Original file line number Diff line number Diff line change
Expand Up @@ -30,10 +30,9 @@

from mava import types
from mava.utils.wrapper_utils import convert_np_type, parameterized_restart
from mava.wrappers.env_wrappers import ParallelEnvWrapper # , SequentialEnvWrapper
from mava.wrappers.env_wrappers import ParallelEnvWrapper


# Is it ParallelEnvWrapper or SequentialEnvWrapper
class SMACEnvWrapper(ParallelEnvWrapper):
"""
Wraps a StarCraft II MARL environment (SMAC) as a Mava Parallel environment.
Expand Down Expand Up @@ -182,39 +181,34 @@ def env_done(self) -> bool:
return self._environment.env_done # TODO Check SMAC has this function

def observation_spec(self) -> types.Observation:
observation_specs = {}
for agent in self._environment.possible_agents:
observation_specs[agent] = types.OLT(
return {
agent: types.OLT(
observation=_convert_to_spec(
self._environment.observation_spaces[agent]
),
legal_actions=_convert_to_spec(self._environment.action_spaces[agent]),
terminal=specs.Array((1,), np.float32),
)
return observation_specs
for agent in self._environment.possible_agents
}

def action_spec(self) -> Dict[str, specs.DiscreteArray]:
action_specs = {}
for agent in self._environment.possible_agents:
action_specs[agent] = _convert_to_spec(
self._environment.action_spaces[agent]
)
return action_specs
return {
agent: _convert_to_spec(self._environment.action_spaces[agent])
for agent in self._environment.possible_agents
}

def reward_spec(self) -> Dict[str, specs.Array]:
reward_specs = {}
for agent in self._environment.possible_agents:
reward_specs[agent] = specs.Array((), np.float32)

return reward_specs
return {
agent: specs.Array((), np.float32)
for agent in self._environment.possible_agents
}

def discount_spec(self) -> Dict[str, specs.BoundedArray]:
discount_specs = {}
for agent in self._environment.possible_agents:
discount_specs[agent] = specs.BoundedArray(
(), np.float32, minimum=0, maximum=1.0
)
return discount_specs
return {
agent: specs.BoundedArray((), np.float32, minimum=0, maximum=1.0)
for agent in self._environment.possible_agents
}

def extra_spec(self) -> Dict[str, specs.BoundedArray]:
return {}
Expand Down
13 changes: 12 additions & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,18 @@
"trfl",
]

env_requirements = ["pettingzoo>=1.8.2", "multi_agent_ale_py", "supersuit", "pygame"]
# NOTE (St John, for Kale-ab) The SMAC installed when callling `pip install
# smac` seems to be a different package. We actually need to install SMAC
# via pip `install git+https://github.com/oxwhirl/smac.git`. PySC2 is the
# single agent env for StarCraft II, required by SMAC.
env_requirements = [
"pettingzoo>=1.8.2",
"multi_agent_ale_py",
"supersuit",
"pygame",
"pysc2",
"smac",
]

launchpad_requirements = [
"dm-launchpad",
Expand Down