-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature/ Smac wrapper Update, MADQN/QMIX/VDN upgrades and Dockerfile improvements #310
Merged
Merged
Changes from 1 commit
Commits
Show all changes
143 commits
Select commit
Hold shift + click to select a range
97074fe
feat: Log smac metrics such as win_rate and use existing renderer.
KaleabTessera 8e5256e
feat: Updated smac wrapper to better log agent details and correctly …
KaleabTessera 96161bf
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera 3c09aa1
wip(smac): WIP new smac wrapper.
KaleabTessera 21a17c7
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera c746820
fix: Updated eps greedy module and add state info to PZ wrapper.
KaleabTessera 41221a9
fix(wrappers): Updates to PZ wrappers to handle dying agents.
KaleabTessera af664dc
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera a65f013
chore: Consistency in loss name.
KaleabTessera db12989
feat(madqn,vdn): Upgraded action selectors in madqn and vdn.
KaleabTessera 0de351e
Merge remote-tracking branch 'origin/feature/smac-env-upgrades' into …
KaleabTessera 99ec5e1
feat(qmix): Upgraded action selectors in qmix.
KaleabTessera 79ed791
feat(dial): Upgraded action selectors in dial.
KaleabTessera b13308f
feat(examples): Updated custom madqn example.
KaleabTessera e61329d
feat(qmix,vdn): Fixes to creation of agent networks.
KaleabTessera 132f3c4
Merge branch 'chore/update-for-new-acme-version' into feature/smac-en…
KaleabTessera c79fff8
feat(qmix,vdn,qmix): Added epsilon scheduling by time_t.
KaleabTessera 562e97a
chore: Updated mypy.
KaleabTessera ffae883
chore: Changes according to mypy typing.
KaleabTessera a458d38
feat: Added lr decay to all systems.
KaleabTessera 946b903
feat: Improved lr decay scheduling.
KaleabTessera 4f10642
chore: Improved docstrings and lr decay examples.
KaleabTessera fbfe687
chore: Cleanup docstrings and update tests.
KaleabTessera 983315d
fix: Pass learning_rate_schedule in qmix.
KaleabTessera ba61dc1
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera a913b13
fix: Handle tuple observations.
KaleabTessera 59d1575
fix: Update spec from box to compatible spec.
KaleabTessera 11f0f1e
chore: Ignore mypy not relevant issue.
KaleabTessera eaa8e27
fix: Updated tests and fixes in PZ wrapper.
KaleabTessera 86e5d1a
tmp: Force reinstall to get latest mypy version.
KaleabTessera 3c80019
tmp(ci): Force reinstall to get latest mypy version.
KaleabTessera 6acf517
debug: Mypy ci version check.
KaleabTessera a156de6
chore: Made smac pz wrapper optional.
KaleabTessera 67aa06e
chore: Update mypy ignore for github actions.
KaleabTessera 60cd23d
feat: Added code duplication check to pre-commits.
KaleabTessera 77a9828
chore: More info on smac install.
KaleabTessera a8a39b8
debug: Reset epsilon in evaluator to previous value of 1e-10, instead…
KaleabTessera 8732162
debug: Reverted epsilon greedy action selector change.
KaleabTessera 9324100
chore: Reset eval epsilon.
KaleabTessera 0125e2b
fix: Re-added new epsilon decay.
KaleabTessera f1fbca0
debug: Comment out extra logging and after trainer step.
KaleabTessera 86a1575
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera 9e9d302
fix: Improved efficiency of eps scheduling.
KaleabTessera 367953f
debug: Remove all logging.
KaleabTessera 14a313a
feat: Optimized eps scheduling.
KaleabTessera 9e7b305
chore: Updated constant scheduler.
KaleabTessera d09a8a7
chore: Simplified eps scheduling.
KaleabTessera 5b5a0c8
debug: Reverted epsilon greedy action selector change.
KaleabTessera 46d13de
feat: Updated epsilon greedy action selector.
KaleabTessera 8ceeeb7
chore: Removed name scope in eps greedy.
KaleabTessera e36b8f7
fix: Fixed eps scheduling.
KaleabTessera 47bfb99
fix: Updated eps start default.
KaleabTessera f923171
Merge branch 'develop' into feature/smac-env-upgrades
KaleabTessera c30dd6e
chore: Changed learning_rate_schedule to exploration_scheduler_fn.
KaleabTessera bbd08d7
chore: Updated docstring.
KaleabTessera 3ed6838
feat: Improved loggers and logged actions.
KaleabTessera 831ea52
chore: Added sc2 gifs.
KaleabTessera 301e895
feat: Simplified eps sheduling and cleaned up dial implementation.
KaleabTessera c93232a
chore: Removed code duplication check.
KaleabTessera 80bdefb
mrege: Merge remote-tracking branch 'origin/develop' into feature/sma…
KaleabTessera e472290
test: Updated tests.
KaleabTessera 8358b9c
fix: Madqn recurrent execution.
KaleabTessera e838aae
chore: Updated readme.
KaleabTessera 315253a
chore: Updated sc2 gif.
KaleabTessera 2fc4d68
chore: Updated gif.
KaleabTessera 8bc92ad
Merge branch 'develop' into feature/smac-env-upgrades
KaleabTessera bfaf6b2
Fix: removed unused vars.
KaleabTessera 02862a7
chore: Updated readme.
KaleabTessera 96ac7dd
chore: Removed ignore.
KaleabTessera 62455bb
fix: Fix formatting.
KaleabTessera 3e03603
feat: Upgraded tf,reverb and launchpad.
KaleabTessera 088fd1a
fix: Added seeds for deterministic sampling in tf/tf-prob 2.7.
KaleabTessera 496e2e4
chore: Updated pz version.
KaleabTessera 5669793
feat: Added state info and stats to pz wrapper.
KaleabTessera 4408d2c
chore: Updated tf/lp versions.
KaleabTessera ae0cb99
fix: Made extra libraries not required.
KaleabTessera 584d3e7
fix: Removed supersuit forced requirement.
KaleabTessera 2196c62
feat: Updated dependencies/tests/dockerfiles.
KaleabTessera 99c9591
Merge branch 'develop' into feature/smac-env-upgrades
KaleabTessera 705745f
fix: Updated melting pot with madqn scheduler.
KaleabTessera 7808108
ci: Updated tests script.
KaleabTessera 7ad68e7
ci: Updated tests scripts.
KaleabTessera 00d4135
ci: Updated tests scripts to run as sudo.
KaleabTessera 968beb3
fix: Fixed enum flag for py3.6.
KaleabTessera 463dbf9
ci: Updated permissions.
KaleabTessera 9e1143f
ci: Updated permissions.
KaleabTessera c31271a
fix: Ci for py3.6 fix.
KaleabTessera 69b61cb
ci: Test docker github actions.
KaleabTessera b553ae9
ci: Docker github actions test.
KaleabTessera 07c8e41
ci: Docker github actions fix script location.
KaleabTessera b279a22
ci: Docker github actions updated run command.
KaleabTessera b0056d5
ci: Fixed test location.
KaleabTessera c446f56
ci: Updated virtual env install.
KaleabTessera c0c13bc
ci: Changed virtualenv.
KaleabTessera e452574
ci: Check types and format.
KaleabTessera fbbff50
ci: Test github runtime.
KaleabTessera af52cc6
ci: Trying unrar free.
KaleabTessera 9e37d1a
ci: Updated unrar command.
KaleabTessera 46f3729
feat: Added swig dependency.
KaleabTessera f37e403
fix: Updated swig install.
KaleabTessera 2091995
ci: Updated fail fast strategy.
KaleabTessera 29133f3
ci: Updated flake8.
KaleabTessera f8b1fea
ci: Fix box2d.
KaleabTessera ba6f8fd
ci: Updated format checks and tests.
KaleabTessera 5197f08
chore: Removed unused dependency.
KaleabTessera 98fae98
feat: Updated smac dockerfile.
KaleabTessera 98f7e99
merge: Merge remote-tracking branch 'origin/feature/eval-intervals' i…
KaleabTessera e14420c
style: Added pre-commit hook for conventional commits.
KaleabTessera 670f021
docs: Updated py3.6 -> py3.9 in docs.
KaleabTessera 12022a2
feat: Switched from custom smac wrapper to pz smac wrapper.
KaleabTessera d4c0a5b
chore: Mypy fixes.
KaleabTessera fba2da5
fix: Removed deprecated smac utils tests.
KaleabTessera 3df706d
feat: Added dockerfiles and install instructions for melting pot.
KaleabTessera 89ace75
typing: Fixed mypy issues.
KaleabTessera 784d437
chore: Mypy changes.
KaleabTessera 914b733
ci: Updated docstrings check.
KaleabTessera e611837
chore: Removed comments.
KaleabTessera fcfa7c0
merge: branch 'feature/eval-intervals' into feature/smac-env-upgrades
KaleabTessera 6ef477e
Merge branch 'feature/eval-intervals' into feature/smac-env-upgrades
KaleabTessera 5cf83f7
Dockerfile install sudo. Auto yes on unrar install
mmorris44 d085ce9
Updated README
mmorris44 a1e215b
Upgrade flatland to version 3
mmorris44 ca58de9
docs: Updated readme for examples.
KaleabTessera 1053340
chore: Fixed formatting.
KaleabTessera 335353c
merge: Merge branch 'feature/eval-intervals' into feature/smac-env-up…
KaleabTessera 6d4d043
Merge branch 'feature/eval-intervals' into feature/smac-env-upgrades
KaleabTessera bf940f4
chore: Updated flatland dependency.
KaleabTessera 9c7a9e6
feat: Updated dockerfile.
KaleabTessera 39543c1
fix: Base image alias.
KaleabTessera 3df8642
fix: Set DEBIAN_FRONTEND to noninteractive to fix installs.
KaleabTessera 0ccdd05
fix: Updated robocup install script.
KaleabTessera b543fda
fix: Updates to robocup dockerfile for ubuntu 20.04.
KaleabTessera 2feb3db
feat: Added workflows for main/master branch.
KaleabTessera a1cc4ab
fix: Updated robocup install script.
KaleabTessera ef7d6e5
fix: Re-add sudo to install robocup incase it is run outside of docker.
KaleabTessera c2fe2f0
fix: Added sudo to robocup dockerfile.
KaleabTessera 6739ac8
feat: Upgraded version of rcssserver.
KaleabTessera 9c829dc
Merge remote-tracking branch 'origin/develop' into feature/smac-env-u…
KaleabTessera 7e5273b
chore: Removed deprecated dockerfile.
KaleabTessera 50e3054
feat: Simplified meltingpot install.
KaleabTessera d15ea02
Merge branch 'feature/smac-env-upgrades' of github.com:instadeepai/Ma…
KaleabTessera 0afc111
docs: Updated meltingpot readme.
KaleabTessera 27f55f7
fix: Minor updates to meltingpot install.
KaleabTessera File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
chore: Simplified eps scheduling.
- Loading branch information
commit d09a8a70277f9cdbe68cea896f9c712ba41bb278
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this not handled by default in the system, when there is no schedule?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So if we don't pass a schedule, the action selectors still try to call
get_epsilon
. This was the neatest way I could find to allow for constant schedules (e.g. in evaluators), without changing what we pass in or having a lot of messy if statements.