Edit conv3d #3

Merged 81 commits on Sep 15, 2023
c96227a
[BugFix] Fix bug in ppo example config (#1396)
degensean Jul 19, 2023
01f602e
[BugFix] Fix ppo training NaN occurences (#1403)
vmoens Jul 21, 2023
40a0929
[Example] Multiagent examples: MAPPO-IPPO-MADDPG-IDDPG-IQL-QMIX-VDN (…
matteobettini Jul 25, 2023
6641791
[BugFix] Reward sum custom key (#1413)
matteobettini Jul 26, 2023
3558061
[Refactor] Use masking in collectors (#1412)
vmoens Jul 26, 2023
3f4e9aa
[BugFix] Pickable buffer (#1410)
albertbou92 Jul 26, 2023
d41e960
[Tutorial] Multiagent PPO tutorial (#1385)
matteobettini Jul 26, 2023
c06ed70
[Refactor] Remove loss device (#1395)
matteobettini Jul 26, 2023
dc8b7b5
[BugFix] Fix LSTM use with padded/masked segments (#1399)
smorad Jul 26, 2023
e8765c2
[CI] Fix tuto deps (#1416)
matteobettini Jul 27, 2023
fc8fe4c
[BugFix] unlock in tensordictmodules tests (#1417)
vmoens Jul 27, 2023
c693199
[Feature] VMAS obs dict (#1419)
matteobettini Jul 28, 2023
575a0a4
[BugFix] Fix `LazyStackedCompositeSpec` and introducing `consolidate_…
matteobettini Jul 31, 2023
8ad72c9
[BugFix] Fix KLTransform execution with LSTM (#1426)
vmoens Jul 31, 2023
0bd26f4
[Versioning] Python 3.11 (#1433)
vmoens Aug 1, 2023
bbf5545
python 3.11 in README (#1434)
vmoens Aug 1, 2023
71fd4c2
[BugFix] Fix QValueModule multi_one_hot (#1439)
smorad Aug 1, 2023
e8e8e43
[Feature] Optional mapping of "state" in gym specs (#1431)
matteobettini Aug 4, 2023
532ef76
[BugFix] Fix split_trajs (#1444)
vmoens Aug 4, 2023
83dfff3
[Test] Test split trajs (#1445)
vmoens Aug 4, 2023
b210665
[Feature] Heterogeneous Environments compatibility (#1411)
matteobettini Aug 4, 2023
7b21e52
[Tests] Collector compatibility for heterogeneous environments (#1414)
matteobettini Aug 4, 2023
2fe836c
Fix MacOS Mujoco Failure (#1450)
osalpekar Aug 8, 2023
e39e701
[BugFix] Fix Cython for D4RL (#1429)
vmoens Aug 8, 2023
6316a57
[Feature] IsaacGymEnvs integration (#1443)
vmoens Aug 9, 2023
892ff47
[Benchmark] Benchmark number GPU vectorised environments in VMAS (Tor…
matteobettini Aug 9, 2023
a01d79d
[Feature] Auto-DoubleToFloat (#1442)
vmoens Aug 11, 2023
c12d8bc
[Refactor] Change objectives parameter/buffer/target logic (#1424)
vmoens Aug 11, 2023
bd316e9
[BugFix] CompositeSpec.unsqueeze (#1464)
btx0424 Aug 18, 2023
78b2bb2
[BugFix] Discrete SAC rewrite (#1461)
matteobettini Aug 30, 2023
121ecd9
[Feature] Parallel environments lazy heterogenous data compatibility …
matteobettini Aug 30, 2023
b444007
[Algorithm] Online Decision transformer (#1149)
BY571 Aug 30, 2023
16ce926
[CI] Fix Mujoco version (#1475)
vmoens Aug 30, 2023
f8777a6
[Feature] Allow multiple (nested) action, reward, done keys in `env`,…
matteobettini Aug 30, 2023
7205a5a
[CI] Linux Olddeps tests to Nova (#1289)
osalpekar Aug 30, 2023
7bc9955
[Feature] Device transform (#1472)
vmoens Aug 30, 2023
530efa8
[BugFix] Fix reward sum within parallel envs (#1454)
vmoens Aug 30, 2023
7ae6140
Move to More Efficient Windows Runner (#1476)
osalpekar Aug 30, 2023
e3b3879
[Refactor] Faster envs (2) (#1457)
vmoens Sep 1, 2023
4c8f91f
[BugFix] Fix RewardSum spec transform to mimic reward spec (#1478)
matteobettini Sep 1, 2023
dbab7bb
[BugFix] Fix NoopResetEnv behavior when trials exceeded. (#1477)
skandermoalla Sep 1, 2023
2982515
[BugFix] Fix LSTM - VecEnv compatibility (#1427)
vmoens Sep 2, 2023
545a28c
[BugFix] Fix Gym Categorical/One-hot issues (#1482)
vmoens Sep 3, 2023
9fded1a
[Feature] Masking actions (#1421)
vmoens Sep 3, 2023
3595c45
[Doc] Update coding_ppo.py (#1483)
kushaangupta Sep 3, 2023
c7af3a8
[BugFix] Nested key in replay buffer (#1485)
matteobettini Sep 4, 2023
99bd15a
[Feature] VMAS: choose between categorical or one-hot actions (#1484)
matteobettini Sep 4, 2023
3649234
[CI] Fix examples CI (#1489)
matteobettini Sep 5, 2023
e133749
[Feature] Refactor categorical dists: Masked one-hot and pass-through…
vmoens Sep 5, 2023
deb6e6e
[Minor] Missing commit from #1488 (#1490)
vmoens Sep 5, 2023
061a261
[BugFix] Fix action mask test (#1492)
vmoens Sep 5, 2023
bdc573f
[BugFix] Fix typo in CatFrames Transform error message. (#1491)
skandermoalla Sep 6, 2023
d729f3d
[Setup] Update setup.py python versions (#1496)
vmoens Sep 6, 2023
72e8fa4
[Minor] Capture error in CatFrame edit (#1498)
vmoens Sep 6, 2023
0ed2208
[Feature] RoboHive integration (#1119)
vmoens Sep 6, 2023
2e91643
[BugFix] Empty-spec tolerance (#1501)
vmoens Sep 6, 2023
153337e
[Doc] correct typos in pendulum tutorial (#1502)
kushaangupta Sep 7, 2023
786020d
[Feature] Making action masks compatible with q value modules and e-g…
matteobettini Sep 7, 2023
d8504e7
[BugFix] Fix DDPG squeezing (#1487)
matteobettini Sep 7, 2023
bca6adf
[Feature] ClipTransform + rename `min/maximum` -> `low/high` (#1500)
vmoens Sep 7, 2023
4c50f1e
[BugFix] Multiagent "auto" entropy fix in SAC (#1494)
matteobettini Sep 7, 2023
435584e
[Minor] More efficient SAC v1 (#1507)
vmoens Sep 7, 2023
519975b
Merge branch 'main' into HEAD
vmoens Sep 7, 2023
d2e11bf
[BugFix] Fix ClipTransform device (#1508)
vmoens Sep 7, 2023
394690e
lint
vmoens Sep 7, 2023
b7e5299
[CI] Move linux stable to GHA (#1503)
vmoens Sep 8, 2023
c62781c
[BugFix] Add `torch.no_grad()` for rendering in multiagent PPO tutori…
matteobettini Sep 8, 2023
162c26f
[CI] Remove CCI Config (#1456)
osalpekar Sep 8, 2023
147de71
[Performance] Miscellaneous efficiency improvements (#1513)
vmoens Sep 8, 2023
a73428b
[Environment] Petting zoo (#1471)
matteobettini Sep 14, 2023
274cdfc
[CI] Fix CI PettingZoo (#1528)
matteobettini Sep 14, 2023
da50587
[BugFix] Fix envpool (#1530)
vmoens Sep 14, 2023
5f3741b
[Environment, Docs] SMACv2 and docs on action masking (#1466)
matteobettini Sep 15, 2023
1182f46
[CI] Fix CI (#1529)
vmoens Sep 15, 2023
98da063
[BugFix] Small patches to SMAC (#1533)
matteobettini Sep 15, 2023
dbf92c8
[Refactor] Expose all wrappers in torchrl.envs (#1532)
vmoens Sep 15, 2023
9804ab4
[Test] Skip tests in python 3.11 (#1535)
vmoens Sep 15, 2023
dbfce8c
[BugFIx] Fix entropy signature in truncated normal (#1536)
vmoens Sep 15, 2023
1060222
[CI] Fix habitat CI (#1537)
vmoens Sep 15, 2023
43b943e
[Test] Test state_dict and loss modules (#1527)
vmoens Sep 15, 2023
3fcd563
Merge branch 'main' into edit_conv3d
vmoens Sep 15, 2023
[Tutorial] Multiagent PPO tutorial (pytorch#1385)
Signed-off-by: Matteo Bettini <matbet@meta.com>
Co-authored-by: Vincent Moens <vincentmoens@gmail.com>
matteobettini and vmoens authored Jul 26, 2023
commit d41e9604876f8d7d3893a983ad1a75152f2d5194
5 changes: 5 additions & 0 deletions .gitignore
@@ -70,6 +70,11 @@ instance/

# Sphinx documentation
docs/_build/
+docs/build/
+docs/source/gen_modules
+docs/source/reference/generated
+docs/source/tutorials
+docs/src

# PyBuilder
.pybuilder/
1 change: 1 addition & 0 deletions docs/source/index.rst
@@ -49,6 +49,7 @@ Intermediate
 .. toctree::
    :maxdepth: 1

+   tutorials/multiagent_ppo
    tutorials/torchrl_envs
    tutorials/pretrained_models
    tutorials/dqn_with_rnn