Squashed commit of the following:
commit 6092dc656cd73ee5a0fb0dd6e29c50b11c9b84ac
Merge: 73537c5f 3ddcbff7
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Fri Nov 8 15:45:07 2024 +0200

    Merge pull request #1130 from instadeepai/fix/sable-pos-encoding

    fix: limit timestep-pos-encoding to rec-Sable

commit 3ddcbff74fe1fa221c037e9701a502fcd6c8aa64
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Fri Nov 8 11:14:21 2024 +0000

    docs: update docs

commit daf1c199b4e2bdf0a9c012f6681d5fdb18781a25
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Fri Nov 8 11:11:45 2024 +0000

    fix: controlling timestep positional encoding in acting phase

commit 73537c5f2294773fc73ba9e4f71203e13c97fc59
Merge: 905710fc d3631094
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Thu Nov 7 15:35:30 2024 +0100

    Merge pull request #1126 from instadeepai/fix/mabrax

    fix: mabrax requirement

commit d3631094feec5e8de3b3ff23382ac447414bb8fe
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Thu Nov 7 14:52:54 2024 +0200

    fix: mabrax requirement

commit 905710fc7d14e2567640268be72fc59835e31697
Merge: c86604c4 bb8e1073
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 13:29:08 2024 +0100

    Merge pull request #1113 from instadeepai/feat/sable

    Add Sable [Discrete actions]

commit bb8e1073187cd9bd5ca5d4c04bbf385868ae9546
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 11:05:38 2024 +0100

    Update mava/systems/sable/anakin/ff_sable.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit b3b43ec05ebed5205e465d2bad7f75dc5825baa2
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 11:05:27 2024 +0100

    Update mava/systems/sable/anakin/ff_sable.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 408c027e0e7366d539d163e36831764f323580e3
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 11:05:20 2024 +0100

    Update mava/systems/sable/anakin/rec_sable.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 3c250b838fe2ad7b6bac3e3ec770364aecf38c45
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 11:04:23 2024 +0100

    Update mava/networks/sable_network.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 18f7e662055f12519b4c5a6f3bbc54ea3e8bce16
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Nov 7 10:03:34 2024 +0000

    feat: update decoder file by removing unnecessary functions

commit a0daaebf80d3407c7e5c03389dcab6e2b9d0b2bd
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Nov 7 09:52:32 2024 +0000

    feat: update docs based on review

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 210faddc59c88a44a6a8c16e70e27767802c6116
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Nov 7 09:26:22 2024 +0100

    fix: run pre commits

commit 8546254ccb15febd92694b2a554bc7c8d08d9cbf
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 09:24:41 2024 +0100

    Update mava/systems/sable/anakin/rec_sable.py

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 5b7156ff75c248a49f817397ca00f9d77215172d
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 09:24:33 2024 +0100

    Update mava/systems/sable/anakin/ff_sable.py

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit d3719baa79b26ff9e580968d90e7ee319bd6c374
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Thu Nov 7 09:24:26 2024 +0100

    Update mava/systems/sable/anakin/ff_sable.py

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 011995d5cfd6287b7e9d25d1c049d128533d17a0
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 16:18:04 2024 +0100

    feat: move all system specific config setup to the system file

commit e552509e9aac94b4941969fa6e9acb9f4d4282e3
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 15:58:38 2024 +0100

    feat: checkpointer hstate restoring fix

commit bf58ded9038e79c544c3f82e419e71499270273b
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 14:58:38 2024 +0100

    feat: move concat agents and time to jax utils

commit e0ce8f42f16cedea4d20bac709520ff17f40bbf2
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 14:56:19 2024 +0100

    feat: get the positional encoding flag outside the util fn

commit aa9cba8864b020c08d0effc1eafcfd9008108e1c
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 14:50:51 2024 +0100

    feat: split encoder_decoder_Fn to two files

commit 0030b356ad4bbb7248a8b330334af89b92c78cbc
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 14:45:49 2024 +0100

    feat: use input hstate as the output variable instead of using extra hs variable

commit d9432f4aad865b1495e7b9f9e538d4dad3c3bc77
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:58:50 2024 +0000

    feat: rename retentions to retention_heads

commit 73ff86fca63bb7664ffe12dac91de0a407310e6f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:45:54 2024 +0000

    feat: replace init fn of sable net to get_actions one

commit 3998b51a33aed49735dcf1ab9b335424cf1a1263
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:37:48 2024 +0000

    feat: send optimizer update fn directly without intermediate var

commit 9f36fe6ac1fe333b661d4644c7f4468aff978a42
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:35:18 2024 +0000

    feat: move squeezing output of the net to inside the net fns

commit a6370a97493415e2d34bf881bf9edeece3839180
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:27:14 2024 +0000

    docs: update some docs

commit d80cf9186b111e56067cfb16272c57cca41aa9d0
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:18:25 2024 +0000

    feat: replace full attn flag by masking flag

commit 5f214cef462cd968975380fd122c2e17d5a5574e
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 12:14:00 2024 +0000

    feat: use the chunk size only to decide on use chunkwise flag for that

commit b5d39934e21c244f4af31940f82c506c10146620
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Wed Nov 6 12:58:47 2024 +0100

    Update mava/networks/utils/sable/encoder_decoder_fns.py

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 670de846ed4f9818ec9c3864e16b9448d1a3ab23
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Wed Nov 6 12:58:27 2024 +0100

    Update mava/networks/utils/sable/encoder_decoder_fns.py

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit de16e844f7e82aa20ba736e532bf1d44f70bf5c6
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 11:56:59 2024 +0000

    feat: rename training apply callable type to LearnerApply

commit 8af3bb407a5780bd45dd99fc29ed4aae78efc4ff
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 11:54:52 2024 +0000

    feat: addressing some renaming suggestions

commit f0360d1341c21b2a3ae0d8401ee04d1fd95d7b78
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 11:05:56 2024 +0000

    chore: rename obs_carry to observation

commit 57e3b517b376b88fb0a39d602023cd4e258b41a3
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 10:45:20 2024 +0000

    fix: renaming the shape related to n_agents and actions_dim

commit 437b8f62f35b988777d8c796ed43ad052f734707
Merge: e0c863c2 b3ac1d9b
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 10:36:46 2024 +0000

    Merge branch 'feat/sable' of github.com:instadeepai/Mava into feat/sable

commit e0c863c233676f791e105d0b22f0b3d187236de8
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 10:35:57 2024 +0000

    feat: update the action type to follow up same MAT standards

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit b3ac1d9bca01d1a6e221d626147b3640b688c6f2
Merge: 7646a2f0 c86604c4
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Nov 6 12:15:45 2024 +0200

    Merge branch 'develop' into feat/sable

commit 7646a2f03d9ce2e255d142b30bfb7490aa8e97e7
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 10:00:06 2024 +0000

    fix: update timeout in workflow to 20 min

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 0dd0eab6a9ff823ad7508177b8bc4b7265cc1ccb
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 09:53:51 2024 +0000

    feat: update shifting action method in autoregressive act

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 945937cc977db0e469f12ca3227d66c595491d08
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Nov 6 09:36:29 2024 +0000

    feat: standardize the definition of net config to NamedTuple

commit c86604c4a6232d6bafee99a6ebfa7693cd652ebe
Merge: 7f2568a7 fb5c97c6
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Tue Nov 5 16:52:40 2024 +0200

    Merge pull request #1120 from instadeepai/feat/vector-connector-wrapper

    Add vector connector wrapper

commit fb5c97c61ac60d6484b2f51493feb10103b8d1ea
Author: RuanJohn <ruanjdekock@gmail.com>
Date:   Tue Nov 5 15:34:57 2024 +0200

    chore: docstring

commit d1a0c1c6406f95170268afb4c0b548b8ef177e08
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Nov 5 10:27:00 2024 +0000

    feat: make intermediate line to calculate decay_matrix

commit ae652fcefb7d29c7f8877556dcffc275cfeb4886
Merge: 1d8515e5 7f2568a7
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Tue Nov 5 10:19:46 2024 +0200

    Merge branch 'develop' into feat/vector-connector-wrapper

commit 7f2568a7a3944b6ca3195f6561ada55ee163d864
Merge: 3577523b b689a83e
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Tue Nov 5 10:19:22 2024 +0200

    Merge pull request #1123 from instadeepai/chore/num-minibatches-assert

    Chore: Add asserts for number of envs divisible by number of minibatches

commit b689a83e87044c1241dc29aa435fc7ba061336ce
Merge: d555f21a 3577523b
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Nov 5 10:01:53 2024 +0200

    Merge branch 'develop' into chore/num-minibatches-assert

commit eb625901cb3b3c1ed9a99ceaaa3111993c34f2ac
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 16:18:45 2024 +0000

    fix: major fix of sending non zero hstate for autoregressive act

commit 69f39a57712561b8a42f5da6671ac6576b80c6f0
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 13:15:13 2024 +0000

    feat: rename shape vars in encoder decoder fns file

commit 7068a689cf8f7783043dc9b20a60a50e8fd39fa5
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 10:19:20 2024 +0000

    feat: merge the chunkwise and parallel fns into one

commit 938541283de54f71084f83724d11f0ceb40dc3eb
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 07:50:22 2024 +0000

    feat: move make eval fn to system files

commit 75ced75c2f989e3221e83583bc0ad8c7097c93e4
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 07:26:36 2024 +0000

    feat: move sable util fns to network folder

commit 1d8515e5a36e6b5d1a70ba04b821b6fc96d7019c
Merge: fd276c0f 3577523b
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Nov 4 09:06:34 2024 +0200

    Merge branch 'develop' into feat/vector-connector-wrapper

commit 1d38c24f555405f154bff0e0bcc38f94b81b6923
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Nov 4 05:55:09 2024 +0000

    feat: update checkpointer fn output types

commit 2b80a7d8586fa2fed630519f58fd94b28f321b4c
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sat Nov 2 18:09:22 2024 +0000

    feat: update sable hstate attributes naming

commit 584b0d4551921db42725c375b8872350cdf98dfc
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sat Nov 2 18:03:10 2024 +0000

    chore: update tree map

commit dd21d04dfdec9843e401b91fc1a277ba9a78b9c2
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sat Nov 2 18:00:26 2024 +0000

    chore: docs fixes in sable network file

commit 30351515867db54556521316d990686a79a8343f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sat Nov 2 17:18:57 2024 +0000

    chore: docs fixes in retention file

commit 3577523b3a2f32bd35a2bf2e91f14a539a451e95
Merge: 327e0664 3373c579
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Sat Nov 2 14:27:05 2024 +0100

    Merge pull request #1119 from instadeepai/fix/quickstart-notebook

    Fix quickstart notebook

commit 3373c57929d7cfb7c3b40c0f0218716bcfa3a1f6
Author: WiemKhlifi <w.khlifi@instadeep.com>
Date:   Fri Nov 1 17:06:31 2024 +0100

    revert: point on develop for installation

commit 65d1f2d3dc080816f10f8cec3882471a0013ba90
Merge: d866bd57 327e0664
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Fri Nov 1 16:31:29 2024 +0100

    Merge branch 'develop' into fix/quickstart-notebook

commit fd276c0f5b4df580a1d0a37282ed36c7669c852d
Merge: 3f658ee5 327e0664
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Fri Nov 1 16:25:08 2024 +0100

    Merge branch 'develop' into feat/vector-connector-wrapper

commit d555f21aecb48aa2b8bf5545c24742aa46e51c1a
Author: SimonDuToit <fratinokek@gmail.com>
Date:   Fri Nov 1 17:04:39 2024 +0200

    pre-commit

commit b11fb37123971e220164b722ce26bef19e895de5
Merge: 63785093 327e0664
Author: SimonDuToit <fratinokek@gmail.com>
Date:   Fri Nov 1 16:34:08 2024 +0200

    Merge branch 'develop' into chore/num-minibatches-assert

commit 327e0664fafbb3ba18ea6d1f8a48166c9106c5d4
Merge: 6eed2d2f 7944e41b
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Fri Nov 1 15:32:45 2024 +0100

    Merge pull request #1121 from instadeepai/feat/more-rware-scenarios

    More rware scenario configs

commit 63785093873a0d21df2e3fa6b91b03516e1d0a16
Author: SimonDuToit <fratinokek@gmail.com>
Date:   Fri Nov 1 16:32:28 2024 +0200

    add asserts

commit d866bd575ba205a0ba20bed931edb19ee4151e67
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Nov 1 16:24:55 2024 +0200

    chore: update explainer text in example notebook

commit 648337049aee98822458ceb41c371a77f62ff777
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Fri Nov 1 10:31:19 2024 +0100

    Update mava/configs/network/ff_retention.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit aa8b455eadc7390a483b17316d0c41b28e6c77aa
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Fri Nov 1 10:31:03 2024 +0100

    Update mava/configs/network/rec_retention.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 3f658ee50571cb08543d68c2883d2879191c3196
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 31 17:48:33 2024 +0200

    test: add vector connector to integration tests

commit 69db3eb1851b6c9e20f3db71758e88d8b39312ab
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 31 17:10:45 2024 +0200

    feat: separate env config for vector connector

commit 1fdfce910f0dcc9ef44eb2c2c9607f7eb45c5762
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Oct 31 14:00:37 2024 +0000

    fix: define decay scaling factor for ff sable before sending config to enc-dec

    Co-authored By: sash-a

commit 283b6a9dc2f8587f6e0feb2ad70703d9a0fa5d32
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Thu Oct 31 15:09:03 2024 +0200

    feat: use boolean masks instead of jnp.where

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit ba52ce4f463a1101994ee079bdc69cc1296376c9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Thu Oct 31 15:08:34 2024 +0200

    chore: remove debug print statement

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 77f291cf032f842b9cfb36970ca6e5563d01c61f
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Thu Oct 31 15:08:04 2024 +0200

    chore: comments for view shapes

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 7944e41b494558b6ded8e2227e9f50aff25784a6
Author: RuanJohn <ruanjdekock@gmail.com>
Date:   Wed Oct 30 16:49:51 2024 +0200

    feat: more rware scenario configs

commit 3d36aab988aa36c2fc5e4cdbc7c46ce53e2a8b7d
Author: RuanJohn <ruanjdekock@gmail.com>
Date:   Wed Oct 30 15:58:58 2024 +0200

    feat: add vector connector wrapper

commit dc00782761f9f5e46d4804ba53b1dd8ebe4eec13
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 30 13:15:05 2024 +0000

    fix: fixing the training by adding causal masking of decoder for ff sable

commit 2fb21c7c1bb9acf3a29048f154bc1b704e9ae989
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 30 12:23:23 2024 +0000

    feat: remove the sable net checker in simple retention

commit 7732d52311f473fb698242530fa059b8213dbc8f
Merge: ef32a219 6eed2d2f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 30 09:48:04 2024 +0000

    feat: merge develop branch

commit eea913b8f6e3e6ec39776f714adcdc730b42e10f
Merge: cc47103d 6eed2d2f
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Wed Oct 30 10:23:16 2024 +0100

    Merge branch 'develop' into fix/quickstart-notebook

commit 6eed2d2fd3b27b55a0d86e7146609eb7b483d584
Merge: 389fbe58 ed3f015c
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Wed Oct 30 10:22:04 2024 +0100

    Merge pull request #1115 from instadeepai/feat/new-dockerfile

    feat: updated dockerfile

commit ed3f015c5d35a0df38b5bd434750a7618c7fe0a1
Merge: 83fa5a9e 389fbe58
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Wed Oct 30 10:11:00 2024 +0100

    Merge branch 'develop' into feat/new-dockerfile

commit 389fbe586e9de425b87fe89ca75bd066849644e2
Merge: 25008fbc 0ec7049d
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 30 11:00:41 2024 +0200

    Merge pull request #1107 from instadeepai/feat/implement-mat

    Add MAT

commit 0ec7049d8bc91b87b3f221dfb35f3fe16a271770
Merge: 3d47bebb 25008fbc
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 10:03:16 2024 +0200

    feat: merge in main

commit 25008fbc5af4744be83c2e0ff007812718c27f60
Merge: 8b758133 936c0b8e
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Oct 30 09:57:34 2024 +0200

    Merge pull request #1105 from instadeepai/feat/hasac2

    feat: hasac

commit 3d47bebba6610c8d32c5107864831ee96be4d357
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 09:25:23 2024 +0200

    feat: swiglu documentation

commit 7276aa0c93eb5a336281bb21282fd21af1314d41
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 09:16:15 2024 +0200

    feat: execution and training apply types

commit d9358311ba4475c8b1df601cba330907cdcb2617
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 09:09:55 2024 +0200

    chore: rename embed dim

commit a309bfa3b73056a441234238019553b82fe8b916
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 09:03:42 2024 +0200

    chore: remove obs dim in MAT network class

commit 3cb460d405988a5a9c40d6b664bf1c675f348fc5
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 30 08:53:45 2024 +0200

    chore: config comments and reverts

commit ef32a21947e92a9c34d0933f1bd2d308e5159b69
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 29 15:31:37 2024 +0000

    feat: compress net params in net_config

commit 83fa5a9e2bba5f3a63565163fa9383afb8952685
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Oct 29 17:29:17 2024 +0200

    chore: remove docker volumes from makefile

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 936c0b8e5635a371a197cd1d256f2a181445fc59
Merge: cf45f98e 8b758133
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Oct 29 17:25:18 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit cc47103d305516203ea0143c7ed96be59331172d
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 29 15:17:36 2024 +0200

    chore: remove notebook restarting cells

commit 975df5fd8ec2b9c591e325ac32061dd54d4f60a1
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 29 14:50:37 2024 +0200

    docs: mention that we use python 3.10 on colab

commit 4376b14a425f8355c89cfd5f392fb9ef919743c9
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 29 14:48:49 2024 +0200

    temp: change dir to quickstart notebook for reviewing

commit 71f572cb70efa725ec3e94a2d86fe06a9e8cd878
Merge: 19731683 8b758133
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 29 12:32:20 2024 +0100

    merge develop

commit 19731683b99e9bec89afff3124c8dd9dd90faa0e
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 29 12:24:41 2024 +0100

    feat: prevent decay matrix calculation in case of ff sable

commit 742903cac149b6ad8b0513961d15bb883c0cb68a
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 29 11:46:07 2024 +0100

    fix: fixing the retention output indexing

commit 2f9dd4edb08a30d8c55cb965079ccca33e1d73d4
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 29 10:30:02 2024 +0200

    fix: update quickstart notebook

commit e8b7f57912037214a80c066ac386b9c373364f8f
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 29 09:18:38 2024 +0200

    feat: update pyproject

commit 8a11bcf1e74193b783feeb834fbd0b2c64309f49
Merge: 5424c663 8b758133
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 19:45:37 2024 +0200

    Merge branch 'develop' into feat/new-dockerfile

commit cf45f98e7df00906d2abccbd6463e61ef43b6ad0
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 19:10:42 2024 +0200

    chore: docs

commit 7c8b91964e39edddfda078835ee5f7bfba80927e
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 19:01:57 2024 +0200

    chore: docs

    Co-authored-by: Omayma Mahjoub <omaymamahjoub98@gmail.com>

commit bd4c8bcc32a696e8e268ac5e36ceea44d8d7ea3d
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 17:33:41 2024 +0200

    chore: pre-commit

commit f3c990e222256d8aa8d1630195012e704b33b2bd
Merge: fc2b2bd5 8b758133
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 17:08:29 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit 8b758133056e86303ab1acbe5aa2ade02e0f6e70
Merge: 54d3b50a 755b4600
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 17:07:26 2024 +0200

    Merge pull request #1106 from instadeepai/feat/merge-qmix

    Add QMIX

commit 755b4600db94fe79da7192ad33cda62025d1f9e0
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 16:43:07 2024 +0200

    chore: remove type hint

commit 880698c203b40c3e9b995ac6b09334856e5d642f
Merge: 3c81350f 54d3b50a
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 16:41:40 2024 +0200

    Merge branch 'develop' into feat/merge-qmix

commit 54d3b50abaa833d805244dc62cf5a9f909948b6a
Merge: 87354a38 e9ff8b87
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 16:41:18 2024 +0200

    Merge pull request #1109 from instadeepai/feat/pyproject-toml

    feat: switch to pyproject.toml

commit fc2b2bd57b45d810829ce4bf7a702e29c685a5c0
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 16:40:05 2024 +0200

    chore: set correct number of keys

commit 123f5b19360f07a097abc62eb6b1ea18206d5d79
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 16:36:30 2024 +0200

    chore: better action encoder init

commit e9ff8b87007e030c5329a8b0413799e7cc8e21dd
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 16:28:08 2024 +0200

    chore: strict zip

commit 3cb5bcd9e84ebe47f43ac1780470c0116ad25fb3
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 16:19:00 2024 +0200

    chore: add dim on new line

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 1b4cdea028e2e847dbb2a582939651c416b05bd3
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 16:18:27 2024 +0200

    chore: strict zip

    Co-authored-by: Wiem Khlifi <w.khlifi@instadeep.com>

commit 7f7b2b514a53e914d476e15babfead11b2b9e058
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 15:52:21 2024 +0200

    feat: type hint jaxmarl and gigastep env

commit 5424c663649b463a3e606c39742c97f32b617116
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 15:51:13 2024 +0200

    chore: uppercase AS in Dockerfile

    Co-authored-by: Wiem Khlifi <w.khlifi@instadeep.com>

commit 3ecd7723c272c1dd7597d6829d96a5d5948c53e0
Merge: 41467f82 87354a38
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 15:22:27 2024 +0200

    Merge branch 'develop' into feat/pyproject-toml

commit 65538d5377c1ff9bc5e8661b3fb6beb273e613a0
Merge: 3b686481 87354a38
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 15:15:18 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit 2cea286ffb41009291262f150b3285184a0f83d6
Merge: 9682bb29 87354a38
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 14:09:33 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit 9682bb294d592598084b79fd6a909fc7dad3101b
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 14:09:16 2024 +0200

    chore: shape comments

commit 1237117e9073beb8ebd7aa92bbe3e76c72d527c3
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 14:02:46 2024 +0200

    chore: shape comments

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 87354a38fef29d02b21b980a97271412244a791c
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Mon Oct 28 12:03:53 2024 +0100

    fix: fix logging during evaluation for JaxMARL envs (#1116)

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 5aa0c30d4496e8b1d20211c7f8b6662e4c073b35
Merge: 3ff88416 3d541f2d
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 12:18:20 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit 3b686481b9b43567e4721c3e58f70794c63c85b2
Merge: 9334319f 3d541f2d
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 12:15:20 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit 3c81350ff70d04b32539345b9cbb48916cad30e7
Merge: e49a22f7 3d541f2d
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 12:14:54 2024 +0200

    Merge branch 'develop' into feat/merge-qmix

commit 3d541f2d85797678da8f154d81199112eacf8f09
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 12:13:41 2024 +0200

    Fix lbf and rware obs spec types (#1114)

    * fix: lbf and rware obs spec types

    * fix: fix obs spec type in gigastep

commit 9334319fcdca7a46a63cd4deb091c18f71c8d7ea
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 12:10:18 2024 +0200

    chore: more lightweight network configs

commit e49a22f7fe633af72e63246cfc4e1bb6f0c751e6
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 12:05:37 2024 +0200

    chore: shape comments legend

commit f11c21ec515f2dcc04eb91b10aae68afc018e402
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 11:58:21 2024 +0200

    chore: correct shape names in the comments

commit 36f54d1846bddba918d0c6cd8fd2bf637dcd5122
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 28 11:56:47 2024 +0200

    feat: updated dockerfile

commit 98378f3f1554f67e34f00b3f78609afc3a083b73
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 11:40:56 2024 +0200

    feat: add MAT network config type

commit aff9feb11ed4a84be3558324b9d64a78845369c6
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 11:30:58 2024 +0200

    feat: use network for MAT network init

commit 66884fb88b31868461b898066102002837edb5bf
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 09:39:29 2024 +0200

    test: add mat to integration tests

commit 738ec3c7049cdfce8a8f205b147e789ad922d9cd
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 09:29:15 2024 +0200

    feat: add qmix to integration tests

commit c620e17f3a784bea3b1f65d5ba8a79cc3b0be036
Merge: c00f54fd cd31e205
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Mon Oct 28 09:25:03 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit 5f3f8e031945bba018d23888e0b9af59951c1c94
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 09:20:31 2024 +0200

    chore: rename data variables in training

commit fc091890c91f1887cb3087101f01038a04e4cc2a
Merge: c80da623 cd31e205
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 28 09:06:30 2024 +0200

    chore: merge in main

commit a6ae60296d80ee3d4e5d9b3d486dcfe88e46937e
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sun Oct 27 17:27:21 2024 +0100

    fix: minor documentations edits

commit 3ec23e80308c95a263f770b3d793bdf82db575f6
Merge: 8c56da70 cd31e205
Author: Omayma Mahjoub <omaymamahjoub98@gmail.com>
Date:   Sun Oct 27 17:17:51 2024 +0100

    Merge branch 'develop' into feat/sable

commit 8c56da7090303181bc546398b012f5795047b480
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sun Oct 27 16:16:57 2024 +0000

    feat: checkpointer update based on MAT PR

commit 7601bba98f491ed448f6c1dbf5b545d305b0368d
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sun Oct 27 15:55:54 2024 +0000

    feat: add sable to the integration test

commit 8abc50104dd0b1df4e487d80217b7d9165a9e20f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sun Oct 27 15:25:28 2024 +0000

    fix: fixing the apply fn output ordering

commit 78f99c9fb1d82e642e04af135f63d293a9bebdbf
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Sat Oct 26 16:32:24 2024 +0100

    fix: minor updates to net config

commit c80da6236578e3b2421eedb953de2369d95e59b7
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 18:12:36 2024 +0200

    fix: correct spec typing in lbf and rware

commit cd31e2056f3a0e59bf31118c11cc53742fb9eb1d
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 17:23:28 2024 +0200

    feat: smaller networks for tests (#1111)

    * feat: smaller networks and new way to modify test config

    * feat: faster find_replace

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
    Co-authored-by: Wiem Khlifi <w.khlifi@instadeep.com>

    * refactor: move find_replace to test/utils.py

    * chore: pre-commit

    ---------

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
    Co-authored-by: Wiem Khlifi <w.khlifi@instadeep.com>

commit 41467f82df0339b609b2da40c62806d7e5443939
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 16:54:52 2024 +0200

    fix: add unwrapped method to gigastep and jaxmarl wrappers

commit 3ff88416e649371081a8e26af2b8f64ad49f4994
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 16:38:48 2024 +0200

    chore: pre-commit

commit 617504bd178e0e9bef476ff1beb7d90256289ad0
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 16:27:47 2024 +0200

    chore: shape and global state comments

commit cb6bb68e092e45b32d0a810400359ca0463f89c6
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 12:00:20 2024 +0200

    fix: increase sample sequence length in testing config

commit 9a4fcbc6cc23bec3069baf38a0b0dc1d6289af18
Merge: 3b6bd930 bc6eb1a9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Fri Oct 25 11:22:50 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit 3043a9d0c3da871f24efb058ebee01da06a71a40
Merge: 3c4ea141 bc6eb1a9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Fri Oct 25 11:20:10 2024 +0200

    Merge branch 'develop' into feat/pyproject-toml

commit f1549d19a6f90bbd7aa9d226c36be68fffca22b9
Merge: a2d4215a bc6eb1a9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Fri Oct 25 11:19:22 2024 +0200

    Merge branch 'develop' into feat/merge-qmix

commit c00f54fd84c76cc8b6d6e57359370f07d3cef9b4
Merge: ee3aff6a bc6eb1a9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Fri Oct 25 11:19:10 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit a2d4215aeab8e45b6389de9721638b9a0f90ebc1
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 11:17:46 2024 +0200

    chore: reset config defaults

commit dfdfd3232ec7d8a10ac82eaec47346277c0de910
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 11:15:35 2024 +0200

    chore: rename performance variable

commit aae973d1758899852e502598d245cffcff09b626
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 11:09:01 2024 +0200

    chore: rename data_first and and data_next

commit 3c4ea141680341c27956fbb78dfa7049d76066df
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 11:04:24 2024 +0200

    chore: typo

    Co-authored-by: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>

commit 5df0e1e41420d01260d0129871d48673ffd04142
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 25 11:02:49 2024 +0200

    chore: change comment in configs about sequence length

commit bc6eb1a9564cac9ffc861fe3e3ce34cd423ea8ad
Merge: 57c8e640 dfef2387
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 11:00:55 2024 +0200

    Merge pull request #1112 from instadeepai/feat/github-actions-uv

    feat: uv for github actions

commit dfef2387f37331e94daa3547eba6ff3a173adaca
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 09:41:39 2024 +0200

    chore: pre-commit autoupdate

commit d221a85656ad7f5d592f3a05348d137e40ae62d7
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Fri Oct 25 09:33:10 2024 +0200

    feat: uv for github actions

commit ee3aff6a54d62ed1fd96215a0a818d137de8dcc1
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 18:04:41 2024 +0200

    feat: use model params and optimiser state directly instead of named tuples

commit f205b9edc2bcdf1c3188ef016b094d7d95bc6d72
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 16:57:35 2024 +0200

    feat: use .at[].set() with drop instead of jax.lax.cond to update shifted actions

commit 26654b8a77b6fdfdbfde76ecf8bcd6d2a24cbaab
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 16:32:51 2024 +0200

    feat: use make mlp method

commit 91391c7a8d01b9748ce5c5447bfb3df6825a8fcf
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 16:04:45 2024 +0200

    chore: output projection

commit 32e458ae1185f59567ece1827d8f2fd32230ac9a
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 16:03:15 2024 +0200

    chore: use capital letters for dimensions

commit eee0217b552eb0c010640926db5850f5ef7c19d9
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 15:48:23 2024 +0200

    chore: todo about using einops in the future

commit 20a10f5515fbc42929df0c036bb119800197237e
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 15:25:59 2024 +0200

    feat: rename dimensions

commit aae87cdbe8e0dd45f43fc1f7c3a1f4cf01c3ce41
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 15:10:55 2024 +0200

    chore: pass in less seeds

commit 2fc8b929fa21f8fe42219e32ccde1c564df043dd
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 14:47:35 2024 +0200

    feat: split less keys

commit b678bf270d44251901fbf23500200ebe8c589b3d
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 14:35:52 2024 +0200

    chore: linter

commit 80711fd0ada3a5b17252b4d30da7b633cfb198f5
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 14:34:30 2024 +0200

    feat: pass in full observation object to network

commit 2bd4e2ca31c416a11f5ba1a63b9f592d37084b7b
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Thu Oct 24 14:30:46 2024 +0200

    feat: switch to pyproject and update mypy rules

commit 33117027e998315d6acac9ffd3c86e4b479c05c3
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 14:03:47 2024 +0200

    chore: use marlenv type

commit db10ce4b005e204c911f104d1cae6017d05852f7
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 14:01:59 2024 +0200

    chore: don't check action space type on strings

commit eedc8d75aa82d62397ce79b1a0068658b7423c4f
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 13:56:45 2024 +0200

    chore: rename v_loc to value

commit 3688e4021085cc5ba7832904db3a089d27c9cbcd
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 13:53:52 2024 +0200

    chore: move SwiGLU network to torsos file

commit 5e2bbb580ff35786922c38b0a45b2a9d75021be1
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 13:44:24 2024 +0200

    chore: expand mask dims without reshape

commit 8888a5c96bf926ae484fca1ad41567321ede5203
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 24 13:36:09 2024 +0200

    chore: remove old comments

commit 3b6bd93065d9a65befb74653ce4997058ac6b6f5
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Oct 23 17:23:55 2024 +0200

    chore: minor fixes from PR review

commit 2607db4ef4aec7ea25833dff56996392fcf6c594
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Oct 23 16:16:30 2024 +0200

    fix: small logger bug for arrays with a single element

commit 388dc6a9f13fd2378a1bf6df122c09779139bf45
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Oct 23 16:16:14 2024 +0200

    chore: update default hasac config

commit 35db17ddf5eac51826e9ad851114a59587a5c979
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 16:01:08 2024 +0200

    chore: slightly more lightweight configs and comment clean up

commit 7d5e2393323307580d65161867836580082b2c93
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 15:53:22 2024 +0200

    Update mava/configs/system/q_learning/rec_qmix.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit a7e3734958f45919e2346e1ffc06699d5ea7b591
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 15:53:12 2024 +0200

    Update mava/configs/env/smax.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 4fcce3fbdbb0c7868b666ac9995588be8d652f9f
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 15:53:01 2024 +0200

    Update mava/configs/system/q_learning/rec_qmix.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit ba71cc58998241eee741bb25d3ede885ead3ad2e
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 15:51:17 2024 +0200

    chore: fixed update_fn return type

commit f6f81e41b4c4b46a5bc176b7019e86860627dcdf
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 15:49:46 2024 +0200

    feat: paramterise learner state with qmix and qlearning params

commit 4f2076b3c9086667eae1750373fd9b1866167c7c
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 15:39:30 2024 +0200

    feat: store q_error and reuse when logging

commit bde58fd30547eab49cbc3eb3e5c6972a04b7237b
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 15:35:10 2024 +0200

    chore: comment clean up and variable renaming

commit 2dcaaceb301c41091a287a5669ea63df3487c7ff
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 14:55:34 2024 +0200

    chore: whitespace removed

commit a19d5fa684b86d69e82512522fc59a7d56a1f02f
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:52:02 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit cce233a2fc19a4ef914c69538e108c90259a8ee8
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:51:03 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 9294ee859c28acdaf8698a6ad01f3eb3589fda38
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:49:02 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit d8d80f741fa758aa9bc71a7aeb9026b0e3f44f6b
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:48:26 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 59fe5e2d145b13c839399efb6a6b9606eb1570e8
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:47:41 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 448495bb25fbe2dcdf3f82b6ed7655a9ce6bc045
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:45:48 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit fece034aa964792f027380eb587f4f72951569d6
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:44:32 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 9c4aea55a0e111109b1c9f7f2641eccbe22fa68d
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:24:03 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 7482c8f47d651e10dd1cccb800f726cfa30d8ba9
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:23:35 2024 +0200

    Update mava/systems/q_learning/anakin/rec_qmix.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit f10b2953ad57b5319c33a53a849c6bf150b6b825
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:23:25 2024 +0200

    Update mava/networks/torsos.py

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 79f9858ca6cbd81e4e1b5f9acba83a666e3a0d89
Author: Ruan de Kock <33461981+RuanJohn@users.noreply.github.com>
Date:   Wed Oct 23 14:21:37 2024 +0200

    Update mava/configs/system/q_learning/rec_qmix.yaml

    Co-authored-by: Sasha Abramowitz <reallysasha@gmail.com>

commit 034859e6a59c5397552847572e6af884a180e076
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Wed Oct 23 09:18:26 2024 +0200

    chore: match exact rec_iql style

commit 73c4611144457a0359c8cdb4990461c39fd9fdc8
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 22 17:22:20 2024 +0100

    feat: fix pre commits

commit 576d5d4b37a92fd48fe6212a82cb0397722606a0
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 22 14:15:23 2024 +0100

    feat: remove parallel representation

commit 067b4ef33d76d0fdd4a7afcad607d6f7913a6a14
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 14:49:42 2024 +0200

    feat: chore pre-commit

commit b131cb7425e2114cd7fa8efbfe6d9356bb3afe11
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 14:46:13 2024 +0200

    chore: add license

commit 61d70ca734d8ab426c9962a4daff57a58bbd6b89
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 14:42:31 2024 +0200

    chore: remove unused network file

commit 8357ef57c942d50b79f88ed1ccd2b6af197f556a
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 14:01:01 2024 +0200

    chore: duplicate whole info dict at the same time

commit 10647450c9af5b1c8f20f2f2c29648f747257bdc
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 22 12:51:41 2024 +0100

    feat: clean rec sable system file and fix checkpointer

commit 63723c7da531727acfa0e5967c5ffe1ea2aa1ff9
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 12:44:02 2024 +0200

    chore: set correct MLP torso size in configs

commit 59972c1e95e8ea1affd0781509ed9f3e96bc4a30
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 22 11:37:41 2024 +0100

    feat: add util fns for acting and training plus support for chunkwise

commit f0dbc65dfe174cd1ff9ee181cd5247c478187448
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 12:35:20 2024 +0200

    chore: replace jax.tree_map with jax.tree.map

commit 3fb530438f46009c336e225b8c09696821df0495
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 12:31:30 2024 +0200

    feat: instantiate networks with hydra utils

commit b7353346133df7525f760cb2dda9759f8550cbbc
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 10:27:10 2024 +0200

    chore: extra comment on term_or_trunc vs terminal

commit 63eb99f362fdcebdffc2f669e0ff71bfb45fc90b
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 10:24:36 2024 +0200

    feat: add option for hard or soft target updates

commit 64325d7c5715a29e083d4cf3b7f60fc36b197e16
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Oct 22 10:23:05 2024 +0200

    chore: add hasac test

commit 5fcfcf2d66ac4e90faa3636dcdc03cd2dd944737
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Oct 22 10:21:46 2024 +0200

    chore: add system name

commit e13a6e15c698f76f694d94c5a5c0200ea4ba82db
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 10:10:48 2024 +0200

    chore: type hints

commit 3b8d76195d23f45b0954849cd7c04b19929299b0
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 09:53:38 2024 +0200

    chore: clean up comments

commit 928c9c55fbfdba2c1998280b7d430307680fea4c
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Tue Oct 22 09:50:58 2024 +0200

    fix: jax utils

commit 4915b97e29e7817a0c07ae1e2035bdd0ac6dd72c
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 09:45:27 2024 +0200

    chore: move types to qlearning types file

commit e3195becaf47aec252168e5b6ed0dedb63277a29
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 09:36:37 2024 +0200

    chore: move torso and qmix network files

commit cefe4da21828c223b73e2eae2cf0d575c87efc8f
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 09:19:20 2024 +0200

    feat: replace rec_qmix code

commit 11546a22d757b095882c961de02cd2b81590b3f4
Merge: 97e23cfe 57c8e640
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 22 09:15:28 2024 +0200

    Merge branch 'develop' into feat/merge-qmix

commit 8d85d323004d6e66af4d06a25bef65cf8d985cbd
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 21 17:48:31 2024 +0200

    feat: move decoding functions to network utils

commit fd09d59704e4a2bd5705a26b091a9523b78ef931
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 21 16:05:21 2024 +0200

    feat: use get_action_head util instead of manually setting action space type

commit 6fe1f9c4782b2afd8caa7210fdf2beaa7b528e5a
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 21 14:54:31 2024 +0100

    feat: add chunkwise timestep fn to the modular net

commit 649a70ff6dedb31c1345ba0d1e3d729dfbd11bb3
Merge: 4e3bf428 57c8e640
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 21 15:41:09 2024 +0200

    Merge branch 'develop' into feat/implement-mat

commit 519025b705b0a572acebc91801e277511d85b617
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 21 15:22:22 2024 +0200

    chore: update config to new mava and cleanup

commit f03e6ca79a489dbe8c9a6cf1cf394c81ab59bbdf
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 21 14:17:06 2024 +0100

    feat: modular net sable

commit 0eeaa58ad89073cc51fd64092ab595b6d3a349a5
Merge: eeda7f50 57c8e640
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 21 14:51:10 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit 57c8e64059bd59005d80c1c8278eef65855253b9
Merge: c4e40ce2 a14cfb2f
Author: Wiem Khlifi <w.khlifi@instadeep.com>
Date:   Mon Oct 21 13:25:03 2024 +0100

    Merge pull request #1104 from instadeepai/feat/act_head

    feat: set the action head automatically

commit eeda7f508e13d1c291dacdd41c5efe074761f4eb
Merge: f19d9bc6 c4e40ce2
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Mon Oct 21 13:53:07 2024 +0200

    Merge branch 'develop' into feat/hasac2

commit a14cfb2fc3ca7b979a27e4076d168970789dad63
Author: WiemKhlifi <w.khlifi@instadeep.com>
Date:   Mon Oct 21 11:31:04 2024 +0100

    feat: return action type with act head

commit 97e23cfe0f8632960f5a61dd29a3e3093916f899
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 21 11:40:26 2024 +0200

    feat: follow old qmix in trainer

commit f1cb0f20fd1fcc8871539a18803fa3b61d7b6979
Author: WiemKhlifi <w.khlifi@instadeep.com>
Date:   Fri Oct 18 17:17:49 2024 +0100

    feat: use action_sepc to select action head type

commit 59d354fab785db0c9bc761498e0770880e053835
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Fri Oct 18 16:41:42 2024 +0100

    feat: add only timestep positional encoding

commit 60d8ffa9d7d30829c79692145014a58a69017156
Author: WiemKhlifi <w.khlifi@instadeep.com>
Date:   Fri Oct 18 15:30:32 2024 +0100

    fix: update ff_ippo_store_experience file

commit 8ac214a208619aca95763df15c427e8dc140db2a
Author: WiemKhlifi <w.khlifi@instadeep.com>
Date:   Fri Oct 18 15:07:28 2024 +0100

    feat: set the action head automatically based on env name

commit 4e3bf428f480dc4b801f153e8b6c7e6d7a59273d
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 18 14:21:05 2024 +0200

    feat: infer batch size and num agents from obs rep instead of manually passing in

commit c18e2339c203138ed429e609d9704b8f05a473a6
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 18 14:04:04 2024 +0200

    chore: comment cleanup

commit 5e233b0cc8ff49db49743de3e4706159d7dd4072
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 18:42:44 2024 +0200

    feat: continuous actions training

commit 562c82a22011f5c1988f77dea935a5e90753eca9
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 18:17:19 2024 +0200

    feat: pass key through trainer to prepare for continuous action spaces

commit b08388dc261dd9d1c64dc518c14f9570a5ab05b2
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 18:02:43 2024 +0200

    feat: squeeze inside of network and not in system run file

commit 42b48bb69c70f9a6101afc1f120fd8d2495c1a01
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Oct 17 16:51:15 2024 +0100

    fix: minor fix to the positional encoding for timestep

commit 13e42a7fe3e4390a3a46f3168f18e16b3d7ac087
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Oct 17 16:28:10 2024 +0100

    feat: timestep encoding for rec sable

commit e4bc9692667580b3fc9e1a70d9b1fa688f2e1b06
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 16:53:25 2024 +0200

    feat: use jax.tree.map instead of deprecated jax.tree_map

commit 2d4f7edba23d18ec1ba726e7a0745f3543d479b3
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 16:44:30 2024 +0200

    chore: remove redundant obs being passed around

commit 848c625cc455428fb4b35c5b43d685beaa517612
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Thu Oct 17 16:36:57 2024 +0200

    feat: prepare to starting using mava discrete action head

commit d589b7e3300b9ad15d838f635dd9f7014ae527a1
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Oct 17 12:20:29 2024 +0100

    feat: pos encoding setup

commit 3fab043212b90875f1f27bb9e2e76289f96e318f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 16 14:59:17 2024 +0100

    fix: fix args documentation for learner_fn

commit b180ef2486613b8f37559b50e08482883b7ddb45
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 16 14:06:15 2024 +0100

    feat: add Sable non memory

commit 8d7398a24e77c6b1e6ac8a46c57b1f47da9cd8e3
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 16 12:35:57 2024 +0100

    feat: add evaluator to sbale

commit 094cc652adcb10dfa91233121655e6bd91ff3724
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 16 11:39:11 2024 +0100

    feat: update types used for sable

commit b983cca0991a75fcdedc8a564af1e762bc68ec3f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 15 14:55:06 2024 +0100

    feat: minor update

commit 38af5baa15259587b83a74553a34eba6007a1ad1
Merge: f44e6a5a c4e40ce2
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 15 14:42:14 2024 +0100

    feat: merge develop branch

commit 4964fa8b36019ad76bc0fd02e75274b4ed126ea4
Merge: c8005cb3 666660b3
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 14:57:59 2024 +0200

    feat: merge in network refactor

commit f44e6a5ab2164c2d918129e4d02a9613d698260d
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 15 13:54:39 2024 +0100

    feat: run pre commits

commit 6b928c76eda224f6c25be029da23b82c7d4775e2
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Tue Oct 15 13:53:16 2024 +0100

    feat: sable clean code and documentation (types still uncorrect

commit c8005cb3c6e41260715de1b258b7296b409e68b8
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 12:47:39 2024 +0200

    feat: use tfp instead of distrax

commit 53dd9d7e939af98bfc8b2dd6c5939d34ca5f41b0
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 12:06:59 2024 +0200

    feat: remove autoregressive scans

commit ff5ec1030990a219922f35bea1d5289c7471e7a2
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 10:25:26 2024 +0200

    feat: use MAT types

commit 84f0852088339bfc61c42bba04c601766cefeb85
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 09:31:43 2024 +0200

    feat: remove value norm

commit 30d29477c3c2957527c8550c3871aac473012fd5
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 09:19:51 2024 +0200

    feat: remove huber loss

commit 2905604b271fcb1fd8490cfc42382147366f0673
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Tue Oct 15 09:05:30 2024 +0200

    feat: add discrete MAT and training on rware

commit 88a619ab21f5e79bb24be30ee7b29c945774331e
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 14 17:03:46 2024 +0100

    feat: clean util functions

commit efcd97528a38a1cdd34bed9613c186fd61a086e6
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 14 16:19:54 2024 +0100

    feat: rename sable memory to rec sable

commit c15edb06e45b0149bd7d8f2684f10d0ea3845c6f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 14 15:50:33 2024 +0100

    feat: add trainable sable system (unclean) to mava

commit 8b1860285fba4470d69d6b5646764c6aad477724
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Mon Oct 14 15:42:20 2024 +0200

    feat: set correct sequence length and reward dim in buffer init

commit 72b00fdd9438d0afe998a121c475ea4b4893230c
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 14 14:31:04 2024 +0100

    feat: run pre commits

commit 95c12657e6f6b802f81a787a7f95758898aaec2f
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Mon Oct 14 14:20:59 2024 +0100

    feat: add sable network file

commit c7685edb31bd1a126f34f173506e2a15e7d900cb
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 11 15:41:51 2024 +0200

    feat: qmix training with new API

commit 1c2009308b2a8891913118bdc2875fa2d97d8482
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 11 12:24:52 2024 +0200

    feat: qmix piping through with distributional networks

commit b2bd79a267589d9def756a94d21797e3b0730e64
Merge: 43f14e5e 2a1d2d8b
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 11 11:06:48 2024 +0200

    feat: merge in develop

commit 43f14e5e5a5341ed5f59904252b329f18c4d8e83
Author: Ruan de Kock <ruanjdekock@gmail.com>
Date:   Fri Oct 11 11:05:53 2024 +0200

    feat: qmix with new evaluator piping through

commit 8d35f400b270ee23d9e5be05316b30a2ecd8a80b
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Thu Oct 10 11:26:57 2024 +0100

    feat: add retention file

commit e767bd90381a69f58179cc023991044c812c92e8
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 9 12:22:37 2024 +0100

    feat: move ff and rnn networks into a folder

commit 09d5fdfbbf04aa16c18e1173655d5128cf0aeca7
Author: OmaymaMahjoub <omayma.mahjoub@ensi-uma.tn>
Date:   Wed Oct 9 12:17:20 2024 +0100

    feat: add config files of sable

commit f19d9bc6d8a460817723520cefb2d3ea56bbc328
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Aug 7 13:42:53 2024 +0200

    fix: optimizers for multiple parameters

commit 4673da87ea6f88ac662ea69fa6633cb10b364072
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Aug 7 13:03:08 2024 +0200

    feat: grad clip + fix final return

commit aad6a0eefd45d9e5f6df75251e2434367cd0fd67
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Aug 7 12:16:04 2024 +0200

    fix: evaluator working for hasac

commit 658f6277f665b2b437f9ba091b1241cbb9f34d8a
Author: Sasha Abramowitz <reallysasha@gmail.com>
Date:   Wed Aug 7 11:46:21 2024 +0200

    feat: hasac
Louay-Ben-nessir committed Nov 12, 2024
1 parent 245aecc commit 649b93b
Showing 97 changed files with 6,587 additions and 712 deletions.
26 changes: 26 additions & 0 deletions .dockerignore
@@ -0,0 +1,26 @@
+.dockerignore
+
+.DS_Store
+.idea
+.vscode
+
+.git
+.github
+.gitignore
+.gitlab-ci.yml
+.gitmodules
+
+.conda
+.neptune
+.pytest_cache
+.mypy_cache
+.ruff_cache
+
+.pre-commit-config.yaml
+commitlint.config.js
+LICENSE
+
+*.egg-info
+docs/
+outputs/
+results/
28 changes: 19 additions & 9 deletions .github/workflows/tests_linters.yaml
@@ -4,26 +4,36 @@ on: [ pull_request ]

 jobs:
   tests-and-linters:
-    name: "Python ${{ matrix.python-version }} on ${{ matrix.os }}"
-    runs-on: "${{ matrix.os }}"
-    timeout-minutes: 10
+    name: "Python ${{ matrix.python-version }} on ubuntu-latest"
+    runs-on: ubuntu-latest
+    timeout-minutes: 20

     strategy:
       matrix:
         python-version: ["3.12", "3.11"]
-        os: [ubuntu-latest]

     steps:
       - name: Checkout mava
-        uses: actions/checkout@v3
-      - uses: actions/setup-python@v4
+        uses: actions/checkout@v4
+
+      - name: Install uv
+        uses: astral-sh/setup-uv@v3
+        with:
+          version: "0.4.26"
+          enable-cache: true
+          cache-dependency-glob: "requirements/requirements**.txt" # invalidate cache when requirements file changes
+
+      - uses: actions/setup-python@v5
         with:
           python-version: "${{ matrix.python-version }}"
-      - name: Upgrade pip
-        run: pip install --upgrade pip

       - name: Install python dependencies 🔧
-        run: pip install .[dev]
+        run: uv pip install .[dev]
+        env:
+          UV_SYSTEM_PYTHON: 1

       - name: Run linters 🖌️
         run: pre-commit run --all-files --verbose

       - name: Run tests 🧪
         run: pytest -p no:warnings
8 changes: 4 additions & 4 deletions .pre-commit-config.yaml
@@ -1,11 +1,11 @@
-default_stages: [ "commit", "commit-msg", "push" ]
+default_stages: [ "pre-commit", "commit-msg", "pre-push" ]
 default_language_version:
   python: python3

 repos:
   - repo: https://github.com/astral-sh/ruff-pre-commit
     # Ruff version.
-    rev: v0.4.8
+    rev: v0.7.1
     hooks:
       # Run the linter.
       - id: ruff
@@ -16,7 +16,7 @@ repos:
         types_or: [ python, pyi, jupyter ]

   - repo: https://github.com/pre-commit/pre-commit-hooks
-    rev: v4.6.0
+    rev: v5.0.0
     hooks:
       - id: end-of-file-fixer
         name: "End of file fixer"
@@ -42,7 +42,7 @@ repos:
         pass_filenames: false

   - repo: https://github.com/alessandrojcm/commitlint-pre-commit-hook
-    rev: v9.16.0
+    rev: v9.18.0
     hooks:
       - id: commitlint
         name: "Commit linter"
48 changes: 13 additions & 35 deletions Dockerfile
@@ -1,45 +1,23 @@
-FROM nvidia/cuda:11.8.0-cudnn8-runtime-ubuntu22.04
+# Stage 1: Build environment
+FROM python:3.12-slim AS core

-# Ensure no installs try to launch interactive screen
-ARG DEBIAN_FRONTEND=noninteractive
+# Add git
+RUN apt-get update && apt-get install -y git build-essential pkg-config libhdf5-dev

-# Update packages and install python3.9 and other dependencies
-RUN apt-get update -y && \
-    apt-get install -y software-properties-common git && \
-    add-apt-repository -y ppa:deadsnakes/ppa && \
-    apt-get install -y python3.12 python3.12-dev python3-pip python3.12-venv && \
-    update-alternatives --install /usr/bin/python python /usr/bin/python3.12 10 && \
-    python -m venv mava && \
-    apt-get clean && \
-    rm -rf /var/lib/apt/lists/*
+# Add uv and use the system python (no need to make venv)
+USER root
+COPY --from=ghcr.io/astral-sh/uv:0.4.20 /uv /bin/uv
+ENV UV_SYSTEM_PYTHON=1

-# Setup virtual env and path
-ENV VIRTUAL_ENV /mava
-ENV PATH /mava/bin:$PATH
+WORKDIR /home/app/mava

-# Location of mava folder
-ARG folder=/home/app/mava
-
-# Set working directory
-WORKDIR ${folder}
-
-# Copy all code needed to install dependencies
-COPY ./requirements ./requirements
-COPY setup.py .
-COPY README.md .
-COPY mava/version.py mava/version.py
+COPY . .

 RUN echo "Installing requirements..."
-RUN pip install --quiet --upgrade pip setuptools wheel && \
-    pip install -e .
+RUN uv pip install -e .

 # Need to use specific cuda versions for jax
-ARG USE_CUDA=true
+ARG USE_CUDA=false
 RUN if [ "$USE_CUDA" = true ] ; \
-    then pip install "jax[cuda12]==0.4.30" ; \
+    then uv pip install jax[cuda12]==0.4.30 ; \
     fi

-# Copy all code
-COPY . .
-
 EXPOSE 6006
17 changes: 3 additions & 14 deletions Makefile
@@ -1,19 +1,8 @@
-# Check if GPU is available
-NVCC_RESULT := $(shell which nvcc 2> NULL)
-NVCC_TEST := $(notdir $(NVCC_RESULT))
-ifeq ($(NVCC_TEST),nvcc)
-GPUS=--gpus all
-else
-GPUS=
-endif
-
-# For Windows use CURDIR
-ifeq ($(PWD),)
-PWD := $(CURDIR)
-endif
+# Check if GPU is available - if `nvidia-smi` works then use GPUs
+GPUS := $(shell command -v nvidia-smi > /dev/null && nvidia-smi > /dev/null 2>&1 && echo "--gpus all" || echo "")

 # Set flag for docker run command
-BASE_FLAGS=-it --rm -v ${PWD}:/home/app/mava -w /home/app/mava
+BASE_FLAGS=-it --rm
 RUN_FLAGS=$(GPUS) $(BASE_FLAGS)

 DOCKER_IMAGE_NAME = mava
2 changes: 1 addition & 1 deletion README.md
@@ -159,7 +159,7 @@ cd mava
 pip install -e .
 ```

-We have tested `Mava` on Python 3.11 and 3.12, but earlier versions may also work. Note that because the installation of JAX differs depending on your hardware accelerator,
+We have tested `Mava` on Python 3.11 and 3.12, but earlier versions may also work. Specifically, we use Python 3.10 for the Quickstart notebook on Google Colab since Colab uses Python 3.10 by default. Note that because the installation of JAX differs depending on your hardware accelerator,
we advise users to explicitly install the correct JAX version (see the [official installation guide](https://github.com/google/jax#installation)). For more in-depth installation guides including Docker builds and virtual environments, please see our [detailed installation guide](docs/DETAILED_INSTALL.md).
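
As a rough illustration of the note above, the following sketch reports whether the running interpreter is one of the versions the README says were tested, and which backend JAX selected if JAX is installed. It is not part of Mava: `TESTED_VERSIONS` and `describe_environment` are hypothetical names introduced here, and the only JAX call assumed is `jax.default_backend()`.

```python
import importlib.util
import sys

# Versions the README states were tested (hypothetical constant, not from Mava).
TESTED_VERSIONS = {(3, 11), (3, 12)}


def describe_environment(version_info=sys.version_info):
    """Summarise the Python version and, if JAX is installed, its default backend."""
    major_minor = (version_info[0], version_info[1])
    report = {
        "python": f"{major_minor[0]}.{major_minor[1]}",
        "tested": major_minor in TESTED_VERSIONS,
        "jax_backend": None,  # stays None when JAX is not installed
    }
    if importlib.util.find_spec("jax") is not None:
        try:
            import jax

            # Name of the platform JAX picked, e.g. "cpu", "gpu" or "tpu".
            report["jax_backend"] = jax.default_backend()
        except ImportError:
            pass
    return report


if __name__ == "__main__":
    print(describe_environment())
```

If `jax_backend` comes back as `cpu` on a machine with an accelerator, that would suggest the accelerator-specific JAX build was not installed.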

## Quickstart ⚡