Tags · google-deepmind/bsuite

This repository has been archived by the owner on Oct 7, 2024. It is now read-only.

0.3.5

Relax versioning constraints for tf/tfp, bump patch version to 0.3.5.

PiperOrigin-RevId: 358134728
Change-Id: Iba086222e5c97d8e6410097b38668a3935c923dc

Feb 18, 2021
a07485f
zip
tar.gz
Notes

0.3.4

Bump version again (incorrect tag was used before) and use PyPI's rlax.

PiperOrigin-RevId: 334553236
Change-Id: I88eca16269fabf975dd0c8f7ccd8a8c2374b78c9

Sep 30, 2020
36a1d1e
zip
tar.gz

0.3.3

Internal change.

PiperOrigin-RevId: 334334765
Change-Id: I1080d6b302fb77b3d75633c4d2ed487e6b26730d

Sep 29, 2020
cf2b5cb
zip
tar.gz

0.3.2

Bump version to 0.3.2.

PiperOrigin-RevId: 315450863
Change-Id: I23a258dd001aae14bde3edd7c29df79579030eaf

Jun 9, 2020
8d52410
zip
tar.gz
Notes

0.3.1

Drop meaningless "level" columns from the correct dataframe.

(Does not affect performance)

PiperOrigin-RevId: 313763435
Change-Id: I8973213e969be65a0a59ca73b07a1b8a628751a2

May 29, 2020
876be3a
zip
tar.gz
Notes

0.3.0

Calculate best episode using full episode return in cartpole_swingup.

Return is non-monotonic in this problem; currently this cherry-picks the peak of return during the episode.

Also applied same change to base cartpole for consistency and efficiency, but cartpole return is monotonic (so not a bug).

PiperOrigin-RevId: 308033113
Change-Id: I9add00d41f8e87d518e00c3fef9cd9ad7ad18d0b

Apr 23, 2020
beb1630
zip
tar.gz
Notes

0.2.0

Re-organize baselines into subdirectories according to their provenan…

…ce/libraries used.

- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).

Also adopt more standard naming practice within each agent folder (agent.py).

PiperOrigin-RevId: 305674544
Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97

Apr 9, 2020
0fdf026
zip
tar.gz
Notes

0.2

Re-organize baselines into subdirectories according to their provenan…

…ce/libraries used.

- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).

Also adopt more standard naming practice within each agent folder (agent.py).

PiperOrigin-RevId: 305674544
Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97

Apr 9, 2020
0fdf026
zip
tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.3.5

0.3.4

0.3.3

0.3.2

0.3.1

0.3.0

0.2.0

0.2

Tags: google-deepmind/bsuite