This repository has been archived by the owner on Oct 7, 2024. It is now read-only.

Tags: google-deepmind/bsuite

0.3.5

Relax versioning constraints for tf/tfp, bump patch version to 0.3.5.

PiperOrigin-RevId: 358134728
Change-Id: Iba086222e5c97d8e6410097b38668a3935c923dc

0.3.4

Bump version again (incorrect tag was used before) and use PyPI's rlax.

PiperOrigin-RevId: 334553236
Change-Id: I88eca16269fabf975dd0c8f7ccd8a8c2374b78c9

0.3.3

Internal change.

PiperOrigin-RevId: 334334765
Change-Id: I1080d6b302fb77b3d75633c4d2ed487e6b26730d

0.3.2

Bump version to 0.3.2.

PiperOrigin-RevId: 315450863
Change-Id: I23a258dd001aae14bde3edd7c29df79579030eaf

0.3.1

Drop meaningless "level" columns from the correct dataframe.

(Does not affect performance)

PiperOrigin-RevId: 313763435
Change-Id: I8973213e969be65a0a59ca73b07a1b8a628751a2

0.3.0

Calculate best episode using full episode return in cartpole_swingup.

Return is non-monotonic in this problem; currently this cherry-picks the peak of return during the episode.

Also applied same change to base cartpole for consistency and efficiency, but cartpole return is monotonic (so not a bug).

PiperOrigin-RevId: 308033113
Change-Id: I9add00d41f8e87d518e00c3fef9cd9ad7ad18d0b
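The difference described in the 0.3.0 message can be illustrated with a small sketch. This is not bsuite's code: the function names and reward sequences below are hypothetical, chosen only to show why taking the peak of the running return overstates a non-monotonic episode while leaving a monotonic one unchanged.

```python
from itertools import accumulate


def peak_running_return(rewards):
    """Highest cumulative return reached at any step of the episode."""
    return max(accumulate(rewards))


def full_episode_return(rewards):
    """Cumulative return at the end of the episode."""
    return sum(rewards)


def best_episode(episodes, score):
    """Best score over a collection of episodes (hypothetical helper)."""
    return max(score(rewards) for rewards in episodes)


# Made-up reward sequences. Negative rewards make the running return
# non-monotonic, as the message says is the case for cartpole_swingup.
non_monotonic = [
    [1.0, 1.0, -3.0],  # running return: 1.0, 2.0, -1.0
    [0.5, 0.0, 0.5],   # running return: 0.5, 0.5, 1.0
]

# Non-negative rewards make the running return monotonic, so both
# scores agree (the base cartpole case described in the message).
monotonic = [
    [1.0, 1.0, 1.0],
    [1.0, 1.0],
]

print(best_episode(non_monotonic, peak_running_return))  # 2.0
print(best_episode(non_monotonic, full_episode_return))  # 1.0
print(best_episode(monotonic, peak_running_return))      # 3.0
print(best_episode(monotonic, full_episode_return))      # 3.0
```

In the non-monotonic case the peak-based score picks the first episode (2.0) even though it ends at -1.0; scoring by full episode return picks the second (1.0).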

0.2.0

Re-organize baselines into subdirectories according to their provenance/libraries used.

- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).

Also adopt more standard naming practice within each agent folder (agent.py).

PiperOrigin-RevId: 305674544
Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97

0.2

Re-organize baselines into subdirectories according to their provenance/libraries used.

- tf: TensorFlow 2/Sonnet 2/TRFL-based agents.
- jax: JAX/Haiku/rlax-based agents.
- third_party: Agents created by third parties (not DeepMind).

Also adopt more standard naming practice within each agent folder (agent.py).

PiperOrigin-RevId: 305674544
Change-Id: I3d4f076fb96d2e0250cfbb3f1adf163ce6932e97