Compute the reward function for vanilla policy gradient. #1612
tests.yaml
on: pull_request
Matrix: test-atari-envs
Matrix: test-atari-multigpu-envs
Matrix: test-core-envs
Matrix: test-envpool-envs
Matrix: test-mujoco-envs
Matrix: test-mujoco_py-envs
Matrix: test-pettingzoo-envs
Matrix: test-procgen-envs
Annotations
3 errors
test-mujoco-envs (3.10, 1.7, ubuntu-22.04)
Process completed with exit code 1.
test-mujoco-envs (3.9, 1.7, ubuntu-22.04)
Process completed with exit code 1.
test-mujoco-envs (3.8, 1.7, ubuntu-22.04)
Process completed with exit code 1.