dobro12 / off_policy_trpo Public


Simple implementation of off policy TRPO.

Repository contents:

PPO
TRPO
imgs
off_policy_PPO
off_policy_TRPO
utils
.dobro_package
.gitignore
README.md
visualize.py


Off-Policy TRPO

This is a simple implementation of off-policy TRPO (link).

requirements

python 3.7 or greater
gym
mujoco-py (https://github.com/openai/mujoco-py)
stable-baselines3
torch 1.10.0 or greater (torch>=1.10.0)
requests
wandb
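
The version floors above can be checked before training. The snippet below is a minimal sketch, not part of this repository: `parse_version` and `meets_minimum` are hypothetical helper names. It compares versions as integer tuples, because a plain string comparison would rank "1.9.0" above "1.10.0".

```python
import re
import sys


def parse_version(text):
    """Turn a string such as '1.10.0+cu113' into (1, 10, 0).

    Local build tags after '+' and non-numeric suffixes are ignored.
    """
    parts = []
    for piece in text.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if `installed` is at least `minimum`."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '1.10' and '1.10.0' compare as equal.
    a = a + (0,) * (width - len(a))
    b = b + (0,) * (width - len(b))
    return a >= b


if __name__ == "__main__":
    # The README asks for python 3.7 or greater.
    print("python ok:", sys.version_info >= (3, 7))
    # With torch installed, the same helper applies to its version string:
    #   import torch
    #   print("torch ok:", meets_minimum(torch.__version__, "1.10.0"))
```

The torch check is left as a comment so the snippet runs even when torch is not yet installed.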

results

HalfCheetah-v2

Results were obtained by training with three seeds.
{algo_name}-Norm: the algorithm trained with state normalization.
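
For readers unfamiliar with state normalization, the following is a minimal sketch of one common scheme: keep a running mean and variance of observed states and standardize each state before it reaches the policy. This is an assumption about what "state normalization" means here, not the code used in this repository. The class name `RunningStateNormalizer`, the `eps` and `clip` parameters, and their default values are hypothetical.

```python
import numpy as np


class RunningStateNormalizer:
    """Standardize states with running statistics (illustrative only)."""

    def __init__(self, state_dim, eps=1e-8, clip=10.0):
        self.mean = np.zeros(state_dim)
        self.var = np.ones(state_dim)
        self.count = 0
        self.eps = eps    # avoids division by zero for constant dimensions
        self.clip = clip  # bounds the normalized values

    def update(self, batch):
        """Merge a batch of states, shape (n, state_dim), into the statistics."""
        batch = np.atleast_2d(np.asarray(batch, dtype=np.float64))
        b_mean = batch.mean(axis=0)
        b_var = batch.var(axis=0)
        b_count = batch.shape[0]

        total = self.count + b_count
        delta = b_mean - self.mean
        # Parallel mean/variance combination of the old and new statistics.
        new_mean = self.mean + delta * b_count / total
        m2 = (self.var * self.count + b_var * b_count
              + delta ** 2 * self.count * b_count / total)
        self.mean, self.var, self.count = new_mean, m2 / total, total

    def __call__(self, state):
        """Return the standardized, clipped state."""
        z = (np.asarray(state) - self.mean) / np.sqrt(self.var + self.eps)
        return np.clip(z, -self.clip, self.clip)


if __name__ == "__main__":
    # HalfCheetah-v2 observations have 17 dimensions; random data stands in
    # for real rollouts in this example.
    rng = np.random.default_rng(0)
    states = rng.normal(3.0, 2.0, size=(500, 17))
    normalizer = RunningStateNormalizer(state_dim=17)
    normalizer.update(states[:200])
    normalizer.update(states[200:])
    print(normalizer(states[0]).shape)
```

Updating the statistics batch by batch gives the same mean and variance as computing them over all states at once, so the normalizer can be refreshed after every rollout without storing past data.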
