dobro12 / CPO

Constrained Policy Optimization implementation on Safety Gym

CPO

A simple TensorFlow 1 and PyTorch implementation of Constrained Policy Optimization (CPO) on Safety Gym.
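
For orientation, the policy update that CPO solves at each iteration can be written as below. This follows the formulation in the CPO paper (arXiv:1705.10528). The notation is the paper's and does not appear in this repository's README: A is the reward advantage, A_{C_i} the advantage of the i-th cost, d_i the cost limit, and delta the trust-region size. How this code approximates the problem is not described here.

```latex
\begin{aligned}
\pi_{k+1} = \arg\max_{\pi \in \Pi_\theta} \quad
  & \mathbb{E}_{s \sim d^{\pi_k},\, a \sim \pi}\left[ A^{\pi_k}(s, a) \right] \\
\text{s.t.} \quad
  & J_{C_i}(\pi_k) + \frac{1}{1-\gamma}\,
    \mathbb{E}_{s \sim d^{\pi_k},\, a \sim \pi}\left[ A^{\pi_k}_{C_i}(s, a) \right] \le d_i
    \quad \forall i, \\
  & \bar{D}_{KL}(\pi \,\|\, \pi_k) \le \delta
\end{aligned}
```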

requirements

gym
mujoco
safety_gym
tensorflow 1.13.1
pytorch 1.10.1

how to use

tf1

python train.py  # training
python train.py test  # test

torch

python main.py  # training
python main.py --test --resume {#_of_checkpoint}  # test
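
The torch commands above use a `--test` flag and a `--resume` option that takes a checkpoint number. As a rough illustration of how such flags are commonly handled, here is a minimal argparse sketch. It is not taken from this repository's main.py: the helper names `build_parser` and `describe_run`, the integer type of `--resume`, and its default of 0 are all assumptions.

```python
import argparse


def build_parser():
    # Hypothetical parser: only the flag names --test and --resume come from
    # the commands above; types, defaults, and help text are assumptions.
    parser = argparse.ArgumentParser(description="CPO train/test (illustrative sketch)")
    parser.add_argument("--test", action="store_true",
                        help="run evaluation instead of training")
    parser.add_argument("--resume", type=int, default=0,
                        help="checkpoint number to load (0 = start fresh, assumed)")
    return parser


def describe_run(argv):
    # Return (mode, checkpoint) so the caller can branch on train vs. test.
    args = build_parser().parse_args(argv)
    mode = "test" if args.test else "train"
    return mode, args.resume
```

With this sketch, `describe_run(["--test", "--resume", "100"])` selects test mode with checkpoint 100, and an empty argument list selects training.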

references

CPO paper: https://arxiv.org/abs/1705.10528
Original code: https://github.com/openai/safety-starter-agents
