CS294-112 HW 1: Imitation Learning

Dependencies:

Python 3.5
Numpy version 1.14.5
TensorFlow version 1.10.5
MuJoCo version 1.50 and mujoco-py 1.50.1.56
OpenAI Gym version 0.10.5

Once Python 3.5 is installed, you can install the remaining dependencies using pip install -r requirements.txt.

Note: MuJoCo versions up to and including 1.50 do not support NVMe disks and are therefore incompatible with recent Mac machines. A request for OpenAI to add this support can be followed here.

Note: Students enrolled in the course will receive an email with their MuJoCo activation key. Please do not share this key.

The only file that you need to look at is run_expert.py, which loads an expert policy, runs a specified number of roll-outs, and saves the resulting data.
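The load, roll out, and save flow described above can be sketched roughly as follows. This is a minimal illustration, not the contents of run_expert.py: ToyEnv, toy_policy, collect_rollouts, and save_rollouts are hypothetical names, and the toy environment only mimics the reset/step interface of the Gym version listed above (step returning observation, reward, done, info) so that the sketch runs without MuJoCo.

```python
import pickle


class ToyEnv:
    """Hypothetical stand-in for a gym environment (4-tuple step API)."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0]

    def step(self, action):
        self.t += 1
        obs = [float(self.t)]
        reward = 1.0
        done = self.t >= self.horizon
        return obs, reward, done, {}


def toy_policy(obs):
    """Hypothetical 'expert': maps an observation to an action."""
    return [-obs[0]]


def collect_rollouts(env, policy_fn, num_rollouts, max_steps=1000):
    """Run the policy for num_rollouts episodes and record what it did."""
    observations, actions, returns = [], [], []
    for _ in range(num_rollouts):
        obs = env.reset()
        done = False
        total_reward = 0.0
        steps = 0
        while not done and steps < max_steps:
            action = policy_fn(obs)
            # Store the (observation, action) pair before stepping.
            observations.append(obs)
            actions.append(action)
            obs, reward, done, _ = env.step(action)
            total_reward += reward
            steps += 1
        returns.append(total_reward)
    return {"observations": observations, "actions": actions, "returns": returns}


def save_rollouts(data, path):
    """Write the collected data to disk as a pickle file."""
    with open(path, "wb") as f:
        pickle.dump(data, f, pickle.HIGHEST_PROTOCOL)


if __name__ == "__main__":
    data = collect_rollouts(ToyEnv(horizon=5), toy_policy, num_rollouts=3)
    print(len(data["observations"]), data["returns"])
```

The recorded observation and action pairs are the kind of data an imitation learner would train on; the real script's output format may differ.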

In experts/, the provided expert policies are:

Ant-v2.pkl
HalfCheetah-v2.pkl
Hopper-v2.pkl
Humanoid-v2.pkl
Reacher-v2.pkl
Walker2d-v2.pkl

The name of the pickle file corresponds to the name of the gym environment.
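Because of this naming convention, the environment name can be recovered from an expert file's path. The helper below is a small sketch of that idea; env_name_from_expert_file is a hypothetical name and is not part of the provided code.

```python
import os


def env_name_from_expert_file(path):
    """Return the gym environment name implied by an expert pickle path."""
    base = os.path.basename(path)
    name, ext = os.path.splitext(base)
    if ext != ".pkl":
        raise ValueError("expected a .pkl expert policy file, got: " + path)
    return name


if __name__ == "__main__":
    print(env_name_from_expert_file("experts/Hopper-v2.pkl"))
```

The returned string could then be passed to the environment constructor, assuming the matching Gym environment is installed.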
