udacity_rl_project_three

udacity reinforcement learning project 3: Collaboration and Competition.

The Environment

For this project, student have to train an 2 agents to playing tennis.

A reward of +0.1 receives by an agent if ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play.

The state space consists of 8 variables corresponding to the position and velocity of the ball and racket.

Each action is a vector with 2 numbers, corresponding to movement toward (or away from) the net, and jumping. Every entry in the action vector must be a number between -1 and 1.

The task is episodic, and in order to solve the environment, agent must get an average score of +0.5 over 100 consecutive episodes.

Software requirements

The following python3 libraries are required:

numpy == 1.16.2

pytorch == 0.4.0 - (GPU enabled)

unity ML-agent - available at github

Code implementation

This notebook contains full pipeline of training networks:

Initialization a unity environment;
Initialization Replay buffer and Agents determined in ddpg_collab_agent.py
Initialization Actor and Critic neural networks determined in model.py
Training and saving neural networks models at models folder.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
DDPG		DDPG
imgs		imgs
README.md		README.md
report.md		report.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

udacity_rl_project_three

The Environment

Software requirements

Code implementation

About

Releases

Packages

Languages

alex-f1tor/udacity_rl_project_three

Folders and files

Latest commit

History

Repository files navigation

udacity_rl_project_three

The Environment

Software requirements

Code implementation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages