This repository contains scripts for training agents with the IMPALA algorithm on MuJoCo and Atari environments. We follow the original paper, IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures by Espeholt et al. (2018).
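At the core of IMPALA is the V-trace off-policy correction, which lets the learner consume trajectories generated by slightly stale actor policies. The snippet below is a minimal, illustrative NumPy sketch of the V-trace targets from the paper; it is not code from this repository, and it ignores episode boundaries (terminal states) for brevity.

```python
# Illustrative V-trace target computation (Espeholt et al., 2018).
# Not this repository's code; episode boundaries are ignored for brevity.
import numpy as np

def vtrace_targets(rewards, values, bootstrap_value, log_rhos,
                   gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Compute V-trace value targets v_s for one trajectory of length T.

    rewards, values, log_rhos -- arrays of shape [T]
    log_rhos                  -- log(pi(a_t|x_t) / mu(a_t|x_t)), the log
                                 importance ratio of target over behavior policy
    bootstrap_value           -- V(x_T), value estimate at the final state
    """
    rhos = np.exp(log_rhos)
    clipped_rhos = np.minimum(rho_bar, rhos)  # rho_t: controls the fixed point
    clipped_cs = np.minimum(c_bar, rhos)      # c_t: controls contraction speed

    values_tp1 = np.append(values[1:], bootstrap_value)  # V(x_{t+1})
    deltas = clipped_rhos * (rewards + gamma * values_tp1 - values)

    # Backward recursion: v_s - V(x_s) = delta_s + gamma * c_s * (v_{s+1} - V(x_{s+1}))
    correction = np.zeros_like(values)
    acc = 0.0
    for t in reversed(range(len(rewards))):
        acc = deltas[t] + gamma * clipped_cs[t] * acc
        correction[t] = acc
    return values + correction

# Example: with on-policy ratios (log 1 = 0) V-trace reduces to the
# usual n-step bootstrapped return.
vs = vtrace_targets(rewards=np.ones(4), values=np.zeros(4),
                    bootstrap_value=0.0, log_rhos=np.zeros(4))
print(vs)
```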
Please note that we provide two examples: one for single-node training and one for distributed training. Both examples rely on the same utils file but are otherwise independent. Each example contains the following files:
- Main Script: The definitions of the algorithm components and the training loop can be found in the main script (e.g. `impala_single_node_ray.py`).
- Utils File: A utility file contains various helper functions, mainly to create the environment and the models (e.g. `utils.py`).
- Configuration File: This file includes the default hyperparameters specified in the original paper; for the multi-node case, it also includes the configuration of the Ray cluster. Users can modify these hyperparameters to customize their experiments (e.g. `config_single_node.yaml`).
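If you want to inspect these defaults programmatically before editing them, the configuration files are plain YAML. A minimal sketch, assuming PyYAML is installed; the file name comes from the example above, and nothing here depends on the file's actual schema:

```python
# Load and inspect a configuration file before editing it.
# Assumes PyYAML (pip install pyyaml); the file's schema is not assumed.
import yaml

with open("config_single_node.yaml") as f:
    cfg = yaml.safe_load(f)  # parsed into nested Python dicts/lists

print(cfg)  # inspect the default hyperparameters from the paper
```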
You can execute the single-node IMPALA algorithm on Atari environments by running the following command:

`python impala_single_node.py`
You can execute the multi-node IMPALA algorithm on Atari environments by running either of the following commands:

`python impala_single_node_ray.py`

or

`python impala_single_node_submitit.py`
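The submitit variant follows the usual submitit pattern: wrap the training entry point in a Python function and submit it to a SLURM cluster through an executor. The sketch below shows that general pattern only; the function name, log folder, and SLURM parameters are illustrative assumptions, not values taken from this repository's script. The Ray variant instead expects a running Ray cluster, whose configuration lives in the YAML file mentioned above.

```python
# General submitit launch pattern (illustrative; not this repository's code).
import submitit

def train_impala():
    # Placeholder entry point: the real script defines the IMPALA
    # components and the training loop here.
    print("training would start here")

executor = submitit.AutoExecutor(folder="submitit_logs")  # hypothetical log dir
executor.update_parameters(
    timeout_min=120,        # illustrative resource requests; tune per cluster
    nodes=2,
    gpus_per_node=1,
    slurm_partition="gpu",  # assumption: partition names vary per cluster
)
job = executor.submit(train_impala)
print(job.job_id)
job.result()  # block until the SLURM job finishes
```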