DDPPO (Decentralized Distributed Proximal Policy Optimization)

DDPPO is a PPO-based method for distributed reinforcement learning in resource-intensive simulated environments. It is distributed (runs across multiple machines), decentralized (has no central server), and synchronous (no computation is ever stale), which makes it conceptually simple and easy to implement.
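The three properties above can be illustrated with a toy simulation. The sketch below is not this package's implementation and does not use its API. It assumes a one-parameter regression problem in place of PPO, and the names `local_gradient`, `all_reduce_mean`, and `train` are hypothetical. Each simulated worker keeps its own copy of the parameter, computes a gradient on its own data, and applies the averaged gradient itself, so no central server holds the model and no worker steps ahead of the others.

```python
import random


def local_gradient(w, shard):
    """Gradient of 0.5 * (w * x - y)^2, averaged over one worker's data shard."""
    return sum((w * x - y) * x for x, y in shard) / len(shard)


def all_reduce_mean(values):
    """Stand-in for a collective all-reduce: every worker receives the same mean.

    In a real multi-machine setup this would be a collective communication
    call between peers; here it is simulated in a single process.
    """
    mean = sum(values) / len(values)
    return [mean for _ in values]


def make_shard(rng, size=32):
    """Hypothetical per-worker data: samples of y = 3x (stands in for rollouts)."""
    xs = [rng.uniform(-1.0, 1.0) for _ in range(size)]
    return [(x, 3.0 * x) for x in xs]


def train(num_workers=4, steps=200, lr=0.5, seed=0):
    rng = random.Random(seed)
    shards = [make_shard(rng) for _ in range(num_workers)]
    # Every worker starts from the same parameter value and keeps its own copy.
    weights = [0.0] * num_workers
    for _ in range(steps):
        # Distributed: each worker computes a gradient on its own shard.
        grads = [local_gradient(w, s) for w, s in zip(weights, shards)]
        # Synchronous: all workers wait for the averaged gradient of this step.
        averaged = all_reduce_mean(grads)
        # Decentralized: each worker applies the update to its own copy.
        weights = [w - lr * g for w, g in zip(weights, averaged)]
    return weights
```

Calling `train()` returns one parameter value per simulated worker. Because every worker starts from the same value and applies the same averaged gradient at every step, the copies stay identical without ever being broadcast from a central location.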

Installation

conda create -n rllib-ddppo python=3.10
conda activate rllib-ddppo
pip install -r requirements.txt
pip install -e '.[development]'

Usage

DDPPO Example

