pg

PG (Vanilla Policy Gradient)

PG is the most basic reinforcement learning algorithm that learns a policy by taking a gradient of action log probabilities and weighting them by the return. This algorithm is also known as REINFORCE.

Installation

conda create -n rllib-pg python=3.10
conda activate rllib-pg
pip install -r requirements.txt
pip install -e '.[development]'

Usage

PG Example

Name		Name	Last commit message	Last commit date
parent directory ..
examples		examples
src/rllib_pg/pg		src/rllib_pg/pg
tests		tests
tuned_examples		tuned_examples
BUILD		BUILD
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pg

pg

README.md

PG (Vanilla Policy Gradient)

Installation

Usage

Files

pg

Directory actions

More options

Directory actions

More options

Latest commit

History

pg

Folders and files

parent directory

README.md

PG (Vanilla Policy Gradient)

Installation

Usage