Stars
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
TextStarCraft2,a pure language env which support llms play starcraft2
Reinforcement Learning inside a 3D soccer simulation
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set …
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
Long-Term Evolution Project of Reinforcement Learning
Optax is a gradient processing and optimization library for JAX.
Flax is a neural network library for JAX that is designed for flexibility.
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)
GPU-acceleration of Nocturne via Madrona
rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").