hai zhonghai1995

9 followers · 146 following

Stars

gxywy / rl-plotter

✨ A plotter for reinforcement learning (RL)

Python 206 30 Updated Dec 8, 2021

google-deepmind / alphastar

Python 398 51 Updated Sep 8, 2022

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,032 220 Updated Oct 4, 2024

histmeisah / Large-Language-Models-play-StarCraftII

TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II

Python 192 12 Updated Aug 21, 2024

Michael-Beukman / RobocupGym

Reinforcement Learning inside a 3D soccer simulation

Python 19 Updated Sep 15, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,064 124 Updated Aug 3, 2023

cor3bit / bertsekas-marl

PyTorch Implementation of the Sequential Multiagent Rollout algorithm

Python 10 2 Updated Jun 28, 2024

corl-team / xland-minigrid

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

Python 192 15 Updated Aug 16, 2024

proroklab / VectorizedMultiAgentSimulator

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set …

Python 318 68 Updated Sep 24, 2024

Farama-Foundation / Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,234 270 Updated Aug 18, 2024

Haichao-Zhang / PEX

Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)

Python 45 5 Updated Apr 4, 2023

PKU-MARL / DexterousHands

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Python 627 74 Updated Jun 20, 2024

shariqiqbal2810 / maddpg-pytorch

PyTorch Implementation of MADDPG (Lowe et al. 2017)

Python 560 128 Updated Nov 26, 2019

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 296 41 Updated Aug 22, 2024

ikostrikov / rlpd

Python 206 24 Updated Feb 13, 2023

vitchyr / viskit

rllab's viskit with some added features

Python 73 35 Updated May 1, 2023

google-deepmind / distrax

Python 532 32 Updated Sep 18, 2024

my-yy / s2v_rc

Speech2Vec Reality Check

Python 75 3 Updated Feb 21, 2023

RLE-Foundation / rllte

Long-Term Evolution Project of Reinforcement Learning

Python 464 84 Updated Aug 26, 2024

shadps4-emu / shadPS4

PS4 emulator for Windows, Linux, macOS

C++ 9,941 581 Updated Oct 5, 2024

google-deepmind / optax

Optax is a gradient processing and optimization library for JAX.

Python 1,650 181 Updated Oct 4, 2024

google / flax

Flax is a neural network library for JAX that is designed for flexibility.

Python 6,009 635 Updated Oct 4, 2024

minitorch / minitorch

The full minitorch student suite.

Python 1,886 366 Updated Aug 17, 2024

facebookresearch / Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,598 157 Updated Sep 22, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,153 1,599 Updated Aug 1, 2024

jayeshs999 / sapg

Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)

Jupyter Notebook 38 2 Updated Sep 17, 2024

Emerge-Lab / gpudrive

GPU-acceleration of Nocturne via Madrona

Jupyter Notebook 200 18 Updated Oct 4, 2024

mantle2048 / rlplot

rlplot is an easy-to-use, highly encapsulated RL plotting library (including a basic error-bar line plot and a wrapper around "rliable").

Python 26 3 Updated Dec 8, 2023

google-research / rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 753 46 Updated Aug 12, 2024

denisyarats / drq

DrQ: Data regularized Q

Jupyter Notebook 404 52 Updated Jan 13, 2023