
✨ A plotter for reinforcement learning (RL)

Python 206 30 Updated Dec 8, 2021

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,032 220 Updated Oct 4, 2024

TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II

Python 192 12 Updated Aug 21, 2024

Reinforcement Learning inside a 3D soccer simulation

Python 19 Updated Sep 15, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,064 124 Updated Aug 3, 2023

PyTorch Implementation of the Sequential Multiagent Rollout algorithm

Python 10 2 Updated Jun 28, 2024

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

Python 192 15 Updated Aug 16, 2024

VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set …

Python 318 68 Updated Sep 24, 2024

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,234 270 Updated Aug 18, 2024

Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)

Python 45 5 Updated Apr 4, 2023

This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym

Python 627 74 Updated Jun 20, 2024

PyTorch Implementation of MADDPG (Lowe et al., 2017)

Python 560 128 Updated Nov 26, 2019

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 296 41 Updated Aug 22, 2024
Python 206 24 Updated Feb 13, 2023

rllab's viskit with some added features

Python 73 35 Updated May 1, 2023
Python 532 32 Updated Sep 18, 2024

Speech2Vec Reality Check

Python 75 3 Updated Feb 21, 2023

Long-Term Evolution Project of Reinforcement Learning

Python 464 84 Updated Aug 26, 2024

PS4 emulator for Windows, Linux, and macOS

C++ 9,941 581 Updated Oct 5, 2024

Optax is a gradient processing and optimization library for JAX.

Python 1,650 181 Updated Oct 4, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Python 6,009 635 Updated Oct 4, 2024

The full minitorch student suite.

Python 1,886 366 Updated Aug 17, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,598 157 Updated Sep 22, 2024

LLM101n: Let's build a Storyteller

29,153 1,599 Updated Aug 1, 2024

Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)

Jupyter Notebook 38 2 Updated Sep 17, 2024

GPU-acceleration of Nocturne via Madrona

Jupyter Notebook 200 18 Updated Oct 4, 2024

rlplot is an easy-to-use, highly encapsulated RL plotting library (including a basic error-bar line plot and a wrapper for "rliable").

Python 26 3 Updated Dec 8, 2023

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 753 46 Updated Aug 12, 2024

DrQ: Data regularized Q

Jupyter Notebook 404 52 Updated Jan 13, 2023