zerlinwang/README.md

Hi there! I am a Computer Science Ph.D. student at the University of Oxford in the FLAIR and WhiRL labs. I am fortunate to be co-supervised by Professor Jakob Foerster and Professor Shimon Whiteson. I previously received a Master's degree from Tsinghua University, supervised by Prof. Zhiyong Wu. Before that, I completed my undergraduate studies at Dalian University of Technology. 😊

My research focuses on developing general decision-making agents capable of interacting with complex and dynamic environments. Currently, I am exploring Deep Reinforcement Learning (Deep RL) as a core solution methodology. ✨

In my spare time, I enjoy running, hiking, badminton, and fitness, as well as anime and video games. Additionally, I am a polyglot, fluent in Mandarin, English, and Japanese. I love learning new languages because I believe a new language means a fresh perspective on this wonderful world! 💫

Popular repositories

  1. minRLHF Public

    Forked from thomfoster/minRLHF

    A (somewhat) minimal library for finetuning language models with PPO on human feedback.

    Python 2

  2. DeepRL Public

    Forked from NeuronDance/DeepRL

    Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone

    1

  3. mahjong Public

    mahjong learning note

    1

  4. synthetic-corpus-vocoder Public

    Official repository for the paper "A SYNTHETIC CORPUS GENERATION METHOD FOR NEURAL VOCODER TRAINING"

    Python 1

  5. RL4LMs Public

    Forked from allenai/RL4LMs

    A modular RL library to fine-tune language models to human preferences

    Python 1

  6. trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python 1