Stars
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
LLM training code for Databricks foundation models
Pretrain and finetune ANY AI model of ANY size on multiple GPUs and TPUs with zero code changes.
Unsupervised text tokenizer for Neural Network-based text generation.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal models, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Robust Speech Recognition via Large-Scale Weak Supervision
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization.
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Masked Structural Growth for 2x Faster Language Model Pre-training
Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719
Awesome-LLM: a curated list of Large Language Models
A curated reading list of research in Mixture-of-Experts (MoE).
Pygments is a generic syntax highlighter written in Python
Language Savant. If your repository's language is being reported incorrectly, send us a pull request!
Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation
Python package built to ease deep learning on graphs, built on top of existing DL frameworks.
Integrate cutting-edge LLM technology quickly and easily into your apps
A relation-aware semantic parsing model from English to SQL
DAMO-ConvAI: the official repository containing the codebase for Alibaba DAMO Conversational AI.
Development repository for the Triton language and compiler
A high-throughput and memory-efficient inference and serving engine for LLMs
Samples for CUDA developers demonstrating features in the CUDA Toolkit
Modeling, training, eval, and inference code for OLMo
Official style files for papers submitted to venues of the Association for Computational Linguistics