Skip to content
View pengshuyuan's full-sized avatar

Block or report pengshuyuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]

Python 127 6 Updated Sep 20, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,988 790 Updated Oct 14, 2024

LLM training code for Databricks foundation models

Python 4,016 524 Updated Oct 17, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,197 3,373 Updated Oct 18, 2024

Slurm: A Highly Scalable Workload Manager

C 2,636 659 Updated Oct 18, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,186 1,170 Updated Oct 1, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,850 2,462 Updated Oct 18, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,493 147 Updated Sep 25, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,450 403 Updated Oct 18, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 69,823 8,218 Updated Sep 30, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,884 314 Updated Oct 18, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 868 46 Updated Jun 25, 2024

Masked Structural Growth for 2x Faster Language Model Pre-training

Python 21 2 Updated Apr 28, 2024

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 18 Updated Jun 5, 2024

Awesome-LLM: a curated list of Large Language Model

18,291 1,483 Updated Oct 14, 2024

A fast MoE impl for PyTorch

Python 1,547 187 Updated Jul 5, 2024

A curated reading list of research in Mixture-of-Experts(MoE).

526 40 Updated Sep 4, 2023

Pygments is a generic syntax highlighter written in Python

Python 1,810 665 Updated Oct 13, 2024

Language Savant. If your repository's language is being reported incorrectly, send us a pull request!

Ruby 12,245 4,232 Updated Sep 27, 2024

Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation

Python 75 9 Updated Sep 18, 2023

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 13,463 3,012 Updated Oct 18, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,702 3,218 Updated Oct 17, 2024

A relation-aware semantic parsing model from English to SQL

Python 406 117 Updated Aug 22, 2023

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,202 186 Updated Sep 23, 2024

Development repository for the Triton language and compiler

C++ 13,075 1,598 Updated Oct 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,606 4,244 Updated Oct 18, 2024

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 6,280 1,792 Updated Jul 26, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,527 453 Updated Oct 17, 2024

Official style files for papers submitted to venues of the Association for Computational Linguistics

TeX 714 177 Updated May 20, 2024
Next