Skip to content
View zhentingqi's full-sized avatar

Highlights

  • Pro

Block or report zhentingqi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 2,643 266 Updated Oct 16, 2024

Code repo for paper "Quantifying Generalization Complexity for Large Language Models"

Python 2 Updated Oct 14, 2024

Collection of Open Source Speech Data

130 7 Updated Oct 1, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 32,593 3,993 Updated Oct 17, 2024

Code for Quiet-STaR

Python 597 81 Updated Aug 21, 2024
Python 4 1 Updated May 7, 2024
Python 411 42 Updated Oct 14, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,278 66 Updated Oct 15, 2024

Awesome LLM compression research papers and tools.

1,127 68 Updated Oct 17, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,644 443 Updated Oct 18, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,515 1,083 Updated May 23, 2024

A framework for few-shot evaluation of language models.

Python 6,720 1,786 Updated Oct 17, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,751 5,812 Updated Aug 19, 2024

Fast and memory-efficient exact attention

Python 13,797 1,269 Updated Oct 15, 2024

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 116 12 Updated Oct 17, 2024

Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"

Python 71 7 Updated Apr 12, 2023

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 357 28 Updated Jun 29, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,765 455 Updated May 3, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,032 192 Updated Apr 24, 2024

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python 524 41 Updated Mar 10, 2024

Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives

Python 60 5 Updated Feb 22, 2024

Python package for measuring memorization in LLMs.

Jupyter Notebook 115 18 Updated Sep 16, 2024

Ongoing research training transformer models at scale

Python 10,314 2,312 Updated Oct 16, 2024
Python 2,496 304 Updated May 19, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,779 2,172 Updated Aug 12, 2024

A repository of Language Model Vulnerabilities and Exposures (LVEs).

Python 106 12 Updated Mar 12, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 704 82 Updated Aug 14, 2024

Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023

Python 124 6 Updated Apr 30, 2024

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Python 309 26 Updated Jan 4, 2024