Stars
Entropy-Based Sampling and Parallel CoT Decoding
Code repo for paper "Quantifying Generalization Complexity for Large Language Models"
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Awesome LLM compression research papers and tools.
SGLang is a fast serving framework for large language models and vision language models.
A llama3 implementation, one matrix multiplication at a time
A framework for few-shot evaluation of language models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Fast and memory-efficient exact attention
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
DeepSeek-VL: Towards Real-World Vision-Language Understanding
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
Python package for measuring memorization in LLMs.
Ongoing research on training transformer models at scale
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A repository of Language Model Vulnerabilities and Exposures (LVEs).
Representation Engineering: A Top-Down Approach to AI Transparency
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"