- Chaotic Futurism; @SJTU-IPADS
- https://66ring.github.io/
- https://vx._66RING_
Highlights
- Pro
llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
Fast inference from large language models via speculative decoding
[ICLR 2024] Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Papers for database systems powered by artificial intelligence (machine learning for databases)
This repository contains tutorials and examples for Triton Inference Server