Shanghai Jiao Tong University
Shanghai, China
https://drsy.github.io/
Stars
Inpaint anything using Segment Anything and inpainting models.
📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024
Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…
Extensible, parallel implementations of t-SNE
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP 2024]
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Video+code lecture on building nanoGPT from scratch
Arena-Hard-Auto: An automatic LLM benchmark.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Gemma 2B with 10M context length using Infini-attention.
Benchmarking LLMs with Challenging Tasks from Real Users
Simple macOS menu bar application to view and interact with reminders. Developed with SwiftUI and using Apple Reminders as a source.
A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.
llama3 implementation one matrix multiplication at a time
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLMs
flash attention tutorial written in python, triton, cuda, cutlass
Advanced quantization algorithms for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model