Skip to content
View DRSY's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report DRSY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,473 531 Updated Feb 29, 2024

📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024

Jupyter Notebook 28 3 Updated Oct 15, 2024

Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 3,898 344 Updated Oct 19, 2024

Extensible, parallel implementations of t-SNE

Python 1,460 161 Updated Aug 13, 2024

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

183 12 Updated Sep 19, 2024

official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]

Python 43 1 Updated Oct 18, 2024
Python 26 3 Updated Oct 14, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 621 17 Updated Sep 18, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,674 154 Updated Oct 4, 2024
Python 60 4 Updated Jul 26, 2024

Kolors Team

Python 3,738 249 Updated Sep 4, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,767 452 Updated Sep 19, 2024
Python 2 Updated Jun 27, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,572 352 Updated Oct 17, 2024

A Neural-Symbolic Self-Training Framework

C 96 3 Updated Jul 23, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,514 485 Updated Aug 13, 2024
Python 28 Updated Aug 19, 2024
Python 15 2 Updated Jun 13, 2023

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 585 71 Updated Oct 15, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,958 558 Updated Oct 15, 2024

Gemma 2B with 10M context length using Infini-attention.

Python 941 58 Updated May 12, 2024

Benchmarking LLMs with Challenging Tasks from Real Users

Python 189 33 Updated Aug 1, 2024

Simple macOS menu bar application to view and interact with reminders. Developed with SwiftUI and using Apple Reminders as a source.

Swift 2,546 115 Updated Sep 20, 2024

A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.

Jupyter Notebook 186 10 Updated Sep 14, 2023

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,582 1,087 Updated May 23, 2024

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM

Python 140 12 Updated Jul 12, 2024

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 181 14 Updated Jun 18, 2024

Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

Python 237 20 Updated Oct 18, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,500 148 Updated Sep 25, 2024
Next