Stars
Image to prompt with BLIP and CLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
An open source implementation of CLIP.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Open-source and strong foundation image recognition models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Mastering Diverse Domains through World Models
[ICCV 2023 Oral] A New Paradigm for End-to-end Autonomous Driving to Alleviate Causal Confusion
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Accessible large language models via k-bit quantization for PyTorch.
QLoRA: Efficient Finetuning of Quantized LLMs
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
The official repository of "Video assistant towards large language model makes everything easy"
The Time Series Visualization Tool that you deserve.