Skip to content
View 0uMuMu0's full-sized avatar

Block or report 0uMuMu0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Image to prompt with BLIP and CLIP

Python 2,667 433 Updated May 15, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,697 624 Updated Aug 5, 2024

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,163 103 Updated Dec 20, 2023

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 283 21 Updated Jul 17, 2024

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,159 58 Updated Mar 14, 2024
Python 2,546 193 Updated Oct 4, 2024

An open source implementation of CLIP.

Python 9,937 959 Updated Aug 19, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,256 147 Updated Aug 23, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,659 440 Updated Sep 19, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,778 271 Updated Aug 1, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,886 1,379 Updated Sep 5, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,025 1,572 Updated Oct 7, 2024

"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository

Python 222 23 Updated Apr 23, 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 408 19 Updated Sep 5, 2024

Implementation of Dreamer v3 in pytorch.

Python 390 87 Updated Sep 27, 2024

Mastering Diverse Domains through World Models

Python 1,306 224 Updated Jul 29, 2024

[ICCV 2023 Oral] A New Paradigm for End-to-end Autonomous Driving to Alleviate Causal Confusion

Python 195 14 Updated Jan 11, 2024

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 632 42 Updated Aug 13, 2024

[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer

Python 531 46 Updated Jan 20, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 690 53 Updated Jul 24, 2024

Large Language Model Text Generation Inference

Python 8,869 1,046 Updated Oct 7, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,932 4,125 Updated Oct 7, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,344 938 Updated Oct 1, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,138 616 Updated Oct 2, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,962 820 Updated Jun 10, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,390 184 Updated Jul 16, 2024

The official repository of "Video assistant towards large language model makes everything easy"

Python 204 14 Updated Feb 22, 2024
Python 206 24 Updated Feb 13, 2023

The Time Series Visualization Tool that you deserve.

C++ 4,373 611 Updated Aug 10, 2024
Next