vision-language-models

Here are 11 public repositories matching this topic...

baaivision / EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

clip vlm instruction-following large-language-models llm mllm multimodal-large-language-models vision-language-models encoder-free-vlm

Updated Oct 2, 2024
Python

snap-research / MyVLM

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

personalization vision-language-models

Updated Jul 5, 2024
Python

baaivision / DenseFusion

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

vlm image-descriptions visual-perception mllm multimodal-large-language-models vision-language-models

Updated Sep 27, 2024
Python

BAAI-Agents / GPA-LM

This repo is a live list of papers on game playing and large multimodal models, accompanying "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".

games ai gcc planning gameplay awesome-list agents gameai vlm multimodal agent-framework large-language-models llm generative-ai vision-language-models general-computer-control

Updated Sep 3, 2024

elkhouryk / RS-TransCLIP

Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"

remote-sensing satellite-imagery scene-classification transductive-learning zero-shot-classification vision-language-models

Updated Sep 12, 2024
Python

yu-rp / apiprompting

[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models

visual-prompting prompting vision-language-model large-vision-language-model large-vision-language-models large-multimodal-models vision-language-models

Updated Sep 26, 2024
Python

NishilBalar / Awesome-LVLM-Hallucination

An up-to-date curated list of state-of-the-art research, papers, and resources on hallucination in large vision-language models (LVLMs)

mlm hallucination large-language-models llm mllm large-vision-language-models multimodal-large-language-models hallucination-evaluation hallucination-detection vision-language-models lvlm hallucination-mitigation hallucination-survey hallucination-research hallucination-benchmark multimodal-language-model

Updated Sep 26, 2024

erfanshayegani / Jailbreak-In-Pieces

[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

alignment ai-safety vlm llm vision-language-models cross-modality-safety-alignment multi-modal-models

Updated Jun 6, 2024
Python

vanillaer / CPL-ICML2024

[ICML 2024] Official code repo for the ICML 2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data"

unlabeled-data pseudolabels vision-language-models

Updated Jun 21, 2024
Python

chu0802 / SnD

Official implementation of "Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models", accepted to ECCV'24

eccv continual-learning vision-language-models eccv2024

Updated Jul 1, 2024

Ibtissam-SAADI / CLIVP-FER

Facial Expression Recognition using vision language models (VLMs)

facial-expression-recognition vision-language-models driver-s-emotion contrastive-language--image-pretraining

Updated May 9, 2024
Python
