Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
data-filtering data-quality-assessment large-language-models llava multimodal-large-language-models image-text-data
-
Updated
Dec 30, 2024 - Python