[ACM MMGR '24] 🔍 Shotluck Holmes: A family of small-scale LLVMs for shot-level video understanding
python nlp video-summarization video-captioning multi-modality visual-language-learning llm vision-language-model shotluck-holmes
Updated Oct 26, 2024 - Python