XxxZzD

0 followers · 3 following

Stars

TideDra / VL-RLHF

An RLHF Infrastructure for Vision-Language Models

Python 98 5 Updated Jun 12, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,973 175 Updated Oct 4, 2024

akskuchi / groovist

GROOViST: A Metric for Grounding Objects in Visual Storytelling – EMNLP 2023

Python 2 1 Updated Oct 8, 2024

xichenpan / ARLDM

Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

Python 191 29 Updated Jul 9, 2023

NiuTrans / Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 79 5 Updated Oct 16, 2024

binary-husky / gpt_academic

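Provides a practical interactive interface for GPT, GLM, and other large language models, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…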

Python 65,468 8,049 Updated Nov 5, 2024

yuanzhoulvpi2017 / zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 2,952 363 Updated Oct 29, 2024

feiyangqingyun / qtkaifajingyan

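A personal summary of more than ten years of Qt development experience, along with Qt-related "secret manual" e-books. It will be updated continuously; comments that add content or offer suggestions are welcome. Thank you! WeChat official accounts: Qt实战 / Qt入门和进阶 / Qt教程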

3,964 906 Updated Sep 28, 2024

JiwanChung / esper

ESPER

Python 22 2 Updated Mar 29, 2024

haoningwu3639 / StoryGen

[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

Python 204 11 Updated Oct 19, 2024

beichenzbc / Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 663 33 Updated Aug 13, 2024

showlab / Image2Paragraph

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Python 789 53 Updated Apr 28, 2023

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 56,028 5,752 Updated Aug 24, 2024

HAWLYQ / InfoMetIC

Python 10 Updated Sep 5, 2023

jiayev / GPT4V-Image-Captioner

Python 787 58 Updated Oct 7, 2024

peteanderson80 / bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Jupyter Notebook 1,432 378 Updated Feb 3, 2023

seungheondoh / llm-tag-to-caption

Python 8 Updated Jan 22, 2024

jhc13 / taggui

Tag manager and captioner for image datasets

Python 737 35 Updated Nov 1, 2024

ryankiros / neural-storyteller

A recurrent neural network for generating little stories about images

Python 2,961 539 Updated Oct 18, 2017

sjy0727 / CLIP-Text-Image-Retrieval

This project aims to retrieve images that match an input text description.

Python 26 3 Updated Aug 24, 2023

DingyiYang / StyleVSG

Codes for Paper: Attractive Storyteller: Stylized Visual Storytelling with Unpaired Text

Python 3 Updated Dec 18, 2023

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 142,482 26,870 Updated Nov 6, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,721 374 Updated Aug 8, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 30,292 2,723 Updated Feb 25, 2024

towhee-io / examples

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

Jupyter Notebook 449 112 Updated Feb 9, 2024

pharmapsychotic / clip-interrogator

Image to prompt with BLIP and CLIP

Python 2,690 432 Updated May 15, 2024

lichengunc / vist_eval

vist story telling evaluation tool

Python 21 8 Updated Dec 5, 2023

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,461 1,701 Updated Apr 25, 2024

coding-pot / Zero2Story

Zero2Story

Python 146 18 Updated Nov 30, 2023

aim-uofa / AutoStory

145 4 Updated Aug 23, 2024