Skip to content
View XxxZzD's full-sized avatar

Block or report XxxZzD

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A RLHF Infrastructure for Vision-Language Models

Python 98 5 Updated Jun 12, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,973 175 Updated Oct 4, 2024

GROOViST: A Metric for Grounding Objects in Visual Storytelling – EMNLP 2023

Python 2 1 Updated Oct 8, 2024

Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

Python 191 29 Updated Jul 9, 2023

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 79 5 Updated Oct 16, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 65,468 8,049 Updated Nov 5, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 2,952 363 Updated Oct 29, 2024

自己总结的这十多年做Qt开发以来的经验,以及Qt相关武林秘籍电子书,会一直持续更新增加,欢迎各位留言增加内容或者提出建议,谢谢!公众号:Qt实战/Qt入门和进阶/Qt教程

3,964 906 Updated Sep 28, 2024

ESPER

Python 22 2 Updated Mar 29, 2024

[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

Python 204 11 Updated Oct 19, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 663 33 Updated Aug 13, 2024

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Python 789 53 Updated Apr 28, 2023

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 56,028 5,752 Updated Aug 24, 2024
Python 10 Updated Sep 5, 2023

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Jupyter Notebook 1,432 378 Updated Feb 3, 2023
Python 8 Updated Jan 22, 2024

Tag manager and captioner for image datasets

Python 737 35 Updated Nov 1, 2024

A recurrent neural network for generating little stories about images

Python 2,961 539 Updated Oct 18, 2017

该项目旨在通过输入文本描述来检索与之相匹配的图片。

Python 26 3 Updated Aug 24, 2023

Codes for Paper: Attractive Storyteller: Stylized Visual Storytelling with Unpaired Text

Python 3 Updated Dec 18, 2023

Stable Diffusion web UI

Python 142,482 26,870 Updated Nov 6, 2024

Nightly release of ControlNet 1.1

Python 4,721 374 Updated Aug 8, 2024

Let us control diffusion models!

Python 30,292 2,723 Updated Feb 25, 2024

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

Jupyter Notebook 449 112 Updated Feb 9, 2024

Image to prompt with BLIP and CLIP

Python 2,690 432 Updated May 15, 2024

vist story telling evaluation tool

Python 21 8 Updated Dec 5, 2023

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,461 1,701 Updated Apr 25, 2024

Zero2Story

Python 146 18 Updated Nov 30, 2023
Next