Stars
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Simple, powerful, cross-platform SQLite client and ORM for .NET
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Barrage Fly——让弹幕飞,一个弹幕转发、过滤、处理平台(支持B站、斗鱼、虎牙、抖音、快手,支持弹幕发送)
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN
Contains some simple and commonly used WPF controls
rtmp streaming from opencv with ffmpeg / avcodec using C++ or Python
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
Using modified BiSeNet for face parsing in PyTorch
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
MuseTalk ComfyUI Preprocess and Postprocess Nodes
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.
Real time interactive streaming digital human