Stars
ComfyUI nodes to edit videos using Genmo Mochi
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
collection of diffusion model papers categorized by their subareas
Lumina-T2X is a unified framework for Text to Any Modality Generation
Official inference repo for FLUX.1 models
[NeurIPS 2024] Boosting the performance of consistency models with PCM!
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
A paper list of some recent Mamba-based CV works.
Official Implementation for "Block and Detail: Scaffolding Sketch-to-Image Generation"
Official Code for MotionCtrl [SIGGRAPH 2024]
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
App showcasing multiple real-time diffusion models pipelines with Diffusers
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
A non-exhaustive list of details that make a good web interface.
A realtime CRDT-based document store, backed by S3.
A diffusion model to colorize black and white images
Generative Models by Stability AI
Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
ImageBind One Embedding Space to Bind Them All
[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting