gu-ma

30 followers · 46 following

https://blog.massol.me/

Stars

ArdenButterfield / Maim

Audio plugin for custom MP3 distortion and digital glitches

C · 255 stars · 3 forks · Updated Aug 30, 2024

Tencent / DepthCrafter

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python · 731 stars · 34 forks · Updated Oct 12, 2024

FedericoCeratto / dashing

Terminal dashboards for Python

Python · 433 stars · 34 forks · Updated May 14, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

TypeScript · 2,405 stars · 248 forks · Updated Oct 10, 2024

gorakhargosh / watchdog

Python library and shell utilities to monitor filesystem events.

Python · 6,536 stars · 695 forks · Updated Oct 10, 2024

PeculiarVentures / GammaCV

GammaCV is a WebGL accelerated Computer Vision library for browser

JavaScript · 177 stars · 23 forks · Updated Oct 10, 2024

BespokeSynth / BespokeSynth

Software modular synth

C++ · 4,065 stars · 239 forks · Updated Sep 21, 2024

Gourieff / sd-webui-reactor

Forked from s0md3v/sd-webui-roop

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Python · 2,514 stars · 267 forks · Updated Oct 7, 2024

weihaox / awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

1,461 stars · 135 forks · Updated Oct 14, 2024

flowese / UdioWrapper

UdioWrapper is a Python package that enables the generation of music tracks using Udio's API through textual prompts. This package is based on the reverse engineering of the Udio API (https://www.…

Python · 124 stars · 22 forks · Updated Jul 22, 2024

gcui-art / suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript · 1,294 stars · 296 forks · Updated Sep 13, 2024

EnVision-Research / Lotus

Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python · 323 stars · 15 forks · Updated Oct 13, 2024

microsoft / autogen

A programming framework for agentic AI 🤖

C# · 31,917 stars · 4,644 forks · Updated Oct 14, 2024

Sharrnah / whispering-ui

Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

Go · 218 stars · 12 forks · Updated Oct 11, 2024

zml / zml

High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild

Zig · 1,520 stars · 54 forks · Updated Oct 14, 2024

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python · 4,844 stars · 401 forks · Updated Aug 10, 2024

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python · 17,828 stars · 1,338 forks · Updated May 23, 2024

Vchitect / Vchitect-2.0

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python · 616 stars · 16 forks · Updated Sep 18, 2024

replicate / replicate-javascript

Node.js client for Replicate

TypeScript · 481 stars · 200 forks · Updated Oct 9, 2024

replicate / create-replicate

Generate a simple Node.js project structure for running AI models with Replicate's API

JavaScript · 30 stars · 15 forks · Updated Sep 24, 2024

ishangupta3 / CinemaNet

PureBasic · 6 stars · 10 forks · Updated Jul 11, 2019

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python · 8,029 stars · 749 forks · Updated Oct 14, 2024

Tabstle / compp

1 star · Updated Jun 4, 2024

lunarring / latentblending

Create butter-smooth transitions between prompts, powered by stable diffusion

Python · 353 stars · 24 forks · Updated Mar 29, 2024

Stable-X / StableNormal

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python · 370 stars · 16 forks · Updated Oct 11, 2024

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python · 897 stars · 67 forks · Updated Aug 24, 2024

fedirz / faster-whisper-server

Python · 600 stars · 87 forks · Updated Oct 14, 2024

AnswerDotAI / rerankers

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python · 991 stars · 53 forks · Updated Sep 11, 2024

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python · 3,795 stars · 653 forks · Updated Jul 30, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python · 11,008 stars · 1,834 forks · Updated Oct 9, 2024

Lists (8)

3D

Audio

ComfyUI

LLM

OS

SD

TD

Webui
