Skip to content
View gu-ma's full-sized avatar

Highlights

  • Pro

Organizations

@iartag @digitalideation

Block or report gu-ma

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Audio plugin for custom MP3 distortion and digital glitches

C 255 3 Updated Aug 30, 2024

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 731 34 Updated Oct 12, 2024

Terminal dashboards for Python

Python 433 34 Updated May 14, 2024

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 2,405 248 Updated Oct 10, 2024

Python library and shell utilities to monitor filesystem events.

Python 6,536 695 Updated Oct 10, 2024

GammaCV is a WebGL accelerated Computer Vision library for browser

JavaScript 177 23 Updated Oct 10, 2024

Software modular synth

C++ 4,065 239 Updated Sep 21, 2024

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Python 2,514 267 Updated Oct 7, 2024

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

1,461 135 Updated Oct 14, 2024

UdioWrapper is a Python package that enables the generation of music tracks using Udio's API through textual prompts. This package is based on the reverse engineering of the Udio API (https://www.…

Python 124 22 Updated Jul 22, 2024

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript 1,294 296 Updated Sep 13, 2024

Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Python 323 15 Updated Oct 13, 2024

A programming framework for agentic AI 🤖

C# 31,917 4,644 Updated Oct 14, 2024

Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

Go 218 12 Updated Oct 11, 2024

High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild

Zig 1,520 54 Updated Oct 14, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,844 401 Updated Aug 10, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 17,828 1,338 Updated May 23, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 616 16 Updated Sep 18, 2024

Node.js client for Replicate

TypeScript 481 200 Updated Oct 9, 2024

Generate a simple Node.js project structure for running AI models with Replicate's API

JavaScript 30 15 Updated Sep 24, 2024
PureBasic 6 10 Updated Jul 11, 2019

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 8,029 749 Updated Oct 14, 2024
1 Updated Jun 4, 2024

Create butter-smooth transitions between prompts, powered by stable diffusion

Python 353 24 Updated Mar 29, 2024

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 370 16 Updated Oct 11, 2024

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 897 67 Updated Aug 24, 2024

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 991 53 Updated Sep 11, 2024

Foundational model for human-like, expressive TTS

Python 3,795 653 Updated Jul 30, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,008 1,834 Updated Oct 9, 2024
Next