Skip to content
View Pur1zumu's full-sized avatar

Highlights

  • Pro

Block or report Pur1zumu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,199 3,373 Updated Oct 18, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 4,242 375 Updated Oct 18, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,131 108 Updated May 10, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 680 43 Updated Oct 13, 2024

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Python 496 20 Updated May 30, 2024

Notebooks accompanying Anthropic's "Toy Models of Superposition" paper

Jupyter Notebook 92 10 Updated Sep 14, 2022

(ML) audio engineering i/o utils

Jupyter Notebook 53 7 Updated Jan 5, 2024

Lightweight Armoury Crate alternative for Asus laptops and ROG Ally. Control tool for ROG Zephyrus G14, G15, G16, M16, Flow X13, Flow X16, TUF, Strix, Scar and other models

C# 7,381 265 Updated Oct 15, 2024
Python 23 2 Updated Jun 26, 2024

Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.

TypeScript 10,649 530 Updated Oct 17, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 865 97 Updated Sep 5, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,050 86 Updated Aug 6, 2024
Python 110 19 Updated Sep 25, 2024

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

142 6 Updated Oct 17, 2024

Code and data for "Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue" (ACL 2024)

Python 21 1 Updated Aug 7, 2024

A generative speech model for daily dialogue.

Python 31,581 3,437 Updated Oct 17, 2024

Benchmark popular audio i/o packages

Python 138 10 Updated Dec 19, 2023
Python 95 18 Updated Jul 25, 2024

Karras et al. (2022) diffusion models for PyTorch

Python 2,289 375 Updated Jul 16, 2024

Manage scalable open LLM inference endpoints in Slurm clusters

Python 227 22 Updated Jul 11, 2024
Python 438 43 Updated Oct 7, 2024

Repository for training models for music source separation.

Python 431 56 Updated Oct 10, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,533 533 Updated Jul 25, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,527 391 Updated Oct 14, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,275 99 Updated Sep 24, 2023

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,167 109 Updated Jul 11, 2024

End-to-End Speech Processing Toolkit

Python 8,402 2,174 Updated Oct 10, 2024

A modern Python package and dependency manager supporting the latest PEP standards

Python 7,881 392 Updated Oct 17, 2024

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting…

407 29 Updated Sep 28, 2022

An easy to understand TTS / SVS / SVC framework

Python 641 81 Updated Oct 7, 2024
Next