Skip to content
View Robinatp's full-sized avatar

Block or report Robinatp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A timeline of the latest AI models for audio generation, starting in 2023!

1,889 68 Updated Jan 4, 2024

Community list of startups working with AI in audio and music technology

1,553 136 Updated Aug 9, 2024
Python 1 Updated Dec 12, 2023

OpenUTAU renderer for diffsinger / 适用于diffsinger的OpenUTAU渲染器,使用方法:https://github.com/xunmengshe/OpenUtau/wiki/%E4%BD%BF%E7%94%A8%E6%96%B9%E6%B3%95%EF%BC%88%E4%B8%AD%E6%96%87%EF%BC%89

C# 23 1 Updated May 30, 2023
C# 268 27 Updated Sep 9, 2024

泠鸢yousa的Diffsinger模型v1版

42 4 Updated Oct 21, 2024

Open-source file format designed for high-quality, customizable singing synthesis.

Python 11 5 Updated Nov 6, 2024

a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine

Python 32 7 Updated Oct 15, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,101 5,040 Updated Oct 10, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,295 10,161 Updated Jun 18, 2024

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Python 135 11 Updated Oct 3, 2023

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Python 321 42 Updated Nov 4, 2024

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,712 285 Updated Nov 8, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,066 2,513 Updated Nov 10, 2024
Jupyter Notebook 140 15 Updated Jan 7, 2024

A minimum inference engine for DiffSinger

Python 34 8 Updated Apr 5, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,281 100 Updated Sep 24, 2023

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Python 231 12 Updated Feb 29, 2024

PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

5 1 Updated Jan 25, 2023

A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023

Python 24 2 Updated Sep 9, 2023

变声技术综合评比

Python 1 2 Updated Sep 14, 2023

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

Python 111 10 Updated Nov 25, 2023

singing voice change based on whisper, and lora for singing voice clone

Python 623 78 Updated Nov 3, 2023

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Python 72 14 Updated Aug 3, 2021

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,440 222 Updated Oct 14, 2024

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

Python 149 19 Updated Feb 1, 2023

Official implementation of SawSing (ISMIR'22)

Python 254 37 Updated Aug 28, 2022
Python 185 16 Updated Dec 29, 2022

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 318 44 Updated Feb 21, 2022
Next