FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
-
Updated
Jan 25, 2024 - Python
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Easily train a good VC model with voice data <= 10 mins!
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
a custom comfyui node for coqui-ai/TTS's xtts module! support 17 languages voice cloning and tts
Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Audio tour guide website with K-POP celebrity's voice using Text-To-Speech model
Client project for Fontys University of Applied Sciences. Semester 4 Creative Technology
Serverless Voice Cloning Application on AWS
This repository provides a Google Colab notebook for voice cloning using the Coqui XTTS-V2 model. It allows users to clone voices from audio samples and generate speech in multiple languages.
Add a description, image, and links to the voicecloning topic page so that developers can more easily learn about it.
To associate your repository with the voicecloning topic, visit your repo's landing page and select "manage topics."