voice-cloning

Here are 129 public repositories matching this topic...

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

python deep-learning tensorflow pytorch tts voice-cloning

Updated Aug 14, 2024
Python

RVC-Boss / GPT-SoVITS

As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning)

text-to-speech tts voice-cloning vits voice-clone voice-cloneai

Updated Dec 19, 2024
Python

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Dec 24, 2024
Python

FunAudioLLM / CosyVoice

Multilingual large voice generation model with full-stack support for inference, training, and deployment.

python text-to-speech japanese chatbot multi-lingual tts english chinese korean cantonese natural-language-generation cross-lingual fine-grained fine-tuning voice-cloning audio-generation chatgpt gpt-4o cosyvoice

Updated Dec 18, 2024
Python

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team

localization dubbing video-translation voice-cloning ai-translation

Updated Dec 20, 2024
Python

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

text-to-speech speech speech-synthesis prosody voice-cloning voice-cloneai

Updated Aug 1, 2024
Jupyter Notebook

abus-aikorea / voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.

text-to-speech translator translation podcasts tts speech-synthesis subtitles speech-recognition webui speech-to-text transcription gradio stt whisper voice-conversion voice-cloning yt-dlp faster-whisper

Updated Dec 22, 2024
Python

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Dec 24, 2024
Python

voice-cloning-app / Voice-Cloning-App

A Python/PyTorch app for easily synthesising human voices

python text-to-speech deep-learning pytorch tts voice-cloning tacotron2

Updated Dec 2, 2024
Python

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

DrewThomasson / ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and XTTS from Coqui TTS, with optional voice cloning and support for multiple languages