This code is designed to read chat messages from YouTube and then utilize OpenAI's GPT-3 language model to generate responses. The output from GPT-3 is then read out loud using a TTS (Text-to-Speech) engine provided by ElevenLabs.
Install dependencies
git clone https://github.com/Koischizo/AI-Vtuber/
cd AI-Vtuber
pip install -r requirements.txt
It also requires ffmpeg
to be installed
Edit the variables EL_key
and OAI_key
in config.json
EL_key
is the API key for ElevenLabs. Found in Profile Settings
OAI_key
is the API key for OpenAI. Found here
Then run run.py
python run.py -id STREAMID
python run.py -id STREAMID -tts EL
then you're set
Replace STREAMID
with the stream's ID that you can find on the Youtube Stream link
You can change the voice by changing voice
in config.json
. You can find the ID's here in Get Voices
I used This VTS plugin and VB Audio cable to make her mouth move and be able to play music at the same time
Please note that this project was created solely for fun and as part of a YouTube video, so the quality and reliability of the code may be questionable. Also, after the completion of the project checklist, there won't be much activity in updating or improving this repository. Nonetheless, we hope that this project can serve as a source of inspiration for anyone interested in building their own AI Vtuber.
- Clean up
- GUI
- Executables (exe, bat or sh)
- Extra features (maybe) (Prompt injection protection, questions only mode, virtual audio)
This program is under the MIT license