gitmylo/audio-webui

Readme 💀x7

Please read

This code works on Python 3.10; I have not tested it on other versions. Some older versions will have issues. See: Common issues
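If you want to confirm your interpreter before installing, a minimal sketch of a version check could look like the following. The helper name `is_tested_python` and the constant `TESTED_VERSION` are made up for this illustration and are not part of the repository; the only fact taken from this readme is that 3.10 is the tested version.

```python
import sys

# Assumption for this sketch: 3.10 is the only version the readme names as tested.
TESTED_VERSION = (3, 10)


def is_tested_python(version_info=None):
    """Return True when the major.minor version matches the tested one."""
    v = sys.version_info if version_info is None else version_info
    return tuple(v[:2]) == TESTED_VERSION


if not is_tested_python():
    current = ".".join(str(part) for part in sys.version_info[:3])
    print(f"Warning: running on Python {current}; only Python 3.10 has been tested.")
```

The check only warns instead of exiting, since other versions are untested rather than known to fail.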

Installing

Installation is done automatically in a venv when run.bat or run.sh is run without the --skip-install flag.

Alternatively, run install.bat or install.sh to install only, without launching anything. To install into a custom environment that is already active, run install.bat --skip-venv or install.sh --skip-venv.

Running

Running should be as simple as running run.bat or run.sh, depending on your OS. Everything should get installed automatically. If you want to run with custom command line flags, copy run_user_example.(bat/sh) and put the flags you want applied on every run in the copy. Recommended flags (skip install and CPU offload) are already in the example.
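As a rough illustration of how the run script and flags fit together, here is a small sketch that picks the script for the current OS and appends flags to it. The function `build_run_command` is hypothetical and not part of this repository; it only assembles the command and does not launch anything. The script names and flags are the ones named in this readme.

```python
import platform


def build_run_command(flags, system=None):
    """Return the command (as a list) for starting the web UI with the given flags.

    `system` defaults to platform.system(); pass it explicitly to preview
    the command for another OS.
    """
    system = platform.system() if system is None else system
    # run.bat is the Windows entry point; run.sh is used everywhere else.
    script = "run.bat" if system == "Windows" else "./run.sh"
    return [script, *flags]


# The recommended flags mentioned above: skip install and CPU offload.
recommended = ["--skip-install", "--bark-cpu-offload"]
print(" ".join(build_run_command(recommended)))
```

Skipping the install step only makes sense after a first successful run, since that first run is what installs the packages.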

This has not been tested beyond two of my PCs. If there's an issue with running, please create an issue.

Command line flags

| Name | Args | Short | Usage | Description |
|---|---|---|---|---|
| --skip-install | [None] | -si | -si | Skip installing packages |
| --skip-venv | [None] | -sv | -sv | Skip creating/activating the venv; also skips install (for advanced users) |
| --bark-low-vram | [None] | [None] | --bark-low-vram | Use low VRAM for Bark |
| --bark-cpu-offload | [None] | [None] | --bark-cpu-offload | Use CPU offloading to save VRAM while still running on the GPU |
| --bark-use-cpu | [None] | [None] | --bark-use-cpu | Use the CPU for Bark |
| --bark-cloning-large-model | [None] | [None] | --bark-cloning-large-model | Use the larger voice cloning model (it hasn't been tested as much yet) |
| --tts-use-gpu | [None] | [None] | --tts-use-gpu | Use your GPU for TTS with the TTS library |
| --share | [None] | -s | -s | Share the gradio instance publicly |
| --username | username (str) | -u, --user | -u username | Set the username for gradio |
| --password | password (str) | -p, --pass | -p password | Set the password for gradio |
| --theme | theme (str) | [None] | --theme "gradio/soft" | Set the theme for gradio |
| --listen | [None] | -l | -l | Listen as a server, allowing other devices within your local network to access it (or devices outside it if the port is forwarded) |
| --port | port (int) | [None] | --port 12345 | Set a custom port to listen on; by default a port is picked automatically |
| --hide-pip-log | [None] | [None] | --hide-pip-log | Hide pip install logs; only show warnings and errors |
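To make the Args and Short columns concrete, here is a minimal sketch of how a few of these flags could be declared with Python's standard argparse module. This is an assumption about shape only, not the project's actual parser: the function `build_parser`, the help strings, and the chosen subset of flags are illustrative.

```python
import argparse


def build_parser():
    """Illustrative parser for a subset of the flags in the table above."""
    parser = argparse.ArgumentParser(description="Sketch of some audio-webui flags")
    # Flags with no argument ([None] in the Args column) become booleans.
    parser.add_argument("-si", "--skip-install", action="store_true",
                        help="Skip installing packages")
    parser.add_argument("--bark-cpu-offload", action="store_true",
                        help="Use CPU offloading to save VRAM")
    parser.add_argument("-s", "--share", action="store_true",
                        help="Share the gradio instance publicly")
    parser.add_argument("-l", "--listen", action="store_true",
                        help="Allow other devices to access the server")
    # Flags that take a value declare its type, matching (str) and (int).
    parser.add_argument("-u", "--user", "--username", dest="username", type=str,
                        help="Username for gradio")
    parser.add_argument("-p", "--pass", "--password", dest="password", type=str,
                        help="Password for gradio")
    parser.add_argument("--port", type=int, default=None,
                        help="Custom port; None lets a port be picked automatically")
    return parser


args = build_parser().parse_args(
    ["-si", "--bark-cpu-offload", "-l", "--port", "12345", "-u", "username"]
)
print(args.skip_install, args.listen, args.port, args.username)
```

In this sketch, flags that are not passed default to False (or None for valued flags), which mirrors the table's description of --port picking a port automatically when unset.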

Current goals and features

  • Text-to-audio
    • Text-to-speech
      • Bark
        • Speech generation
        • Voice cloning
          • Basic voice cloning
          • Accurate voice cloning
        • Disable stopping token option to let the AI decide how it wants to continue
    • AudioLDM text-to-audio generation
  • Audio-to-audio
    • Bark audio-to-audio
    • RVC (retrieval based voice conversion)
      • RVC training
  • Automatic-speech-recognition
    • Whisper speech recognition

More readme

  • Bark info
  • RVC info