Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Help for Singing Voice Synthesis #5923

Open
funmolde opened this issue Oct 8, 2024 · 7 comments
Open

Help for Singing Voice Synthesis #5923

funmolde opened this issue Oct 8, 2024 · 7 comments
Labels
Music Music processing Question Question

Comments

@funmolde
Copy link

funmolde commented Oct 8, 2024

Hello -_-
How to use SVS: Singing Voice Synthesis ?

Windows 11 64bits

Thank you for your help

@funmolde funmolde added the Question Question label Oct 8, 2024
@sw005320 sw005320 added the Music Music processing label Oct 8, 2024
@sw005320
Copy link
Contributor

sw005320 commented Oct 8, 2024

@ftshijt, can you answer it?

@ftshijt
Copy link
Collaborator

ftshijt commented Oct 10, 2024

For training/inference/evaluation, please check https://espnet.github.io/espnet/recipe/svs1.html
For pre-trained models, please check https://huggingface.co/models?other=singing-voice-synthesis&sort=trending&search=espnet

We are also actively working towards an interactive demo (which should be ready in 2 weeks, and will be presented in this year's ACMMM demo, together with https://arxiv.org/abs/2409.07226)

@funmolde
Copy link
Author

Thank you -_-
I tried to use it but I don't understand the doc🤯 . i have Windows no linux.
There are no pre-trained models: in English, French, Spanish.
I'm waiting for the interactive demo. 😀

@migperfer
Copy link

Hi!
I'm also interested in this!
Any news on the interactive demo? :)

@funmolde
Copy link
Author

Hi! I'm also interested in this! Any news on the interactive demo? :)
Hello , here's the demo
https://huggingface.co/spaces/espnet/svs

@migperfer
Copy link

Many thanks! :)

@ftshijt
Copy link
Collaborator

ftshijt commented Dec 27, 2024

Thanks for sharing! @funmolde

Yeah, please refer to the model there @migperfer However, the demo was launched in CPU, so it is largely blocked to decoding speed. If you can use GPU for decoding, it would be super fast in general.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Music Music processing Question Question
Projects
None yet
Development

No branches or pull requests

4 participants