How to get a new spk2info.dict? #14

Open
gu76h opened this issue Aug 4, 2023 · 4 comments

Comments

@gu76h

gu76h commented Aug 4, 2023

I want to train a new model on a different dataset, but I can't find a way to generate a new spk2info.dict.

@auspicious3000
Owner

It's very simple. The lo and hi are fixed values, one pair for male voices and one pair for female voices. The speaker embeddings can be extracted using the well-known Resemblyzer.
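
A minimal sketch of the recipe described above, under several assumptions that the thread does not confirm: that spk2info.dict is a pickled Python dict keyed by speaker ID, that each entry starts with the embedding followed by lo and hi, and that the pitch bounds in `F0_RANGE` are reasonable (they are placeholder values, not taken from this repository). The helper names `build_spk2info` and `save_spk2info` are hypothetical. The original file also appears to carry an extra trailing value per speaker, which is not reproduced here. Check the layout against the repository's data loader before training.

```python
import pickle
from functools import lru_cache
from pathlib import Path

# ASSUMPTION: placeholder pitch bounds in Hz per gender. The thread only says
# lo/hi are fixed per gender; verify the actual values against the repository.
F0_RANGE = {"M": (50, 250), "F": (100, 600)}


@lru_cache(maxsize=1)
def _get_encoder():
    # Imported lazily so the rest of the module works without Resemblyzer.
    from resemblyzer import VoiceEncoder  # pip install resemblyzer
    return VoiceEncoder()


def resemblyzer_embed(wav_paths):
    """Return one speaker-level embedding computed from several audio files."""
    from resemblyzer import preprocess_wav
    wavs = [preprocess_wav(Path(p)) for p in wav_paths]
    # embed_speaker averages the utterance embeddings and L2-normalizes the result.
    return _get_encoder().embed_speaker(wavs)


def build_spk2info(speaker_wavs, speaker_gender, embed_fn=resemblyzer_embed):
    """Map speaker ID -> (embedding, lo, hi).

    speaker_wavs:   dict of speaker ID -> list of audio file paths
    speaker_gender: dict of speaker ID -> "M" or "F"
    ASSUMPTION: the tuple layout is a guess based on this thread.
    """
    spk2info = {}
    for spk, wav_paths in speaker_wavs.items():
        lo, hi = F0_RANGE[speaker_gender[spk]]
        spk2info[spk] = (embed_fn(wav_paths), lo, hi)
    return spk2info


def save_spk2info(spk2info, out_path):
    """Write the dict with pickle (ASSUMPTION: the original file is a pickle)."""
    with open(out_path, "wb") as f:
        pickle.dump(spk2info, f)
```

Passing `embed_fn` explicitly makes it easy to swap in another speaker encoder or a stub when checking the dict structure without loading the model.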

@gu76h
Author

gu76h commented Aug 8, 2023

So the dict only needs the speaker embedding, lo, and hi? Is the last value after these unused?

@freds0

freds0 commented Aug 11, 2023

I developed this script, which uses parselmouth or pyreaper. It needs some adjustments: I'm creating an embedding for every file, whereas I believe the original averages the embeddings per speaker.

create_contentvec_dict.zip
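
One possible way to make that adjustment is to collapse per-file embeddings into a single vector per speaker. The sketch below is not taken from the attached script: `average_per_speaker` and its two input dicts are hypothetical, and the final L2 normalization is an assumption chosen to mirror what Resemblyzer's `embed_speaker` does. Whether the original spk2info.dict was built this way is not confirmed in this thread.

```python
import math
from collections import defaultdict


def average_per_speaker(file_embeddings, file_to_speaker):
    """Average per-file embeddings into one L2-normalized vector per speaker.

    file_embeddings: dict of file name -> embedding (any sequence of floats)
    file_to_speaker: dict of file name -> speaker ID
    """
    grouped = defaultdict(list)
    for fname, emb in file_embeddings.items():
        grouped[file_to_speaker[fname]].append(list(emb))

    averaged = {}
    for spk, embs in grouped.items():
        # Element-wise mean across all files of this speaker.
        mean = [sum(col) / len(embs) for col in zip(*embs)]
        # ASSUMPTION: renormalize to unit length, as Resemblyzer does.
        norm = math.sqrt(sum(v * v for v in mean))
        averaged[spk] = [v / norm for v in mean] if norm > 0 else mean
    return averaged
```

The result can replace the per-file vectors when filling in each speaker's entry.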

@SandroChen

SandroChen commented Aug 14, 2023

@auspicious3000 Thank you for sharing the tool Resemblyzer. Here is another question, if you don't mind: is there a convenient way to get frame-aligned pseudo-label files for a new dataset? I know that the Montreal Forced Aligner can do this, but its output format is quite different from the .km file...
