Differences in mel-spectrogram #97
Comments
Your frequency axis and time axis are swapped. |
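A minimal sketch of the orientation fix, assuming the spectrogram is a NumPy array stored as (frames, mel bins), which is what the sizes reported in this issue suggest. The helper name `to_plot_orientation` is hypothetical and not part of the repository. Image-style plots put array rows on the vertical axis, so the array is transposed to make frequency run vertically and time horizontally.

```python
import numpy as np


def to_plot_orientation(mel):
    """Return a (frames, n_mels) array as (n_mels, frames) for plotting.

    Assumption: the input is stored time-major, e.g. (376, 80).
    """
    mel = np.asarray(mel)
    if mel.ndim != 2:
        raise ValueError("expected a 2-D mel-spectrogram")
    # Rows become mel bins (vertical axis), columns become frames (time).
    return mel.T


# Placeholder data with the demo size from this issue.
demo = np.zeros((376, 80))
print(to_plot_orientation(demo).shape)  # (80, 376)
```

The transposed array can then be passed to whatever plotting call is already in use.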
Thanks, you are right, the axes were swapped. These are the new plots: But there are still some differences: the sizes of the spectrograms are not the same. I used the code in make_spect.py, so I expected to get the same results as the demo. Thanks |
They should only differ by the amount of silence before and after. Please confirm if this is true. |
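One way to check this is to trim the silent frames from both spectrograms and compare what remains. The sketch below is only an illustration under stated assumptions: the arrays are (frames, 80), values are scaled so that silence is close to 0, and the helper `trim_silent_frames` and its `threshold` value are hypothetical, not part of the repository. The data is synthetic and only mimics the two sizes reported in this issue.

```python
import numpy as np


def trim_silent_frames(mel, threshold=0.1):
    """Drop leading and trailing frames whose mean level is <= threshold.

    Assumption: mel has shape (frames, n_mels) and silence is near 0.
    """
    mel = np.asarray(mel)
    frame_level = mel.mean(axis=1)            # one value per frame
    active = np.flatnonzero(frame_level > threshold)
    if active.size == 0:
        return mel[:0]                        # everything is silence
    return mel[active[0]:active[-1] + 1]


# Synthetic stand-ins: the same "speech" padded with different silence.
rng = np.random.default_rng(0)
speech = rng.uniform(0.4, 0.9, size=(300, 80))
demo = np.concatenate([np.zeros((30, 80)), speech, np.zeros((46, 80))])
mine = np.concatenate([np.zeros((100, 80)), speech, np.zeros((75, 80))])
print(demo.shape, mine.shape)                 # (376, 80) (475, 80)

demo_trimmed = trim_silent_frames(demo)
mine_trimmed = trim_silent_frames(mine)
print(demo_trimmed.shape, mine_trimmed.shape)
print(np.allclose(demo_trimmed, mine_trimmed))
```

If the trimmed arrays from the real files still differ in length or content, the difference comes from something other than the surrounding silence.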
@amiteliav @auspicious3000 Hi, I am trying to get good conversion quality using this repo, but I cannot. Could you please let me know which hyper-parameters you use for "dim_neck", "freq", "batch_size", and "num_itrs"? I am using the small dataset that is provided in this repo. Could you please let me know if you use any other dataset that gives you a good conversion? |
Hi
I am working with your repository, and it is really good! Thanks.
I'm trying to generate my own mel-spectrogram with your code in "make_spect.py".
Here is the demo mel-spectrogram: p225_003
Here is my mel-spectrogram: (p225_003)
The sizes are not the same:
Demo: (376, 80)
Mine: (475, 80)
You can also see that the spectrograms don't look the same: the demo covers the whole range of the spectrogram,
whereas mine doesn't. Mine looks the same but more compressed.
When using the demo spectrogram, the conversion works.
When using my spectrogram, it doesn't.
Any idea why the spectrograms are different, and how to correct it?
Thanks,
Amit