You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First of all, thank you very much for putting out your work. I used it to derive my own use-case however the loss never goes down after it reaches 0.02, could you please?
I have been working on an AutoVC project. I am kinda stuck because my AutoVC model is not learning beyond a point. No matter how hard I try, I cannot get the loss below 0.02 See below.
I have setup the colab notebook in such a way that you dont have to edit it. Simple edit following line with correct model path after the autovc training 'hp.m_avc.gen.best_model_path = "/content/AVC/static/model_chk_pts/autovc/final_1000.pth"'
Here are the initial and predicted mel-spects. The predicted one is definitely not the way it should be. Left one is the original audio used and the right one is the predictions of the postnet.
The text was updated successfully, but these errors were encountered:
Hi There,
First of all, thank you very much for putting out your work. I used it to derive my own use-case however the loss never goes down after it reaches 0.02, could you please?
I have been working on an AutoVC project. I am kinda stuck because my AutoVC model is not learning beyond a point. No matter how hard I try, I cannot get the loss below 0.02 See below.
Because of this the final mel-spect is not as expected and hence audio conversion from that mel is not as expected. Could you please help me?
Github - https://github.com/gkv856/end2end_auto_voice_conversion
Colab - https://github.com/gkv856/KaggleData/blob/main/AutoVC_training_.ipynb
I have setup the colab notebook in such a way that you dont have to edit it. Simple edit following line with correct model path after the autovc training 'hp.m_avc.gen.best_model_path = "/content/AVC/static/model_chk_pts/autovc/final_1000.pth"'
Here are the initial and predicted mel-spects. The predicted one is definitely not the way it should be. Left one is the original audio used and the right one is the predictions of the postnet.
The text was updated successfully, but these errors were encountered: