-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Decoding error in DAC when using HuggingFace models #5944
Comments
Additionally, I just found another bug (although it's very minor, so I'm commenting here instead of opening a new issue).
This error seems to be caused by calling the wrong script name in the versa package. Thanks! |
Thanks for mentioning it! Yeah, I'm recently changing some of the interface for versa and plan to update the usage here after I have a converged one at versa. Will have a PR soon to fix. |
There were some unused hyper-parameters set in the previous version of code where the model is pre-trained on. Thanks for pointing out this issue, will working on fix it. |
Thank you for your answer! (And apologies for the delayed reply; I've tried the one-pass operation.)
Anyway, thank you for your work; it's been extremely helpful! |
Describe the bug
I am encountering a decoding error while using the DAC model in conjunction with HuggingFace models.
This issue seems to arise from discrepancies between
dac.py
configuration (i.e., https://github.com/espnet/espnet/blob/master/espnet2/gan_codec/dac/dac.py#L645) andconfig.yaml
uploaded alongside the model (e.g., https://huggingface.co/espnet/libritts_dac_16k/blob/main/exp/codec_train_dac_16k_raw_fs16000/config.yaml#L184).Therefore, this error occurred when using other HF models such as
espnet/amuse_dac_16k
(https://huggingface.co/espnet/amuse_dac_16k/blob/main/exp_16k/codec_train_dac_fs16000_raw_fs16000/config.yaml#L184).Basic environments:
3.10.15 (main, Oct 3 2024, 07:27:34) [GCC 11.2.0]
espnet 202409
pytorch 2.3.0
4c55d6c9071fb36addcc8426f2befd8f9a1bd11e
Fri Nov 1 23:20:28 2024 +0200
Environments from
torch.utils.collect_env
:Task information:
To Reproduce
Steps to reproduce the behavior:
cd egs2/libritts/codec1
run.sh
withstage==6
anddownload_model=="espnet/libritts_dac_16k"
e.g.,:
Error logs
The text was updated successfully, but these errors were encountered: