-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于 loss = nan #2
Comments
你好,我检查了一下,模型应该没有问题。 |
好的,非常非常感谢! |
2333,不用谢,欢迎交流 |
你好 我用你给的sample data 运行 train_ECM.py 时遇到了loss = nan 的问题,这是什么原因呢?
以下是几个step的loss: 从第60步开始就nan了。
Start training ...
step 20, loss = 10.188582,perp: 17026.870
(0.380 sec/step)
step 40, loss = 5.171560,perp: 79.408
(0.380 sec/step)
step 60, loss = nan,perp: nan
(0.396 sec/step)
但是我运行 train.py时就可以收敛
Start training ...
step 20, loss = 3.336257,perp: 26.973
(0.323 sec/step)
step 40, loss = 2.431179,perp: 10.790
(0.398 sec/step)
step 60, loss = 1.425767,perp: 3.964
(0.397 sec/step)
step 80, loss = 0.656351,perp: 1.895
(0.386 sec/step)
是不是 ECM_model 存在一些bug?
The text was updated successfully, but these errors were encountered: