About the loss trend? #4
Comments
Large-model training often shows a staircase-shaped loss curve.
So is this overfitting, or is it normal behavior? @BaolanChen
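What we see these days is that the loss in large-model training often drops in a staircase pattern, and this is a normal training result. The exact cause is still a matter of speculation and ongoing study. As for how many epochs to train, it is best to evaluate metrics on your own validation data and decide from that.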
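A minimal sketch of that check, assuming you already log the per-step training loss and compute one validation metric per epoch. The function names `epoch_boundary_drops` and `best_epoch` are hypothetical helpers, not part of this repo, and the numbers are made up for illustration only.

```python
from statistics import mean


def epoch_boundary_drops(step_losses, steps_per_epoch, window=3):
    """Size of the loss drop at each epoch boundary.

    For every boundary, compare the mean loss over the last `window` steps
    of one epoch with the mean over the first `window` steps of the next.
    Large positive values are the "stairs" in a staircase-shaped curve.
    """
    drops = []
    n_epochs = len(step_losses) // steps_per_epoch
    for epoch in range(1, n_epochs):
        boundary = epoch * steps_per_epoch
        before = mean(step_losses[boundary - window:boundary])
        after = mean(step_losses[boundary:boundary + window])
        drops.append(before - after)
    return drops


def best_epoch(val_metric_per_epoch, higher_is_better=True):
    """Return the 1-based epoch with the best validation metric."""
    pick = max if higher_is_better else min
    indices = range(len(val_metric_per_epoch))
    return pick(indices, key=lambda i: val_metric_per_epoch[i]) + 1


if __name__ == "__main__":
    # Illustrative values only: 3 epochs of 4 steps each.
    step_losses = [2.0, 1.9, 1.8, 1.8, 1.2, 1.2, 1.1, 1.1, 0.6, 0.6, 0.5, 0.5]
    val_accuracy = [0.61, 0.68, 0.66]  # one value per epoch

    print(epoch_boundary_drops(step_losses, steps_per_epoch=4, window=2))
    print(best_epoch(val_accuracy))
```

In this toy data the training loss keeps stepping down at every epoch boundary, but the validation metric peaks at epoch 2, so the training loss alone would not tell you when to stop.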
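That said, overfitting in a large model is not always a bad thing. If the model has overfit to the output format, that may actually be good; if it has overfit to the answers, that is not what we want. Most of the time it comes down to problems in the data itself and the number of training epochs.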
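Hello, I have also been training a small multimodal model recently. Could you share the loss plot from your training run? I would like to see how the loss decreases.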