[ready] show dominant parameters #705
- for p in param_group:
+ for p, named_p in zip(param_group, group_params_names):
What if `group_param_names` is `None`?
It cannot be `None`. The most important part of this PR is tracking the name of each parameter.
The default value was None.
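To make the concern above concrete: `zip` raises a `TypeError` as soon as one of its arguments is `None`, so a `None` default would fail at the loop. Below is a minimal sketch of one possible guard. The helper name `iter_named_params` and the `param_{i}` placeholder names are hypothetical and are not taken from this PR.

```python
from typing import Iterator, List, Optional, Tuple


def iter_named_params(
    param_group: List[object],
    group_params_names: Optional[List[str]] = None,
) -> Iterator[Tuple[object, str]]:
    """Yield (param, name) pairs. Hypothetical helper, not from the PR."""
    if group_params_names is None:
        # zip(param_group, None) would raise TypeError, so fall back to
        # placeholder names (an assumption made for this sketch).
        group_params_names = [f"param_{i}" for i in range(len(param_group))]
    if len(group_params_names) != len(param_group):
        raise ValueError("expected exactly one name per parameter")
    return zip(param_group, group_params_names)


# Usage: the loop from the diff, with the guard applied.
for p, named_p in iter_named_params(["w0", "w1"]):
    print(named_p, p)
```

Whether a fallback like this is preferable to making the names a required argument is a design choice for the PR author.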
Example log with command
Fixes issue #697.
The main idea is to pass the names of each `batch` of parameters through the various functions. An option `--show-dominant-parameters` is added: `True` for debugging and `False` for the normal case. `param_rms` is updated with `p` in the function `ScaledAdam::_show_gradient_dominating_parameter`.
Does this design meet the requirement? @danpovey
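For readers unfamiliar with the idea, here is a rough sketch of how a boolean `--show-dominant-parameters` option and a name-aware "dominant parameter" report could fit together. It is a simplified illustration, not the implementation in this PR. The `str2bool` helper, the `False` default, and the `dominant_parameter` function are assumptions, and the dominance measure used here (share of the total squared gradient) is a stand-in that ignores `param_rms`.

```python
import argparse
from typing import Dict, List, Tuple


def str2bool(value: str) -> bool:
    """Hypothetical helper: parse 'True'/'False' style strings."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--show-dominant-parameters",
        type=str2bool,
        default=False,  # assumed default: off in the normal case
        help="Report which named parameter dominates the gradient.",
    )
    return parser


def dominant_parameter(grads: Dict[str, List[float]]) -> Tuple[str, float]:
    """Return (name, share) for the largest squared-gradient contributor.

    Assumes `grads` is non-empty. Simplified stand-in for illustration.
    """
    sumsq = {name: sum(g * g for g in vals) for name, vals in grads.items()}
    total = sum(sumsq.values())
    name = max(sumsq, key=sumsq.get)
    share = sumsq[name] / total if total > 0 else 0.0
    return name, share


args = build_parser().parse_args(["--show-dominant-parameters", "True"])
if args.show_dominant_parameters:
    name, share = dominant_parameter({"encoder.w": [3.0], "decoder.w": [4.0]})
    print(f"dominant parameter: {name} ({share:.0%} of squared gradient)")
```

Keeping names alongside the values is what lets the report identify a parameter by name rather than by its position in the group.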