
[Feature] Add OptimizerHook #716

Merged: 5 commits merged into main on Nov 26, 2022

Conversation

@aakhundov (Contributor) commented Nov 26, 2022

Description

A new type of hook, OptimizerHook, is added, as suggested in #704.

Motivation and Context

See the suggested change in #704.

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • New feature (non-breaking change which adds core functionality)
  • Documentation (initial update in the documentation)

Backward compatibility with the existing optimizer parameter of Trainer.__init__ is preserved.
Minor breaking change: the grad_norm key previously written to losses_td is now grad_norm_0.
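
As a hedged illustration of the rename (only the key names come from this PR; the losses_td variable below stands for the TensorDict returned by the trainer's optimization step):

```python
# Before this change the total gradient norm was logged under "grad_norm";
# with one (or several) OptimizerHook(s) the keys are now indexed per hook.
grad_norm = losses_td["grad_norm_0"]  # previously: losses_td["grad_norm"]
```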

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

The documentation change is minimal: the optimizer hook is briefly mentioned in the list of possible hooks, and the OptimizerHook class is added to the "Trainer and hooks" table. Broader documentation coverage of the new hook type is probably needed.

cc @BY571

@facebook-github-bot added the "CLA Signed" label on Nov 26, 2022
@aakhundov (Contributor, Author) commented:

@vmoens As currently implemented, the new hook's signature differs from that of the other hooks: besides accepting (and returning) a TensorDict instance, it also accepts the clip-norm parameters and the index i into the trainer._optimizer_ops list (used to distinguish each hook's grad_norm_{i} key added to the returned TensorDict).
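
A minimal sketch of the call signature described above. The parameter names clip_grad_norm and clip_norm mirror the Trainer arguments; the body and everything else are assumptions for illustration, not the merged code:

```python
from typing import Optional

import torch
from tensordict import TensorDict


class OptimizerHook:
    """Hypothetical sketch of the described call signature, not the merged implementation."""

    def __init__(self, optimizer: torch.optim.Optimizer):
        self.optimizer = optimizer

    def __call__(
        self,
        losses_td: TensorDict,
        clip_grad_norm: bool,
        clip_norm: Optional[float],
        index: int,
    ) -> TensorDict:
        # Sum the individual loss terms (keys starting with "loss") and backpropagate.
        loss = sum(v for k, v in losses_td.items() if k.startswith("loss"))
        loss.backward()
        params = [p for g in self.optimizer.param_groups for p in g["params"]]
        if clip_grad_norm and clip_norm is not None:
            # Clip and obtain the total gradient norm in one call.
            grad_norm = torch.nn.utils.clip_grad_norm_(params, clip_norm)
        else:
            grad_norm = torch.norm(
                torch.stack([p.grad.norm() for p in params if p.grad is not None])
            )
        self.optimizer.step()
        self.optimizer.zero_grad()
        # Each optimizer hook writes its own grad_norm_{index} entry into the TensorDict.
        losses_td.set(f"grad_norm_{index}", grad_norm.detach())
        return losses_td
```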

If we want the optimizer hook to be more similar to other hooks, we could:

  1. Add clip_norm and clip_grad_norm constructor parameters to OptimizerHook.__init__. Then, when creating the "default" optimizer hook from its optimizer argument, Trainer.__init__ would simply pass its own clip_norm and clip_grad_norm arguments as-is to OptimizerHook.__init__. Bonus: users would be able to set a different gradient clipping configuration for each hook separately (see the sketch after this list).

  2. Return the grad_norm value from OptimizerHook.__call__ instead of the updated TensorDict. Then, in Trainer._optimizer_hook, we could set grad_norm in the losses_td under the respective index, without needing to pass the index to the hook call.

  3. Move the detaching of losses_td out of Trainer._optimizer_hook, so that it is done by the calling Trainer.optim_steps right after Trainer._optimizer_hook returns.
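
A hedged sketch of option 1 (and, in passing, options 2 and 3), under the same assumptions as the sketch above; names and body are illustrative, not the merged implementation:

```python
from typing import Optional

import torch
from tensordict import TensorDict


class OptimizerHook:
    """Hypothetical variant: clipping configured per hook (option 1)."""

    def __init__(
        self,
        optimizer: torch.optim.Optimizer,
        clip_grad_norm: bool = True,
        clip_norm: Optional[float] = None,
    ):
        self.optimizer = optimizer
        self.clip_grad_norm = clip_grad_norm
        self.clip_norm = clip_norm

    def __call__(self, losses_td: TensorDict) -> torch.Tensor:
        # Only the TensorDict is passed in; the grad norm is returned (option 2)
        # and detaching losses_td is left to the caller (option 3).
        loss = sum(v for k, v in losses_td.items() if k.startswith("loss"))
        loss.backward()
        params = [p for g in self.optimizer.param_groups for p in g["params"]]
        if self.clip_grad_norm and self.clip_norm is not None:
            grad_norm = torch.nn.utils.clip_grad_norm_(params, self.clip_norm)
        else:
            grad_norm = torch.norm(
                torch.stack([p.grad.norm() for p in params if p.grad is not None])
            )
        self.optimizer.step()
        self.optimizer.zero_grad()
        return grad_norm.detach()
```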

Importantly, even after the above modifications, Trainer._optimizer_hook will be different from other Trainer._*_hook functions, as it will need to set grad_norm in its TensorDict argument besides the usual for-loop over the hook instances.
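
For context, a method-level sketch of what Trainer._optimizer_hook could then look like; the _optimizer_ops and losses_td names follow the discussion above, and this is an assumption for illustration rather than the merged code:

```python
# Hypothetical sketch of Trainer._optimizer_hook under options 1-3 above;
# self._optimizer_ops holds the registered OptimizerHook instances.
def _optimizer_hook(self, losses_td):
    for i, op in enumerate(self._optimizer_ops):
        grad_norm = op(losses_td)
        # Unlike other Trainer._*_hook functions, this one also writes per-hook keys.
        losses_td.set(f"grad_norm_{i}", grad_norm)
    return losses_td
```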

Please let me know how you'd like to proceed. Thanks!

codecov bot commented Nov 26, 2022

Codecov Report

Merging #716 (860e922) into main (3105819) will increase coverage by 0.12%.
The diff coverage is 97.87%.

@@            Coverage Diff             @@
##             main     #716      +/-   ##
==========================================
+ Coverage   88.77%   88.89%   +0.12%     
==========================================
  Files         122      122              
  Lines       21151    21273     +122     
==========================================
+ Hits        18776    18910     +134     
+ Misses       2375     2363      -12     
| Flag | Coverage Δ |
| --- | --- |
| habitat-gpu | 24.35% <18.75%> (-0.02%) ⬇️ |
| linux-cpu | 85.00% <97.87%> (+0.12%) ⬆️ |
| linux-gpu | 85.88% <97.87%> (+0.13%) ⬆️ |
| linux-jumanji | 29.28% <18.75%> (-0.03%) ⬇️ |
| linux-outdeps-gpu | 72.28% <97.87%> (+0.20%) ⬆️ |
| linux-stable-cpu | 84.86% <97.87%> (+0.13%) ⬆️ |
| linux-stable-gpu | 85.55% <97.87%> (+0.12%) ⬆️ |
| linux_examples-gpu | 42.68% <18.75%> (-0.06%) ⬇️ |
| macos-cpu | 84.68% <97.87%> (+0.13%) ⬆️ |
| olddeps-gpu | 74.94% <97.87%> (+0.19%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
| --- | --- |
| torchrl/trainers/trainers.py | 77.66% <93.75%> (+2.71%) ⬆️ |
| test/test_trainer.py | 98.23% <100.00%> (+0.30%) ⬆️ |
| torchrl/envs/vec_env.py | 69.06% <0.00%> (+0.50%) ⬆️ |


@vmoens added the "enhancement" label on Nov 26, 2022
@vmoens (Contributor) left a review comment:

Great stuff!
Thanks a lot for this!

@vmoens (Contributor) commented Nov 26, 2022

Regarding this:

> Importantly, even after the above modifications, Trainer._optimizer_hook will be different from other Trainer._*_hook functions, as it will need to set grad_norm in its TensorDict argument besides the usual for-loop over the hook instances.

It's ok, it's backward compatible and it suits most needs.
Thanks again for this!
