
don't save Modules in hparams #915

Closed

Conversation

KnathanM
Member

Description

Related to #898, this PR addresses the fact that we double-save nn.Modules in both the hparams and the state_dict. It follows David's idea here.

The main downside of this PR is that it requires considerably more code than continuing to double-save. The main upside is that model files will be a little smaller. Roughly, the idea looks like the sketch below.
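This is only a minimal sketch; the class and argument names (ExampleModel, hidden_dim, task_weights) are illustrative, not Chemprop's actual API:

```python
import torch
from torch import nn
from lightning import pytorch as pl


class ExampleModel(pl.LightningModule):
    """Keep only plain values in hparams; let Module/tensor weights live solely in the state_dict."""

    def __init__(self, hidden_dim: int = 300, task_weights=None):
        super().__init__()
        # exclude the tensor from hparams so it isn't pickled there;
        # it will round-trip through the state_dict instead
        self.save_hyperparameters(ignore=["task_weights"])
        task_weights = torch.ones(1) if task_weights is None else task_weights
        self.register_buffer("task_weights", task_weights)
        self.ffn = nn.Linear(hidden_dim, 1)

    def forward(self, x):
        return self.ffn(x) * self.task_weights
```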

Other notes

metrics in MPNN is a list of Modules, so the whole list is pickled and unpickled on save and load. This differs from the other Modules, like criterion, which are rebuilt each time a model is loaded.

The default task_weights for criterion is an array, so I don't need to check whether task_weights is in the state_dict. For the other cases, the default is None/Identity, so I need to check whether corresponding parameters exist in the state_dict and, if so, build a placeholder Module with zeros (see the sketch below).
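Concretely, the check looks something like this; the checkpoint layout, the "V_d_transform.mean" key, and the ScaleTransform import path are assumptions for the example, since the real names depend on how the submodules are registered:

```python
import numpy as np
import torch
from torch import nn

from chemprop.nn.transforms import ScaleTransform  # import path assumed

state_dict = torch.load("model.ckpt", map_location="cpu")["state_dict"]

if "V_d_transform.mean" in state_dict:
    n = state_dict["V_d_transform.mean"].shape[-1]
    # build a placeholder filled with zeros; load_state_dict overwrites the values
    V_d_transform = ScaleTransform(np.zeros(n), np.zeros(n))
else:
    V_d_transform = nn.Identity()
```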

While working on this PR, I found that for multicomponent message passing, if a single block is shared between two components, the weights for that block appear twice in the state_dict. I don't know whether the weights just have two keys, or whether they are truly saved twice in the model file. I'm not sure if this is something we should try to change.
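For what it's worth, a toy check outside of Chemprop suggests the two keys alias the same underlying storage, in which case torch.save should write the data only once; I haven't confirmed this on an actual multicomponent checkpoint:

```python
import torch
from torch import nn

# register the same block under two names and inspect the state_dict
shared = nn.Linear(4, 4)
blocks = nn.ModuleList([shared, shared])

sd = blocks.state_dict()
print(list(sd.keys()))  # ['0.weight', '0.bias', '1.weight', '1.bias']
print(sd["0.weight"].data_ptr() == sd["1.weight"].data_ptr())  # True: same storage
```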

Comment on lines +243 to +244
.squeeze(0)
.numpy()
Member Author


_ScaleTransformMixin unsqueezes and converts to a torch tensor. If I don't include .numpy() here, the line scale = torch.cat([torch.ones(pad), torch.tensor(scale, dtype=torch.float)]) emits this warning:

UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
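For reference, a minimal repro of that warning outside of Chemprop:

```python
import torch

scale = torch.rand(3)

t1 = torch.tensor(scale, dtype=torch.float)          # emits the copy-construct UserWarning
t2 = torch.tensor(scale.numpy(), dtype=torch.float)  # no warning
```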

@davidegraff
Contributor

Woof, on second thought, I don't think my idea was a good one. I don't think it's that much of a problem to double-save like 32 floating point numbers.

@KnathanM
Member Author

Okay, sounds like we will go with #898 for saving the scalers.

Questions:
I added some unit tests here. Are those still worth merging in? They test whether the scalers are saved and loaded correctly, which is independent of the mechanics of how we save and load them.

I reorganized some of load_from_checkpoint and load_from_file by having the hyperparameters and state dict loaded in load_submodules. This made it possible to remove the separate load_from_file in MulticomponentMPNN. Is this worth merging in? A rough sketch of the shape is below.
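Illustrative sketch only; this isn't the exact diff, and the load_submodules signature here is just a placeholder:

```python
import torch
from lightning import pytorch as pl


class MPNN(pl.LightningModule):
    # ... existing __init__, forward, etc. ...

    @classmethod
    def load_submodules(cls, hparams: dict, state_dict: dict) -> dict:
        # rebuild criterion, transforms, etc. from hparams plus the state_dict
        ...

    @classmethod
    def load_from_file(cls, model_path, map_location=None, strict=True):
        d = torch.load(model_path, map_location=map_location)
        hparams, state_dict = d["hyper_parameters"], d["state_dict"]
        submodules = cls.load_submodules(hparams, state_dict)
        model = cls(**hparams, **submodules)
        model.load_state_dict(state_dict, strict=strict)
        return model
```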

KnathanM mentioned this pull request Jul 5, 2024
@KnathanM
Member Author

KnathanM commented Jul 5, 2024

I moved the tests to #955

I decided not to move the changes to load_from_file into another PR because I think having MPNN and MulticomponentMPNN each keep their own load_from_file may help people debug in the future.

KnathanM closed this Jul 5, 2024