
Model Saving & Reloading #84

Closed
mariusmerkle opened this issue Feb 12, 2021 · 3 comments
@mariusmerkle

Hi,

I am studying how transfer learning can enhance the training of physics-informed neural networks. NeuroDiffEq sparked my interest, and I was wondering whether it is possible to

  1. save a trained model, i.e. the parameters of the network and its architecture

  2. reload the saved model and continue training from that non-random state.

@shuheng-liu
Member

shuheng-liu commented Feb 12, 2021

Hi Marius. Yes, it's easy to do that as long as you know the basic usage of PyTorch. For saving and loading models, here's a useful link. In the context of neurodiffeq, you want to perform the following steps:

  1. Make sure you have the model class `MyNetwork` saved in some file `model.py`. Note that `MyNetwork` must be a subclass of `torch.nn.Module`.
  2. Create one or more (depending on the number of functions you are solving for) `MyNetwork` instances:
     ```python
     my_nets = [MyNetwork(...), MyNetwork(...), ...]
     ```
  3. Instantiate your solver and pass in your model(s):
     ```python
     solver = Solver1D(
         ...
         nets=my_nets,
     )
     ```
  4. Do the training and get your networks. Currently, neurodiffeq doesn't make a copy of the networks passed to it, so `solver.nets` is the same object as the `my_nets` created earlier:
     ```python
     solver.fit(max_epochs=xxx, ...)
     my_nets = solver.nets  # you can skip this step if you still have access to `my_nets` created earlier
     ```
  5. Save your models:
     ```python
     torch.save({f'net_{i}': net.state_dict() for i, net in enumerate(my_nets)}, YOUR_MODEL_PATH)
     ```
  6. In another script, instantiate networks with exactly the same architecture and load the saved weights:
     ```python
     loaded_nets = [MyNetwork(...), MyNetwork(...), ...]
     checkpoint = torch.load(YOUR_MODEL_PATH)
     for i, net in enumerate(loaded_nets):
         net.load_state_dict(checkpoint[f'net_{i}'])
     ```
  7. Redo steps 3–4, but pass `loaded_nets` instead of `my_nets`.
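
The steps above can be sketched end-to-end in plain PyTorch. Here `MyNetwork` is a stand-in fully-connected network and `MODEL_PATH` is a hypothetical file name; in practice you would train the nets through your `Solver1D` between creation and saving:

```python
import torch
import torch.nn as nn

# Stand-in for the user's MyNetwork; any nn.Module subclass works.
class MyNetwork(nn.Module):
    def __init__(self, hidden=32):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(1, hidden), nn.Tanh(), nn.Linear(hidden, 1)
        )

    def forward(self, x):
        return self.layers(x)

MODEL_PATH = "nets.pt"  # hypothetical path

# Save: one state_dict per network, keyed by index
my_nets = [MyNetwork(), MyNetwork()]
torch.save(
    {f"net_{i}": net.state_dict() for i, net in enumerate(my_nets)},
    MODEL_PATH,
)

# Load: rebuild the same architectures, then restore the weights
loaded_nets = [MyNetwork(), MyNetwork()]
checkpoint = torch.load(MODEL_PATH)
for i, net in enumerate(loaded_nets):
    net.load_state_dict(checkpoint[f"net_{i}"])
```

After loading, passing `loaded_nets` to a fresh solver via `nets=loaded_nets` resumes training from the saved, non-random state.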

@mariusmerkle
Author

That sounds great! And it is possible to use both the Adam optimiser and L-BFGS/L-BFGS-B, right?

@shuheng-liu
Member

Most optimizers are currently supported, except LBFGS, which is a little tricky (see #83). Luckily, a solution was proposed just now, though we still need to run the tests.

I'm not familiar with L-BFGS-B, but it appears that this optimizer has not been implemented in PyTorch (see here). So currently, you can't use L-BFGS-B without implementing it yourself.
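
Setting neurodiffeq aside, a minimal plain-PyTorch sketch shows what makes `torch.optim.LBFGS` different from Adam: its `step()` requires a closure, because the optimizer may re-evaluate the loss several times per step (the toy regression target here is purely illustrative):

```python
import torch
import torch.nn as nn

# Toy problem: fit y = 2x + 1 with a single linear layer
net = nn.Linear(1, 1)
x = torch.linspace(0, 1, 16).reshape(-1, 1)
y = 2 * x + 1

opt = torch.optim.LBFGS(net.parameters(), lr=0.1, max_iter=20)

def closure():
    # LBFGS calls this closure (possibly multiple times per step)
    # to re-evaluate the loss during its line search.
    opt.zero_grad()
    loss = ((net(x) - y) ** 2).mean()
    loss.backward()
    return loss

for _ in range(10):
    loss = opt.step(closure)  # unlike Adam, step() takes the closure
```

Adam's `step()` takes no such closure, which is why wrapping both optimizer families behind one training loop takes extra care.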
