Saving and loading escnn models #1

ishaanb92 · 2022-05-03T13:20:55Z

Hi,

Firstly, thanks for making such an accessible library to implement equivariant models alongside such informative documentation!

I wanted to train a toy model with MNIST before moving on to a bigger architecture and chose the model provided in the model.ipynb notebook in the 'examples' folder. After plugging it into my training script, I saved it using the regular PyTorch save procedure:
torch.save(model.state_dict(), 'mnist_model_e2cnn_{}.pt'.format(n_orientations))

In my test script, when I try to load this model using:
model.load_state_dict(torch.load('mnist_model_e2cnn_{}.pt'.format(n_orientations), map_location='cpu'))

However, trying to load the model throws up the following error:

RuntimeError: Error(s) in loading state_dict for MNISTE2CNN:
Missing key(s) in state_dict: "block1.1.filter", "block2.0.filter", "block3.0.filter", "block4.0.filter", "block5.0.filter", "block6.0.filter".

I'm not sure what I'm doing incorrectly, is there a special procedure involved in saving models that use escnn.nn.SequentialModule to stack ops?

EDIT: The torch version I am using is 1.7.0

Cheers,
Ishaan

The text was updated successfully, but these errors were encountered:

Gabri95 · 2022-05-03T14:37:52Z

Hi @kilgore92

Thanks for opening the first issue! :)

You can probably solve this issue by calling model.eval() before storing and loading the model's state-dict.
The reason behind this behaviour is explained in the first warning block here.
Does it solve your problem?

Best,
Gabriele

ishaanb92 · 2022-05-03T15:07:03Z

Hi @Gabri95,

Thanks! That did the trick :)

Cheers,
Ishaan

Peter010103 · 2024-02-21T16:00:19Z

Hi I wanted to save my escnn model using the torch.save(model, PATH) and not the model.state_dict().

I think there are some issues of directly saving it this way due to the library using its bespoke GeometricTensor datatype. Is there a way to directly save the entire model rather than just the state dict?

kalekundert · 2024-02-21T16:33:00Z

No, there's no way to save the whole model right now. The issue is that torch.save() basically just pickles whatever you give it, and ESCNN models are not pickleable. The specific reason doesn't have anything to do with geometric tensors; it has to do with the group, representation, and gspace objects that ESCNN models contain. See #37 and #78 for attempts to fix this. It's not an easy problem.

Gabri95 closed this as completed May 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Saving and loading escnn models #1

Saving and loading escnn models #1

ishaanb92 commented May 3, 2022 •

edited

Loading

Gabri95 commented May 3, 2022

ishaanb92 commented May 3, 2022

Peter010103 commented Feb 21, 2024

kalekundert commented Feb 21, 2024

Saving and loading escnn models #1

Saving and loading escnn models #1

Comments

ishaanb92 commented May 3, 2022 • edited Loading

Gabri95 commented May 3, 2022

ishaanb92 commented May 3, 2022

Peter010103 commented Feb 21, 2024

kalekundert commented Feb 21, 2024

ishaanb92 commented May 3, 2022 •

edited

Loading