Loading Python Exported Model into TorchSharp #585

jimquittenton · 2022-04-27T11:45:51Z

jimquittenton
Apr 27, 2022

Hi,
I'm new to TorchSharp and am having trouble loading a python trained ResNet18 model. I've been following this article: https://github.com/dotnet/TorchSharp/blob/main/docfx/articles/saveload.md and have exported my python model using the 'save_state_dict' function in this script: https://github.com/dotnet/TorchSharp/blob/main/src/Python/exportsd.py .

In TorchSharp I have copied the ResNet model from https://github.com/dotnet/TorchSharpExamples/blob/main/src/CSharp/Models/ResNet.cs and then call the following:

int numClasses = 3;
ResNet myModel = ResNet.ResNet18(numClasses);
myModel.to(DeviceType.CPU);
myModel.load(mPath);

The load() line throws an exception with message Mismatched module state names: the target modules does not have a submodule or buffer named 'conv1.weight'.

If I examine the state_dict from 'myModel' prior to load(), it contains entries like:

{[layers.conv2d-first.weight, {TorchSharp.Modules.Parameter}]}
{[layers.bnrm2d-first.weight, {TorchSharp.Modules.Parameter}]}
{[layers.bnrm2d-first.bias, {TorchSharp.Modules.Parameter}]}
{[layers.bnrm2d-first.running_mean, {TorchSharp.torch.Tensor}]}
{[layers.bnrm2d-first.running_var, {TorchSharp.torch.Tensor}]}
{[layers.bnrm2d-first.num_batches_tracked, {TorchSharp.torch.Tensor}]}
{[layers.blck-64-0.layers.blck-64-0-conv2d-1.weight, {TorchSharp.Modules.Parameter}]}
{[layers.blck-64-0.layers.blck-64-0-bnrm2d-1.weight, {TorchSharp.Modules.Parameter}]}
{[layers.blck-64-0.layers.blck-64-0-bnrm2d-1.bias, {TorchSharp.Modules.Parameter}]}

whereas the corresponding entries prior to saving from python are:

conv1.weight     torch.Size([64, 3, 7, 7])
bn1.weight       torch.Size([64])
bn1.bias         torch.Size([64])
bn1.running_mean         torch.Size([64])
bn1.running_var          torch.Size([64])
bn1.num_batches_tracked          torch.Size([])
layer1.0.conv1.weight    torch.Size([64, 64, 3, 3])
layer1.0.bn1.weight      torch.Size([64])
layer1.0.bn1.bias        torch.Size([64])

I tried amending the ResNet.cs code to reflect the python names, but could not get them to exactly match.

I also tried calling load() with strict=false myModel.load(mPath, false);. This seemed to get past the Mismatched names exception, but throws another exception with message Too many bytes in what should have been a 7 bit encoded Int32.

I've been struggling with this for a couple of days now so would really appreciate any help you guys could offer.

Thanks
Jim

NiklasGustafsson · 2022-04-29T14:17:32Z

NiklasGustafsson
Apr 29, 2022
Maintainer

Hi @jimquittenton,

Passing 'strict=false' is not a solution here, even if it didn't throw an exception (that seems like a bug to me), it would just end up with a partially loaded model, or nothing loaded at all.

A particular network architecture can be constructed many ways, and the names given to the various layers will depend on the details of how it is constructed. You will have exactly match the Python construction logic in the TorchSharp code, or the weights won't have the same names, as you have discovered.

The examples code is tricky, because it is used to construct a number of different ResNet architectures and uses loops and such to construct it. Another trickiness of the example is that it is built to serve as an example and uses the "toy" CIFAR data set. The input image size and the number of classes may not match what your ResNet18 model was trained on (if I'm stating the obvious, please excuse me).

The exception could be because the C# tensors may have different dimensions that the serialized ones, I would have to take a look at that. We may need to change the serialization format to catch issues like that, if that is indeed the case.

0 replies

NiklasGustafsson · 2022-05-16T20:24:06Z

NiklasGustafsson
May 16, 2022
Maintainer

BTW, I released 0.96.6 on Saturday morning. It has the correct ResNetNN (and a few others) architectures.

0 replies

jimquittenton · 2022-05-19T11:52:53Z

jimquittenton
May 19, 2022
Author

Thank you Niklas, I'll try this out.

…

On Mon, May 16, 2022 at 9:24 PM Niklas Gustafsson ***@***.***> wrote: BTW, I released 0.96.6 on Saturday morning. It has the correct ResNetNN (and a few others) architectures. — Reply to this email directly, view it on GitHub <#585 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHECHSGZ6QLHAWGMM6QV763VKKVHLANCNFSM5UO3RGXQ> . You are receiving this because you were mentioned.Message ID: ***@***.***>

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loading Python Exported Model into TorchSharp #585

{{title}}

Replies: 3 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

Loading Python Exported Model into TorchSharp #585

jimquittenton Apr 27, 2022

Replies: 3 comments

NiklasGustafsson Apr 29, 2022 Maintainer

NiklasGustafsson May 16, 2022 Maintainer

jimquittenton May 19, 2022 Author

jimquittenton
Apr 27, 2022

NiklasGustafsson
Apr 29, 2022
Maintainer

NiklasGustafsson
May 16, 2022
Maintainer

jimquittenton
May 19, 2022
Author