Error while Loading vit_small weights #26

shubhaminnani · 2023-05-31T14:04:28Z

Hi @Xiyue-Wang ,
Thank you for the amazing repo.
I am trying to load the weights for MoCoV3 vit_small pretrained weights.

model = moco.builder_infence.MoCo_ViT(partial(vits.__dict__['vit_small'], stop_grad_conv1=True))
pretext_model = torch.load('TransPath/vit_small.pth.tar')['state_dict']
model = nn.DataParallel(model).cuda()
model.load_state_dict(pretext_model,strict=False)

But facing an error as below

I checked the keys for model and weights and seems to be same, but still above error.

Looking forward.
Thanks!

The text was updated successfully, but these errors were encountered:

Xiyue-Wang · 2023-06-01T02:45:12Z

you can try
model = moco.builder_infence.MoCo_ViT(
partial(vits.dict[args.arch], stop_grad_conv1=True))

pretext_model = torch.load(r'./vit_small.pth.tar')['state_dict']
model = nn.DataParallel(model).cuda()
model.load_state_dict(pretext_model, strict=True)?
Many people use is no problem, I do not know why you have an error

shubhaminnani · 2023-06-01T14:25:25Z

thank you, but dict can't be defined like above as you mentioned.

Even though I tried, it gave below Error AttributeError: module 'vits' has no attribute 'dict'

Thanks

Xiyue-Wang · 2023-06-01T14:36:17Z

may be remove model = nn.DataParallel(model).cuda()?

shubhaminnani · 2023-06-01T15:26:25Z

Tried by removing that code, complete keys are a mismatch here. Not able to load any keys for that.

shubhaminnani · 2023-06-01T15:36:06Z

Do I need to use model.module.online_encoder.net.head = nn.Identity() similar kind of code from TransPath to extract the features?

shubhaminnani · 2023-06-01T20:55:13Z

Hi @Xiyue-Wang ,
I think I found the problem. The model you trained was with moco.builder and while doing inference you are trying to call the module from moco.builder_infence where the in forward function below code is commented out. Training weights have keys from the former module and they dont match in the moco.builder_infence.

As understanding the code, it seems feature extraction should be done from base encoder only rather the complete architecture, so for that need to find a solution. Better load the complete weights and truncate the model after that. Whats your view?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error while Loading vit_small weights #26

Error while Loading vit_small weights #26

shubhaminnani commented May 31, 2023 •

edited

Loading

Xiyue-Wang commented Jun 1, 2023 •

edited

Loading

shubhaminnani commented Jun 1, 2023

Xiyue-Wang commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

Error while Loading vit_small weights #26

Error while Loading vit_small weights #26

Comments

shubhaminnani commented May 31, 2023 • edited Loading

Xiyue-Wang commented Jun 1, 2023 • edited Loading

shubhaminnani commented Jun 1, 2023

Xiyue-Wang commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

shubhaminnani commented Jun 1, 2023

shubhaminnani commented May 31, 2023 •

edited

Loading

Xiyue-Wang commented Jun 1, 2023 •

edited

Loading