-
Notifications
You must be signed in to change notification settings - Fork 62
BN/Affine layer related issues #45
Comments
"I wonder, are these params related?" NONLOCAL.USE_BN and NONLOCAL.USE_AFFINE means slightly different things. Please see "According to my understanding, if MODEL.USE_AFFINE=False (which means using SpatialBN), then CHECKPOINT.CONVERT_MODEL should be set as False. "I see SpatialBN is 2d BN, can it be used in the model with 3d convolution?" "If I want to finetue this converted model, should I finetune it with MODEL.USE_AFFINE True or False?" "I ported a network from Pytorch to Caffe2 and converted the Pytorch version weight file to Caffe2 version weight, however, I cannot get the same result" |
Hi, @chaoyuaw , it's very nice of you to respond to my questions, thank you very much. I have another question though, MODEL:
USE_BN: True
USE_AFFINE: False
CHECKPOINT:
CONVERT_MODEL: False how should I set these 2 NONLOCAL:
USE_BN: False or True?
USE_AFFINE: True or False? Are Thanks! |
If your original model uses a BN layer in NL and you don't want to freeze it, you set I recommend taking a look at |
Thanks @chaoyuaw , I understand, thank you. |
Hi @chaoyuaw, sorry to bother you, I have some confusions about SpatialBN layer in this repo.
I see in config files, these params are set as
I wonder, are these params related?
According to my understanding, if
MODEL.USE_AFFINE=False
(which means using SpatialBN), thenCHECKPOINT.CONVERT_MODEL
should be set asFalse
.Is my understanding right?
I ported a network from Pytorch to Caffe2 and converted the Pytorch version weight file to Caffe2 version weight, however, I cannot get the same result as in Pytorch version from the converted weight file. (The pytorch model is trained with 3d BN)
I suppose this have something to do with BatchNorm operations.
I see SpatialBN is 2d BN, can it be used in the model with 3d convolution?
If I want to finetue this converted model, should I finetune it with
MODEL.USE_AFFINE
True
orFalse
?Thanks!
The text was updated successfully, but these errors were encountered: