
batch normalization #89

Open
ziqipang opened this issue Aug 1, 2019 · 4 comments

Comments


ziqipang commented Aug 1, 2019

Thanks for the excellent implementation! I have a question about batch normalization, though.

In model.py, lines 1627–1633, I see that the batch normalization layers are always set to evaluation mode during training. Could anyone explain the reason for this?

@BMG-JTIAN

Please correct me if I'm wrong: the BatchNorm layers are deliberately not trained during training. Empirically, training BatchNorm layers with a very small batch size hurts accuracy, because the per-batch statistics are too noisy; the suggested batch size for batch normalization is around 32 (see the paper "Bag of Tricks for Image Classification with Convolutional Neural Networks"). Because COCO images are large, the common batch size for Mask R-CNN is 1 or 2. So the batch normalization layers are kept in evaluation mode and simply use their pre-trained weights and running statistics.
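As a rough illustration (not the repository's exact code), freezing BatchNorm in PyTorch typically means two things: switch the layers to eval mode so they use their pre-trained running statistics instead of noisy batch statistics, and disable gradients on their affine parameters. A minimal sketch:

```python
# Sketch of freezing BatchNorm layers while the rest of the model trains;
# the model below is a toy stand-in, not the Mask R-CNN backbone.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
)

def freeze_batchnorm(module):
    """Put BatchNorm layers in eval mode and stop their gradient updates,
    so they keep pre-trained running statistics and affine weights."""
    if isinstance(module, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
        module.eval()                # use running mean/var, not batch stats
        for p in module.parameters():
            p.requires_grad = False  # do not update gamma/beta

model.train()                  # conv layers train normally...
model.apply(freeze_batchnorm)  # ...but BatchNorm stays frozen

bn = model[1]
print(bn.training)                                        # False
print(any(p.requires_grad for p in bn.parameters()))      # False
```

Note that `model.train()` must be called before (or `freeze_batchnorm` re-applied after) each switch back to training mode, since `train()` recursively flips every submodule, including the frozen BatchNorm layers.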


ziqipang commented Aug 8, 2019


Thanks! I got it.

@vincentyw95


Looking at the initialize_weights() function, it seems that the batch normalization layers have no effect. Why not just remove all the batch normalization layers?

@evinpinar

@vincentyw95 I guess they are kept to allow loading the pretrained ResNet weights, whose conv layers were learned together with the BatchNorm layers. If those weights were loaded without BN, the learned convolutions would perform suboptimally.
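To make the point above concrete: even a frozen BatchNorm layer is not a no-op. In eval mode it still normalizes its input with the learned running statistics and applies the learned affine transform, so deleting it would feed the following convolutions differently scaled activations than they were trained on. A small sketch (the statistics below are illustrative values, not real checkpoint contents):

```python
# A frozen (eval-mode) BatchNorm layer still transforms its input:
# y = (x - running_mean) / sqrt(running_var + eps) * weight + bias
import torch
import torch.nn as nn

torch.manual_seed(0)
bn = nn.BatchNorm2d(4)
# Pretend these came from a pre-trained checkpoint (made-up values).
with torch.no_grad():
    bn.running_mean.fill_(0.5)
    bn.running_var.fill_(4.0)
    bn.weight.fill_(2.0)
    bn.bias.fill_(-1.0)

bn.eval()  # frozen: uses running stats, not batch stats
x = torch.randn(1, 4, 8, 8)
y = bn(x)

# y != x, so removing the layer would change what the next conv sees.
print(torch.allclose(x, y))  # False
```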
