Learning rate adjustment when training with one GPU or two GPUs #80

Open
Young0111 opened this issue Apr 1, 2021 · 5 comments

@Young0111

Young0111 commented Apr 1, 2021

Thanks for your code.

When I trained this code with two GPUs (Tesla P4), changing IMS_PER_BATCH to 4 by running:
python projects/SparseRCNN/train_net.py
--config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml
--num-gpus 2 SOLVER.IMS_PER_BATCH 4

at iteration 7319 the saved model has an AP of 3.915, which differs from the 11.440 in your log.

Referring to Detectron, I adjusted the learning rate to 0.0025 by running:
python projects/SparseRCNN/train_net.py
--config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml
--num-gpus 2 SOLVER.IMS_PER_BATCH 4 SOLVER.BASE_LR 0.0025

and also tried a single GPU by running:
python projects/SparseRCNN/train_net.py
--config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml
--num-gpus 1 SOLVER.IMS_PER_BATCH 2 SOLVER.BASE_LR 0.0025

but the result was worse.

Later I found that Detectron uses SGD while you use AdamW, so maybe 0.0025 is not applicable.
So I would like to know whether I need to adjust some parameters, and if so, how to adjust them.
Thanks.
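
For reference, the arithmetic I had in mind is just the linear scaling rule; whether it even carries over from SGD to AdamW is exactly my question, so this is only a sketch:

# Sketch of the linear LR scaling rule; treating it as valid for AdamW
# is an assumption, which is what this issue is asking about.
def scale_lr(base_lr, base_batch, new_batch):
    # Scale the learning rate in proportion to the total batch size.
    return base_lr * new_batch / base_batch

# Detectron's SGD baselines use BASE_LR 0.02 at IMS_PER_BATCH 16,
# which is where 0.0025 for a 2-image batch comes from:
print(scale_lr(0.02, 16, 2))      # 0.0025

# This repo's AdamW default is BASE_LR 0.000025 at IMS_PER_BATCH 16,
# so the same rule would give a much smaller value for a 4-image batch:
print(scale_lr(0.000025, 16, 4))  # 6.25e-06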

@iFighting
Collaborator

You should set BASE_LR to 0.000025 instead of 0.0025.

@Young0111
Author

When I first ran it, the learning rate was the default, i.e. 0.000025, and I got a lower AP than in the author's log; only then did I change the learning rate.

@iFighting
Collaborator

When I first ran it, the learning rate was the default, i.e. 0.000025, and I got a lower AP than in the author's log; only then did I change the learning rate.

Maybe you need a smaller lr than 0.000025.
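
For example, linearly scaling the default by the batch size (0.000025 * 4 / 16 = 0.00000625) would give, purely as an untested illustration:

python projects/SparseRCNN/train_net.py
--config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml
--num-gpus 2 SOLVER.IMS_PER_BATCH 4 SOLVER.BASE_LR 0.00000625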

@Young0111
Author

OK, thanks for your reply, I will try it now.

@lingl-space

So, what changes did you make? I guess I need to set the lr to 0.000025 * 1/8 and make the total number of iterations 8 times larger? However, as the number of iterations increases, the overall training time becomes extremely long. A rough sketch of what I have in mind is below.
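
For concreteness, this is the adjustment I am guessing at, assuming the usual detectron2 3x-schedule values (MAX_ITER 270000, STEPS 210000/250000) and treating linear LR scaling as only a heuristic:

# Rough sketch: batch shrinks from 16 to 2, i.e. a factor of 8.
ref_batch, new_batch = 16, 2
factor = ref_batch // new_batch              # 8

base_lr  = 0.000025                          # default at IMS_PER_BATCH 16
max_iter = 270000                            # assumed 3x-schedule default
steps    = (210000, 250000)                  # assumed 3x-schedule LR drops

scaled_lr    = base_lr / factor              # 3.125e-06
scaled_iter  = max_iter * factor             # 2,160,000 iterations
scaled_steps = tuple(s * factor for s in steps)

# The total number of images processed stays the same; the run just has
# 8x more, smaller iterations, and small batches may use the GPU less
# efficiently, which is why the wall-clock time balloons.
print(scaled_lr, scaled_iter, scaled_steps)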
