How to reproduce A3 procedure (ResNet strikes back) #1106
Hi @rwightman, I am trying to reproduce the A3 procedure from the paper "ResNet strikes back: An improved training procedure in timm". I am using timm version 0.5.2 on 4xV100 with a batch size of 512 per card. My config YAML differs slightly from _78_25-fusedlamb-cosine-lr0.00800-wd0.020000-n0-rand-m6-mstd0.5-inc1-m0.1-sd0.0-d0.0-ls0.0-100-100-resnet50-args.yaml.txt
The best top-1 accuracy I get is 76.278%. Could you help find the gap? The config YAML and train log are as follows:
Replies: 2 comments 4 replies
@EthanChen1234 did you try evaluating the best checkpoint at 224x224 with crop 0.95? Also, since the paper was published, training at 176 was found to be a better match for eval at 224 (though it increases the compute/time a bit).
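For reference, here is a rough sketch of what "eval at 224x224 crop 0.95" means for preprocessing. It assumes the usual resize-then-center-crop convention, where the image is first resized to floor(img_size / crop_pct) and then center-cropped to img_size. The helper name `eval_resize_size` is hypothetical and only for illustration; it is not a timm function.

```python
import math


def eval_resize_size(img_size: int, crop_pct: float) -> int:
    """Return the resize target used before a center crop of `img_size`.

    Assumes the resize-then-center-crop convention:
    resize to floor(img_size / crop_pct), then crop to img_size.
    """
    if not 0.0 < crop_pct <= 1.0:
        raise ValueError("crop_pct must be in (0, 1]")
    return int(math.floor(img_size / crop_pct))


if __name__ == "__main__":
    # Settings suggested in the reply: eval at 224 with crop 0.95.
    print(eval_resize_size(224, 0.95))
    # A commonly used crop fraction, shown for comparison.
    print(eval_resize_size(224, 0.875))
```

With a larger crop fraction, the image is resized less before cropping, so objects appear at a smaller apparent scale than with a smaller crop fraction. That may be why the eval crop setting matters when the model was trained at a lower resolution.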