Replies: 2 comments 5 replies
-
@mrT23 the inference numbers are at the default inference resolution which is 384 x 384 for the rw-s, and that is 823 in NCHW and ~1000 in NHWC. It's closer to 3000 at 224 NHWC and 2220 at NCHW. But resolution scaling is part of the EfficientNet model scaling scheme and part of the top-1 scores. EDIT: It should be noted that the score in the RBS table is applying our ResNet50 optimized recipe to other nets, training that net with the R50 recipe was awful and not at all recommended. The EfficientNetV2 small in the paper was 83.9 top-1, and my variant and train hparams were 83.8 which is not achievable with a standard ResNet until you approach 200/200-D ResNets with a bit of resolution scaling (maybe 256-288). |
Beta Was this translation helpful? Give feedback.
-
Interestingly, a few other nets I've worked with recently from same family that can also hit 83.8 top-1 (almost exactly) and compare favourably to either of these in inference throughput. All variants of RegNetZ -- essentially an efficientnet w/ grouped convs -- my D8 ( I've been doing quite a few RegNet-Y/Z (and my own 'V' pre-act variant of Y) recently, all on TPU v3 instances with my XLA branch because they train well on TPUs vs GPU where the train throughputs aren't ideal due to the mentioned issue. |
Beta Was this translation helpful? Give feedback.
-
Hi
A small issue i want to raise:
According to the paper "ResNet strikes back", EfficientNetV2-rw-S has an inference speed of 823 im/sec for achieving a score of 80.6 (Table 4).
For comparison, ResNet50 achieves 80.4% with inference speed of 2536 im/sec. So ResNet50 is ~x3 times faster, and achieves simillar accuracy.
The speed of EfficientNetV2-rw-S in the article seems low.
In my speed measurements, ResNet50 achieves 2819 im/sec (pretty similar to yours), but EfficientNetV2-rw-S achieves 2323 im/sec.
might worth re-checking and validating the speed measurement of EfficientNetV2-rw-S.
p.s.
my motivation for this whole testing was the tweet:
https://twitter.com/tanmingxing/status/1481362887272636417
it made me wonder if indeed EfficientNetV2 is a big improvement, and gives the best speed-acc tradeoff today. After digging up a bit i found out that EfficientNetV2 still couples architecture with resolution, so the comparisons in their article are not fully fair. with the same resolution and training, they don't seem to outperform ResNet50
Beta Was this translation helpful? Give feedback.
All reactions