Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Number of Conv layers #2

Open
jan-pair opened this issue May 24, 2023 · 2 comments
Open

Number of Conv layers #2

jan-pair opened this issue May 24, 2023 · 2 comments

Comments

@jan-pair
Copy link

Hi, that's a mind-blowing work, but I believe there is a slight miscommunication in the paper & codebase regarding the number of Conv layers and their influence on the inference speed.

As can be seen in the released code, VanillaNet-6 actually contains eleven Conv layers - one is 4x4 Conv in the initial stem, five are 1x1 Conv-s as separate instances and five are 7x7 Depthwise Conv layers that are applied inside activation function (and run noticeably faster than 1x1 due to huge model width), which is not clear e.g. from the figure 1 (which states that there are only six conv layers total).

It could be beneficial if this was stated more explicitly in the paper

@HantingChen
Copy link
Collaborator

Sorry for the misleading. In fact, we measure the "depth" for a network using the number of its non-linear layers in its main branch instead of the number of conv layers. Therefore, the depth of VanillaNet utilizing 5 activation layers is 6 ,which is called VanillaNet-6. We will correct this misleading statement in the paper. Thanks for the nice suggestion!

@abcsimple
Copy link

abcsimple commented Aug 4, 2023

Hi, it seems the block 4096 * 7 * 7 of VanillaNet-6 in Fig 1 missed a blue cube for the 1 * 1 conv, since there are 5 1 * 1 conv stages in VanillaNet-6, and 5 activation layers will be easy to count based on 5 blue cubes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants