Add ViG models [NeurIPS 2022] #1578
base: main
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
@iamhankai FYI, you can use
Hmm, it seems the tracing issue is harder to solve; just preventing trace won't bypass the bool issue without some restructuring. I'd also need to tweak some other interface issues w.r.t. other models. Trying the model out, the 'base' variant for example seems roughly on par with a Swin (v1) base for accuracy and params/FLOPs, but it runs at less than half the speed. Any way to improve the runtime performance? Have there been any weights or attempts to scale the training to larger datasets? Any interesting performance differences there vs other ViT or ViT-related hybrid archs?
We have pretrained ViG on ImageNet-22K. It performs slightly better than Swin Transformer.
As for the runtime, accelerating GNNs is an open problem.
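For context on where the extra time tends to go: every Grapher layer rebuilds a dense kNN graph over all nodes, which is quadratic in the token count. A tiny illustrative timing sketch below uses only standard PyTorch ops; the shapes and numbers are assumptions for illustration, not measurements from this thread.

```python
import time
import torch

# Illustrative cost of the per-layer kNN graph construction in a ViG-style model:
# dense pairwise distances over N tokens (O(N^2)) followed by a top-k per node.
x = torch.randn(8, 196, 384)  # batch of 14x14 patch tokens, dim 384 (assumed shapes)

start = time.time()
for _ in range(10):
    dist = torch.cdist(x, x)                      # (B, N, N) pairwise distances
    idx = dist.topk(k=9, largest=False).indices   # 9 nearest neighbours per node
elapsed_ms = (time.time() - start) / 10 * 1000
print(f"kNN graph construction: {elapsed_ms:.2f} ms per layer (CPU, illustrative)")
```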
@rwightman Hi, we released the weights from scaling the training to the larger ImageNet-22K dataset: https://github.com/huawei-noah/Efficient-AI-Backbones/releases/download/pyramid-vig/pvig_m_im21k_90e.pth It performs slightly better than the IM-22K pretrained Swin Transformer.
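For anyone wanting to try these weights with the models from this PR, a minimal loading sketch follows. The model name `pvig_m_224_gelu` is my assumption about what the PR registers, and the released checkpoint's key names may not match the timm port one-to-one, hence `strict=False`.

```python
import timm
import torch

# Hypothetical model name -- substitute whichever pyramid ViG variant this PR registers.
model = timm.create_model('pvig_m_224_gelu', pretrained=False)

# Download the IM-21K checkpoint released above and load it loosely; key names
# (and the classifier head shape) from the original repo may not match the timm port.
url = ('https://github.com/huawei-noah/Efficient-AI-Backbones/releases/download/'
       'pyramid-vig/pvig_m_im21k_90e.pth')
state_dict = torch.hub.load_state_dict_from_url(url, map_location='cpu')
missing, unexpected = model.load_state_dict(state_dict, strict=False)
print(f'missing keys: {len(missing)}, unexpected keys: {len(unexpected)}')
```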
tests/test_models.py (Outdated)
@@ -295,12 +295,6 @@ def test_model_features_pretrained(model_name, batch_size):
    """Create that pretrained weights load when features_only==True."""
    create_model(model_name, pretrained=True, features_only=True)

EXCLUDE_JIT_FILTERS = [
It seems these lines cannot be removed
I managed to fix the other JIT exceptions, so I'd rather not add more; I feel it's likely it can be supported with the appropriate type declarations, etc.
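For reference, the kind of type declaration being alluded to might look like the sketch below. This is a generic TorchScript pattern with illustrative names, not the actual code in this PR.

```python
import torch
from torch import nn


class GrapherBlock(nn.Module):
    # Declaring non-tensor attributes with concrete types (and marking constants
    # as Final) usually lets TorchScript treat them as plain ints/bools instead of
    # Tensors.  Names and fields here are illustrative only.
    k: torch.jit.Final[int]
    use_relative_pos: torch.jit.Final[bool]

    def __init__(self, dim: int, k: int = 9, use_relative_pos: bool = False):
        super().__init__()
        self.k = k
        self.use_relative_pos = use_relative_pos
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.use_relative_pos:  # scripted as a plain bool branch, not a Tensor
            x = x + 0.0            # placeholder for a relative-position term
        return self.proj(x)


# quick check that the module scripts cleanly
scripted = torch.jit.script(GrapherBlock(dim=64))
print(scripted(torch.randn(2, 16, 64)).shape)
```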
Add ViG models from paper: Vision GNN: An Image is Worth Graph of Nodes (NeurIPS 2022), https://arxiv.org/abs/2206.00272
Network architecture plays a key role in deep learning-based computer vision systems. The widely used convolutional neural network and transformer treat the image as a grid or sequence structure, which is not flexible enough to capture irregular and complex objects. In this paper, we propose to represent the image as a graph structure and introduce a new Vision GNN (ViG) architecture to extract graph-level features for visual tasks. We first split the image into a number of patches which are viewed as nodes, and construct a graph by connecting the nearest neighbors. Based on the graph representation of images, we build our ViG model to transform and exchange information among all the nodes. ViG consists of two basic modules: a Grapher module with graph convolution for aggregating and updating graph information, and an FFN module with two linear layers for node feature transformation. Both isotropic and pyramid architectures of ViG are built with different model sizes. Extensive experiments on image recognition and object detection tasks demonstrate the superiority of our ViG architecture. We hope this pioneering study of GNNs on general visual tasks will provide useful inspiration and experience for future research.
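To make the two modules concrete, here is a heavily simplified sketch of one isotropic ViG block: a kNN graph over patch nodes, a max-relative-style graph convolution in the Grapher, and a two-layer FFN. It follows the paper's description loosely and is not the implementation added by this PR; dimensions and layer names are assumptions.

```python
import torch
from torch import nn


class ViGBlock(nn.Module):
    """Simplified ViG block: Grapher (kNN graph conv) + two-layer FFN."""

    def __init__(self, dim: int = 384, k: int = 9, ffn_ratio: int = 4):
        super().__init__()
        self.k = k
        self.fc_in = nn.Linear(dim, dim)
        self.graph_fc = nn.Linear(2 * dim, dim)  # mixes node and aggregated neighbour features
        self.fc_out = nn.Linear(dim, dim)
        self.ffn = nn.Sequential(                # FFN: two linear layers with activation
            nn.Linear(dim, dim * ffn_ratio),
            nn.GELU(),
            nn.Linear(dim * ffn_ratio, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, N, C) patch/node features
        shortcut = x
        x = self.fc_in(x)

        # Build a kNN graph over nodes and gather each node's neighbours.
        dist = torch.cdist(x, x)                            # (B, N, N)
        idx = dist.topk(self.k, largest=False).indices      # (B, N, k)
        neighbours = torch.gather(
            x.unsqueeze(1).expand(-1, x.shape[1], -1, -1),  # (B, N, N, C)
            2,
            idx.unsqueeze(-1).expand(-1, -1, -1, x.shape[-1]),
        )                                                   # (B, N, k, C)

        # Max-relative-style aggregation: max over (neighbour - node) differences.
        agg = (neighbours - x.unsqueeze(2)).max(dim=2).values  # (B, N, C)
        x = self.graph_fc(torch.cat([x, agg], dim=-1))
        x = shortcut + self.fc_out(x)

        # Node feature transformation with a residual FFN.
        return x + self.ffn(x)


# quick shape check
blk = ViGBlock()
print(blk(torch.randn(2, 196, 384)).shape)  # torch.Size([2, 196, 384])
```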