add IDGNN #2
Conversation
Force-pushed from c708da0 to c9444a2
hybridgnn/nn/encoder.py
Outdated
tf_dict: Dict[NodeType, torch_frame.TensorFrame],
) -> Dict[NodeType, Tensor]:
x_dict = {
    node_type: self.encoders[node_type](tf)[0].mean(axis=1)
Why do we take the mean here? Maybe sum is a little bit better. Or we could use a ResNet, etc.
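For context, the encoder output being reduced here has shape `[num_rows, num_cols, channels]`, so the question is which reduction to apply over the column dimension. A minimal sketch of the two options (shapes are illustrative, not taken from the PR):

```python
import torch

# Simulated per-column encoder output: [num_rows, num_cols, channels],
# standing in for the first element returned by self.encoders[node_type](tf).
num_rows, num_cols, channels = 4, 3, 8
x = torch.randn(num_rows, num_cols, channels)

# Option in the diff above: mean over the column dimension.
x_mean = x.mean(dim=1)  # shape: [num_rows, channels]

# Alternative suggested in this thread: sum over the column dimension.
# Unlike mean, sum scales with the number of columns in the table.
x_sum = x.sum(dim=1)  # shape: [num_rows, channels]

assert x_mean.shape == x_sum.shape == (num_rows, channels)
```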
Do we really want to include ResNet?
I want to compare 4 versions: IDGNN, IDGNN with ResNet, HybridGNN, HybridGNN with ResNet.
ResNet is for a single table, right? If we don't want to use ResNet, MLP https://github.com/pyg-team/pytorch-frame/blob/d81b8f7a9e0643fa553d7cb7a1343ef662fd6835/torch_frame/nn/models/mlp.py#L28 or sum may be a better choice.
Okay, I changed it to sum. Do you know why it's better to use sum? Is it because it's the best performing one in Kumo?
See comments; mostly minor suggestions.
hybridgnn/nn/encoder.py
Outdated
tf_dict: Dict[NodeType, torch_frame.TensorFrame],
) -> Dict[NodeType, Tensor]:
x_dict = {
    node_type: self.encoders[node_type](tf)[0].sum(axis=1)
Is sum the only aggregation method here? If not, maybe provide it as an argument so it is easy to change later on.
I fixed the code to support more advanced aggregations.
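One lightweight way to make the reduction configurable, as the review asks for, is to dispatch on an `aggr` string (the helper name and supported set here are illustrative, not the actual code in this PR):

```python
import torch
from torch import Tensor


def aggregate_columns(x: Tensor, aggr: str = "sum") -> Tensor:
    """Hypothetical helper: reduce a [num_rows, num_cols, channels]
    encoder output over the column dimension with a chosen aggregation,
    instead of hard-coding .sum(axis=1)."""
    if aggr == "sum":
        return x.sum(dim=1)
    if aggr == "mean":
        return x.mean(dim=1)
    if aggr == "max":
        return x.max(dim=1).values
    raise ValueError(f"Unsupported aggregation: {aggr}")


x = torch.randn(4, 3, 8)
for aggr in ("sum", "mean", "max"):
    assert aggregate_columns(x, aggr).shape == (4, 8)
```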
(channels, channels), channels, aggr=aggr)
for edge_type in edge_types
},
aggr="sum",
The input argument has aggr="mean", but here aggr is hard-coded.
I think it's intended to use sum here? cc @zechengz
I think we can use sum here for now. The aggr="mean" is used for the SAGEConv aggregation. Here the aggr has a different meaning: it aggregates the embeddings for the same node type together (if I remember correctly).
) -> Tensor:
seed_time = batch[entity_table].seed_time
x_dict = self.encoder(batch.tf_dict)
# Add ID-awareness to the root node
So a standard GNN is basically just IDGNN without this id_awareness_emb? This class could then be reused as a standard GNN without ID awareness just by making it optional, maybe as an argument?
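If that refactor were made, the ID-awareness term could be toggled by a constructor flag, something like this sketch (the class, method, and argument names are hypothetical, not the PR's actual API):

```python
import torch
from torch import nn


class ToyIDGNN(nn.Module):
    """Sketch: with id_awareness=False this degenerates to a standard GNN
    input, since no seed-node embedding is added."""

    def __init__(self, channels: int, id_awareness: bool = True):
        super().__init__()
        self.id_awareness_emb = (
            nn.Embedding(1, channels) if id_awareness else None
        )

    def mark_seed_nodes(self, x: torch.Tensor, seed_idx: torch.Tensor):
        # Add a learned embedding only to the seed (root) nodes.
        if self.id_awareness_emb is not None:
            x = x.clone()
            x[seed_idx] = x[seed_idx] + self.id_awareness_emb.weight[0]
        return x


x = torch.zeros(6, 4)
seed = torch.tensor([0, 2])
plain = ToyIDGNN(4, id_awareness=False).mark_seed_nodes(x, seed)
aware = ToyIDGNN(4, id_awareness=True).mark_seed_nodes(x, seed)
assert torch.equal(plain, x)       # standard-GNN path: unchanged
assert not torch.equal(aware, x)   # ID-aware path: seed rows shifted
```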
batch = next(iter(train_loader))
assert len(batch[task.dst_entity_table].batch) > 0
model = IDGNN(data=data, col_stats_dict=col_stats_dict, num_layers=2,
Again, the aggr is hard-coded in IDGNN, so this sum here is actually not used.
Add IDGNN model and test cases.
A slight difference is that we are not using ResNet as the encoder, only the StypeWiseEncoder.