Add a unit test for MoE layer. #7069

qihqi · 2024-05-15T23:12:10Z

No description provided.

alanwaketan · 2024-05-19T19:20:08Z

experimental/torch_xla2/test/moe/model.py

+        self.w3 = nn.Parameter(torch.empty(config.num_experts, config.intermediate_size, config.dim))
+
+    def forward(self, x: Tensor, expert_indices: Tensor) -> Tensor:
+        w1_weights = self.w1[expert_indices] # [T, A, D, D]


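The memory consumption here seems pretty high. Indexing `self.w1` with `expert_indices` looks like it gathers a separate copy of the selected expert weights for every token, so you end up with num_token duplicated weights. Won't that introduce extra storage? Or are you only considering inference, where T is just 1?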
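To put rough numbers on the concern, here is a back-of-the-envelope sketch in plain Python. It assumes the gathered tensor is fully materialized with shape [T, A, intermediate_size, dim], which is what eager advanced indexing does; a compiler may fuse the gather away. The helper names and the config values (8 experts, 2 activated experts per token, intermediate_size 14336, dim 4096, 2-byte elements) are hypothetical and are not taken from this PR.

```python
def expert_weight_bytes(num_experts, intermediate_size, dim, bytes_per_element=2):
    """Bytes held by one stacked expert parameter of shape [E, intermediate_size, dim]."""
    return num_experts * intermediate_size * dim * bytes_per_element


def gathered_weight_bytes(num_tokens, num_activated, intermediate_size, dim,
                          bytes_per_element=2):
    """Bytes needed if w[expert_indices] is materialized as [T, A, intermediate_size, dim]."""
    return num_tokens * num_activated * intermediate_size * dim * bytes_per_element


# Hypothetical config values, chosen only for illustration.
E, A, I, D = 8, 2, 14336, 4096

stored = expert_weight_bytes(E, I, D)          # the parameter itself
decode = gathered_weight_bytes(1, A, I, D)     # T = 1 (single-token decoding)
prefill = gathered_weight_bytes(128, A, I, D)  # T = 128 (e.g. a prompt or a batch)

# With T = 1 the gather copies only A of the E experts, so it is smaller than
# the parameter. The copy grows linearly with T and exceeds the parameter
# itself once T * A > E.
for name, size in [("stored", stored), ("T=1", decode), ("T=128", prefill)]:
    print(f"{name}: {size / 2**20:.0f} MiB")
```

Under these assumptions the per-token gather is cheap for single-token decoding but scales linearly with the number of tokens, which is the case the comment above asks about.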

Add a unit test for MoE layer.

8f70982

qihqi requested a review from lsy323 May 15, 2024 23:12

lsy323 approved these changes May 16, 2024


lsy323 merged commit 8247aec into master May 16, 2024
3 checks passed

alanwaketan reviewed May 19, 2024


zpcore pushed a commit that referenced this pull request May 20, 2024

Add a unit test for MoE layer. (#7069)

bce82d5
