Commit

Update fused_moe.py
pcmoritz authored Jun 28, 2024
1 parent 4cf44a5 commit 0746b4f
Showing 1 changed file with 5 additions and 1 deletion.
6 changes: 5 additions & 1 deletion vllm/model_executor/layers/fused_moe/fused_moe.py
@@ -346,11 +346,15 @@ def fused_topk(
                                topk,
                                dtype=torch.float32,
                                device=hidden_states.device)
+    topk_ids = torch.empty(M,
+                           topk,
+                           dtype=torch.int32,
+                           device=hidden_states.device)
     token_expert_indicies = torch.empty(M,
                                         topk,
                                         dtype=torch.int32,
                                         device=hidden_states.device)
-    moe_kernels.topk_softmax(
+    ops.topk_softmax(
         topk_weights,
         topk_ids,
         token_expert_indicies,
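
Note: the hunk only shows the output buffers being allocated and passed to the kernel, not what the kernel writes into them. As a rough illustration, the sketch below assumes the kernel takes a per-token softmax over the expert logits and keeps the `topk` largest probabilities (as `topk_weights`) together with their expert indices (as `topk_ids`). That behavior is an assumption, not something this diff confirms. The function name `topk_softmax_reference` is hypothetical, plain Python lists stand in for tensors, and the `token_expert_indicies` buffer is not modeled.

```python
import math


def topk_softmax_reference(gating_logits, topk):
    """Plain-Python stand-in for the assumed kernel behavior.

    gating_logits: list of rows, one row of expert logits per token.
    Returns (topk_weights, topk_ids), each with one list of length topk per token.
    """
    topk_weights, topk_ids = [], []
    for row in gating_logits:
        # Subtract the row maximum before exponentiating for numerical stability.
        row_max = max(row)
        exps = [math.exp(x - row_max) for x in row]
        total = sum(exps)
        probs = [e / total for e in exps]
        # Expert indices by descending probability; ties go to the lower index.
        order = sorted(range(len(probs)), key=lambda i: (-probs[i], i))[:topk]
        topk_ids.append(order)
        topk_weights.append([probs[i] for i in order])
    return topk_weights, topk_ids


if __name__ == "__main__":
    weights, ids = topk_softmax_reference([[0.0, 2.0, 1.0], [3.0, 0.0, 0.0]], topk=2)
    print(ids)
    print(weights)
```

Unlike this sketch, the call in the diff returns nothing and fills preallocated buffers in place, which is why `topk_ids` has to exist before `ops.topk_softmax` is called.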