Commit
#0: Properly support trivial single core case for 1D matmuls
- For mcast in1, only mcast to cores with work (similar to mcast in0)
- For single core, skip receiver kernel setup
- TODO: For sharded in0, K must be divisible by in0_block_w due to separate bugs:
  * For mcast in0, the sharded reader doesn't support turning off mcast if it's single core
    ** Bug here is that we do a regular mcast, which is mcasting to 0 cores...
  * For mcast in1, the sharded reader doesn't support slicing along shard width
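
The behaviour described in this message can be illustrated with a minimal sketch. This is not the code from the commit: every name below (`McastPlan`, `plan_1d_mcast`, `sharded_in0_k_ok`) is hypothetical, and the sketch assumes that one of the cores with work acts as the sender while the remaining cores with work are receivers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class McastPlan:
    """Hypothetical summary of how a 1D matmul would set up multicast."""
    num_cores_with_work: int
    num_receivers: int
    use_mcast: bool
    setup_receiver_kernel: bool


def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up."""
    return -(-a // b)


def plan_1d_mcast(num_output_blocks: int, blocks_per_core: int, grid_cores: int) -> McastPlan:
    """Restrict mcast to cores that have work; skip receivers in the single-core case.

    Assumption: one core with work is the sender, the rest are receivers.
    """
    if num_output_blocks <= 0 or blocks_per_core <= 0 or grid_cores <= 0:
        raise ValueError("all arguments must be positive")
    cores_with_work = ceil_div(num_output_blocks, blocks_per_core)
    if cores_with_work > grid_cores:
        raise ValueError("work does not fit on the available grid")
    # Idle cores in the grid are never counted as mcast destinations.
    num_receivers = cores_with_work - 1
    single_core = num_receivers == 0
    return McastPlan(
        num_cores_with_work=cores_with_work,
        num_receivers=num_receivers,
        # With a single core there is nobody to mcast to, so no mcast is issued
        # and no receiver kernel is set up.
        use_mcast=not single_core,
        setup_receiver_kernel=not single_core,
    )


def sharded_in0_k_ok(k_tiles: int, in0_block_w: int) -> bool:
    """The TODO's restriction for sharded in0: K must be divisible by in0_block_w."""
    return k_tiles % in0_block_w == 0
```

In this sketch, `plan_1d_mcast(1, 1, 64)` describes the trivial single-core case: it reports zero receivers and disables both the mcast and the receiver kernel setup, instead of issuing an mcast to 0 cores.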
1 parent 7ab17bd, commit bc19f9c
Showing 2 changed files with 161 additions and 98 deletions.