You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Using the IR in punet/base_ir and the latest wmma spec, on the shared/sdxl_quantized branch of IREE, I run into compile issue with the int8 quantized punet model:
Using the IR in punet/base_ir and the latest wmma spec, on the shared/sdxl_quantized branch of IREE, I run into compile issue with the int8 quantized punet model:
The dispatch:
Latest IREE commit:
Attention and matmul spec (wmma):
attention_and_matmul_spec_wmma.mlir.txt
Since we are bringing up on gfx942 this is not an immediate blocker, but filing as we'll need this fixed at some point.
The text was updated successfully, but these errors were encountered: