Skip to content

Commit

Permalink
benchd: inputs: graph: rewrite tests w smaller shapes for bf16
Browse files Browse the repository at this point in the history
  • Loading branch information
h-sadia committed Jan 10, 2025
1 parent 486bfa0 commit 7696807
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions tests/benchdnn/inputs/graph/complex_fusion/harness_mha_all
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,8 @@
--reset --dt=f32,bf16,f16 --in-shapes=0:32x16x128x64+1:32x16x128x64+5:32x16x128x128+8:32x16x128x64 --case=complex_fusion/mha/sdpa-plain-simplified-f16.json
--reset --dt=f32,bf16,f16 --in-shapes=0:acbd+1:acbd+8:acbd --case=complex_fusion/mha/sdpa-plain-simplified-f16.json
--reset --dt=f32,bf16,f16 --in-shapes=3:384,3:384x384,3:1x16x384x384 --case=complex_fusion/mha/sdpa-plain-scale-by-mul-f16.json
--reset --dt=bf16 --in-shapes=0:1x1x16x64+1:1x1x16x64+8:1x1x16x64+5:1x1x1x16 --case=complex_fusion/mha/sdpa-plain-simplified-f16.json
--reset --dt=bf16 --in-shapes=0:1x1x16x32+1:1x1x16x32+9:1x1x16x32+4:1x1x1x1 --case=complex_fusion/mha/sdpa-plain-wo-mask-f16.json

# Re-written int8 graphs
--reset --in-shapes=5:4x16x32x256+4:4x16x256x33+0:4x16x33x256+1:4x1x1x33+3:4x1x32x33 --case=complex_fusion/mha/MHA-GPT-inf-int8-bs1.json
Expand Down

0 comments on commit 7696807

Please sign in to comment.