[aievec] to-llvm/to-cpp flow for emulated fp32xfp32 mul elem #1239

jamestcl-amd · 2024-04-12T22:11:43Z

This PR adds support for the emulated fp32xfp32 elementwise multiplication in both the to-llvm and to-cpp flows.

Changes:

Add some of the upd/msc intrinsics to XLLVM.
Enable vector->aievec support for the f32xf32 mul_elem and add the corresponding conversion test.
Add aievec-to-llvm conversion pattern for the emulated f32xf32 elementwise multiplication.
Add aievec-to-llvm conversion tests for the newly added XLLVM ops.
Add target llvm translation tests.
Add f32xf32_mul_elem e2e tests for the to-llvm and to-cpp flow. This includes the testbench.cc and the test script.
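
For readers unfamiliar with the idea of "emulated" fp32 multiplication, here is a minimal scalar sketch. It assumes the emulation splits each fp32 operand into three bf16-precision terms and accumulates the partial products in fp32; the PR text above does not spell out the scheme, so treat this as an illustration of the general technique rather than the exact sequence of intrinsics the conversion pattern emits. All names (`truncate_to_bf16`, `split_fp32`, `emulated_mul`, `relative_error`) are hypothetical, and plain C++ floats stand in for the AIE vector types.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Keep only the top 16 bits of an fp32 value, i.e. truncate it to bf16
// precision. Assumption: truncation rather than round-to-nearest-even,
// which keeps the sketch short and makes the split below exact.
static float truncate_to_bf16(float x) {
  std::uint32_t bits;
  std::memcpy(&bits, &x, sizeof(bits));
  bits &= 0xFFFF0000u;
  std::memcpy(&x, &bits, sizeof(bits));
  return x;
}

// Three bf16-precision terms whose sum reconstructs the fp32 input.
struct Bf16Parts {
  float hi, mid, lo;
};

static Bf16Parts split_fp32(float x) {
  Bf16Parts p;
  p.hi = truncate_to_bf16(x);
  p.mid = truncate_to_bf16(x - p.hi);
  p.lo = truncate_to_bf16(x - p.hi - p.mid);
  // Sanity check; holds for finite, normal inputs (24 bits = 3 x 8 bits).
  assert(p.hi + p.mid + p.lo == x);
  return p;
}

// Emulated product: nine bf16 x bf16 partial products accumulated in fp32,
// smallest terms first to limit rounding error in the accumulator.
static float emulated_mul(float a, float b) {
  const Bf16Parts x = split_fp32(a);
  const Bf16Parts y = split_fp32(b);
  float acc = x.lo * y.lo;
  acc += x.lo * y.mid;
  acc += x.mid * y.lo;
  acc += x.lo * y.hi;
  acc += x.hi * y.lo;
  acc += x.mid * y.mid;
  acc += x.mid * y.hi;
  acc += x.hi * y.mid;
  acc += x.hi * y.hi;
  return acc;
}

// Relative error of an emulated result against a reference value.
static float relative_error(float got, float want) {
  return std::fabs(got - want) / std::fabs(want);
}
```

Calling `emulated_mul(a, b)` and comparing it with the native `a * b` through `relative_error` gives a quick feel for how close the split-and-accumulate approach gets; dropping the smallest partial products trades accuracy for fewer multiply-accumulate steps.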

jsetoain · 2024-04-16T14:34:58Z

include/aie/Dialect/XLLVM/IR/XLLVMAIE2IntrOps.td

+    Arguments<(ins VectorOfLengthAndType<[16], [I64]>:$a,
                   I32:$shft,
                   I32:$sign)>;

 def I256V32Acc32SrsIntrOp :
    AIEVec2_IntrOp<"I256.v32.acc32.srs",
        [TypeIs<"res", VectorOfLengthAndType<[32], [I8]>>]>,
-    Arguments<(ins VectorOfLengthAndType<[16], [I64]>:$lhs,
+    Arguments<(ins VectorOfLengthAndType<[16], [I64]>:$a,
                   I32:$shft,
                   I32:$sign)>;

 def I512V16Acc64SrsIntrOp :
    AIEVec2_IntrOp<"I512.v16.acc64.srs",
        [TypeIs<"res", VectorOfLengthAndType<[16], [I32]>>]>,
-    Arguments<(ins VectorOfLengthAndType<[16], [I64]>:$lhs,
+    Arguments<(ins VectorOfLengthAndType<[16], [I64]>:$a,
                   I32:$shft,
                   I32:$sign)>;

 def Vector16AccFloatToV16BF16IntrOp :
    AIEVec2_IntrOp<"v16accfloat.to.v16bf16",
        [TypeIs<"res", VectorOfLengthAndType<[16], [BF16]>>]>,
-    Arguments<(ins VectorOfLengthAndType<[8], [I64]>:$lhs)>;
+    Arguments<(ins VectorOfLengthAndType<[8], [I64]>:$a)>;


Nit: Can we find better names for the arguments than "a"? I also don't understand why the "shift" parameter can't just be named "shift" instead of "shft".

I tried to follow the naming convention from the definitions of the C++ intrinsics. Here are some examples.

INTRINSIC(v16accfloat) msc_elem_16_2(v32bfloat16 a, v32bfloat16 b, v16accfloat acc1) {
INTRINSIC(v16accfloat) mac_elem_16_2(v32bfloat16 a, v32bfloat16 b, v16accfloat acc1) {
INTRINSIC(v32bfloat16) insert(v32bfloat16 a, int idx, v16bfloat16 b) {
INTRINSIC(v16accfloat) ups_to_v16accfloat(v16bfloat16 a) {
INTRINSIC(v32int16) lsrs(v32acc32 acc, int shft, int sign) {

But these are not C++ intrinsics; they are LLVM IR intrinsics, and we should do better than "a/b/c" anyway.

jsetoain · 2024-04-16T14:35:50Z

include/aie/Dialect/XLLVM/IR/XLLVMAIE2IntrOps.td

+class AIE2bf16MACConf : 
+    Arguments<(ins VectorOfLengthAndType<[32], [BF16]>:$lhs,
+                   VectorOfLengthAndType<[32], [BF16]>:$rhs,
+                   VectorOfLengthAndType<[8], [I64]>:$acc,
+                   I32:$conf)>;
+


test/unit_tests/aievec_tests/floatxfloat_mul_elem/floatxfloat_mul_elem-peano.mlir

test/unit_tests/aievec_tests/floatxfloat_mul_elem/floatxfloat_mul_elem.mlir

jamestcl-amd added 8 commits April 11, 2024 13:57

Add support to the fp32 mul_elem

21f3c65

Add e2e unit test for fp32xfp32 mul_elem

ffd7043

Conversion pattern for emulated FP32 mul elem

0d4951d

e2e test for to-llvm flow

e0f96b9

Add additional comment to explain how the emulation works

f79e17e

Add translation test for new XLLVM ops

49a18a7

Add aievec->llvm conversion test

1026de4

Add vector->aievec conversion test

9ec3702

jamestcl-amd requested review from david-vc and jsetoain April 12, 2024 22:11

jsetoain approved these changes Apr 16, 2024

jamestcl-amd added 4 commits April 16, 2024 11:38

Address reveiw comment

302d659

Rename suffices from -peano to -llvm for tests

de782a8

Merge remote-tracking branch 'origin/main' into fp32_mul_elem

eaa976d

Merge remote-tracking branch 'origin/main' into fp32_mul_elem

2a2a814

jamestcl-amd merged commit de9e2dc into Xilinx:main Apr 16, 2024
54 checks passed
