[aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication #1139

jamestcl-amd · 2024-03-19T21:03:25Z

This PR includes the e2e tests for i8xi8, i16xi16, bf16xbf16 elementwise multiplication going through the to-llvm flow.

Changes:

Add mul/srs/broadcast/concat/ext/undef intrinsics to XLLVM. The op names would need a better naming convention since more intrinsics are going to be added to the XLLVM.
Add aievec-to-llvm conversion pattern for the new intrinsics.
Add a FoldAIECastOps pattern in AIEVecToLLVM pass. This pattern folds all the aievec.cast ops that are introduced in the vector->aievec pass. Taking this shortcut, we do not need to conditionally introduce aievec.cast ops in the vector->aievec conversion patterns.
Add op folder to aievec.cast op, and remove the FoldAIECastOps pattern originally introduced in the AIEVecOptimizations pass.
Add i8xi8_mul_elem, i8xi8_mul_elem_2, i16xi16_mul_elem, i16xi16_mul_elem_2, bf16xbf16_mul_elem, bf16xbf16_mul_elem_2 e2e tests for the to-llvm flow. This includes updating the testbench.cc and the test script. These tests, like other to-cpp tests, go through the simulator to verify the numeric correctness.
Add aievec-to-llvm conversion tests for all the new XLLVM ops.
Add target llvm translation tests.

jsetoain

Fantastic job so far, James. Thank you! 🙂

lib/Conversion/AIEVecToLLVM/AIEVecToLLVM.cpp

lib/Dialect/AIEVec/Transforms/AIEVecOptimizations.cpp

include/aie/Dialect/XLLVM/IR/XLLVMAIE2IntrOps.td

jamestcl-amd · 2024-04-02T04:52:06Z

@david-vc @jsetoain this PR is ready for review. Thanks.

jsetoain

In the future, you may want to split these into a bunch of separate patches, one per added intrinsic, for instance. Quicker turn-around, and it's easier to review.

It looks good to me, a lot of great work. Thanks, James!

jsetoain · 2024-04-03T09:45:14Z

lib/Conversion/AIEVecToLLVM/AIEVecToLLVM.cpp

        std::stringstream ss;
        ss << "llvm.aie." << getVectorTypeString(resultType) << ".undef";


This also needs to be replaced with an XLLVM intrinsic. In another patch, but this definitely needs clean-up.

Will clean up this in the next PR.

lib/Dialect/AIEVec/IR/AIEVecOps.cpp

david-vc

LGTM

david-vc · 2024-04-03T15:36:01Z

lib/Conversion/AIEVecToLLVM/AIEVecToLLVM.cpp

+  }
+
+  LogicalResult
+  matchAndRewrite(aievec::MulElemOp op, OpAdaptor adaptor,


@jamestcl-amd , @jsetoain : This function is not doing any checking on the number of vector lanes or alternatively the total number of bits of the operands or result. Looking at

mlir-aie/lib/Dialect/AIEVec/IR/AIEVecOps.cpp

Line 903 in 11822b0

ParseResult parseMulFMAElemOp(OpAsmParser &parser, OperationState &result,

I see that this checks consistency, but not the actual number of lanes. So, which piece of are we relying on checking that the number of lanes is correct?

…ngly

jamestcl-amd · 2024-04-03T23:04:42Z

New updates before merging this PR:

Separate the to-llvm and to-cpp tests into two mlir test files.
Add // REQUIRES: peano to the to-llvm tests. In this way, CI regression can correctly mark them unsupported (for now).
Address review comments on aievec.cast folder and update the test accordingly.
Address review comments to the testbench.cc

jamestcl-amd added 9 commits March 19, 2024 09:54

Add lowering for i16xi16_mul_elem

7bdd53e

Add to-llvm tests for i16xi16_mul_elem

f20111f

Update the srs conversion test

e602aef

Add todo comments

2ffac16

Minor cleanups

c7479a1

Add conanicalization pattern for aievec.cast op in to-llvm flow

600fd5c

Add to-llvm flow in i16xi16_mul_elem_2 test

661e222

Add todo comments

091f86d

Add mul_elem aievec to llvm conversion test

f3bab4a

jamestcl-amd requested review from jsetoain and david-vc March 19, 2024 21:03

jsetoain reviewed Mar 20, 2024

View reviewed changes

jamestcl-amd added 18 commits March 20, 2024 13:18

Add XLLVM->LLVM conversion test

80f3894

Add broadcast/ext/concat/undef to xllvm

cb080e0

Add aievec op lowering to xllvm for i8xi8_mul_elem

78d9d0e

Add e2e test for i8xi8_mul_elem

200fdf0

Add aievec srs op lowering to xllvm for i8xi8_mul_elem_2

6d771db

Add e2e test for i8xi8_mul_elem_2

44e9cd2

Update e2e test i8xi8_mul_elem

6b2c78d

Update e2e test i16xi16_mul_elem

edd8761

Update e2e test i8xi8_mul_elem

22f0f0e

Update e2e test i8xi8_mul_elem

1894f29

Update e2e test i8xi8_mul_elem

7ddf55e

Add aievec op lowering to xllvm for bf16xbf16_mul_elem

52f2285

Add e2e test for bf16xbf16_mul_elem

078b75f

Add e2e test for bf16xbf16_mul_elem_2

21fada4

Add aievec.srs conversion test

3697b6c

Add aievec.mul_elem conversion test

20ddbf0

Add aievec.broad_scalar conversion test

0835701

Add aievec.concat conversion test

d0b1b12

jamestcl-amd added 6 commits April 1, 2024 10:06

Add aievec.ext conversion test

e4f63d6

Add all new xllvm ops to translation test

a233288

Reorganize the XLLVM ops

f47a26d

Move foldcast op ppatter no the AIEVecToLLVM pipeline

65cedcb

Add folder to aievec.cast op and the lit tests

6009405

minor clean up to bf16xbf16_mul_elem tests

4a15a8d

jamestcl-amd changed the title ~~[aievec] supporting to-llvm flow for aievec.mul_elem and aievec.srs ops~~ [aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication Apr 2, 2024

jamestcl-amd requested a review from jsetoain April 2, 2024 04:49

jamestcl-amd marked this pull request as ready for review April 2, 2024 04:49

jamestcl-amd added 2 commits April 2, 2024 15:22

Merge branch 'main' into mul_elem_to_llvm

66628dc

Merge remote-tracking branch 'origin/main' into mul_elem_to_llvm

9789e40

jsetoain reviewed Apr 3, 2024

View reviewed changes

david-vc reviewed Apr 3, 2024

View reviewed changes

jamestcl-amd added 4 commits April 3, 2024 14:38

Merge remote-tracking branch 'origin/main' into mul_elem_to_llvm

774beeb

Refactor test structure to make CI happy

93271a3

Address review comment on the aievec.cast and update the test accordi…

0756607

…ngly

Refactor more tests to make CI regression happy

a980b20

jamestcl-amd merged commit 7f1b684 into Xilinx:main Apr 3, 2024
54 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication #1139

[aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication #1139

jamestcl-amd commented Mar 19, 2024 •

edited

Loading

jsetoain left a comment

jamestcl-amd commented Apr 2, 2024

jsetoain left a comment

jsetoain Apr 3, 2024

jamestcl-amd Apr 3, 2024

david-vc left a comment

david-vc Apr 3, 2024

jamestcl-amd commented Apr 3, 2024

		std::stringstream ss;
		ss << "llvm.aie." << getVectorTypeString(resultType) << ".undef";

[aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication #1139

[aievec] to-llvm flow for i8xi8, i16xi16, bf16xbf16 elementwise multiplication #1139

Conversation

jamestcl-amd commented Mar 19, 2024 • edited Loading

jsetoain left a comment

Choose a reason for hiding this comment

jamestcl-amd commented Apr 2, 2024

jsetoain left a comment

Choose a reason for hiding this comment

jsetoain Apr 3, 2024

Choose a reason for hiding this comment

jamestcl-amd Apr 3, 2024

Choose a reason for hiding this comment

david-vc left a comment

Choose a reason for hiding this comment

david-vc Apr 3, 2024

Choose a reason for hiding this comment

jamestcl-amd commented Apr 3, 2024

jamestcl-amd commented Mar 19, 2024 •

edited

Loading