Canonicalization of TTIR ops #1670

Open · wants to merge 23 commits into main
Conversation

azecevicTT (Contributor):

Canonicalization can be roughly divided into three forms:

  1. Traits (properties) of some ops that allow them to be folded (Involution, Idempotence)
  2. Per op foldings
  3. Per op canonicalization (pattern rewriters)

The first one allows us to declaratively add folding for a large class of ops. While traits like Involution already exist in the MLIR infrastructure, they don't account for DPS, so I added a new one that takes care of it.
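To make the idea concrete, here is a rough sketch of what such a trait-driven fold boils down to. The names are illustrative and operand 0 is assumed to be the data input (with the DPS output coming after it); this is not the exact trait added in this PR:

```cpp
#include "mlir/IR/OpDefinition.h"

// Illustrative sketch only: the core of an involution fold that a DPS-aware
// trait can share across ops. ConcreteOp stands for any unary TTIR op marked
// as an involution; operand 0 is assumed to be the data input.
template <typename ConcreteOp>
static mlir::OpFoldResult foldInvolutionSketch(ConcreteOp op) {
  // op(op(x)) == x: if the input is produced by the same kind of op,
  // fold straight to that producer's input.
  if (auto producer = op->getOperand(0).template getDefiningOp<ConcreteOp>())
    return producer->getOperand(0);
  return {};
}
```

The idempotence case is analogous, except the fold returns the producer's result rather than its input.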
The second one lets us define simple graph rewrites in which the value produced by an op is replaced with some existing value (or a constant).
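As an example of this second form, a no-op fold in the spirit of the broadcast folder from this PR could look roughly like the sketch below. It is written against the generic Operation API for illustration; the real folder lives in the op's fold hook and uses its generated accessors:

```cpp
#include "mlir/IR/OpDefinition.h"

// Hedged sketch of a "no-op" fold: if the op doesn't actually change the
// type/shape of its input, its result can be replaced by the input value.
// A null OpFoldResult means "no fold applies".
static mlir::OpFoldResult foldNoopSketch(mlir::Operation *op) {
  mlir::Value input = op->getOperand(0);
  mlir::Value result = op->getResult(0);
  if (input.getType() == result.getType())
    return input;
  return {};
}
```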
The third one gives us the most freedom: we can perform arbitrary graph rewrites, and we typically use it when a new op has to be created during the rewrite.
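For this third form, the patterns in this PR (e.g. for PermuteOp or ReverseOp) follow the usual MLIR OpRewritePattern shape; the skeleton below only shows that shape, with SomeTTIROp as a placeholder rather than a real op:

```cpp
#include "mlir/IR/PatternMatch.h"

// Skeleton of a per-op canonicalization pattern; SomeTTIROp is a placeholder.
struct SomeTTIROpCanonicalization
    : public mlir::OpRewritePattern<SomeTTIROp> {
  using mlir::OpRewritePattern<SomeTTIROp>::OpRewritePattern;

  mlir::LogicalResult
  matchAndRewrite(SomeTTIROp op,
                  mlir::PatternRewriter &rewriter) const override {
    // 1. Match: inspect the op and its producers, and bail out with
    //    failure() if the rewrite does not apply.
    // 2. Rewrite: build the replacement op(s) through `rewriter` and finish
    //    with rewriter.replaceOp(op, newValues).
    return mlir::failure();
  }
};
```

Such patterns are registered through the op's getCanonicalizationPatterns hook, which the canonicalize pass picks up automatically.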

I added a canonicalize pass both before and after ttir-to-ttir-decomposition-pass in ttir-to-ttnn-backend-pipeline; that pass performs many graph rewrites, so we might benefit from canonicalization both before and after it.
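Schematically, the intended ordering looks like the sketch below (the decomposition pass is only indicated with a comment, since this is a sketch of the placement rather than the actual pipeline code):

```cpp
#include "mlir/Pass/PassManager.h"
#include "mlir/Transforms/Passes.h"

// Sketch of the intended pass ordering inside ttir-to-ttnn-backend-pipeline.
void buildPipelineOrderingSketch(mlir::OpPassManager &pm) {
  pm.addPass(mlir::createCanonicalizerPass());
  // ... ttir-to-ttir-decomposition-pass runs here ...
  pm.addPass(mlir::createCanonicalizerPass());
  // ... remaining lowering passes ...
}
```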

I plan to cover one big part of canonicalization in the future, and that's constant folding. I will also write a short document on adding canonicalizations for new and existing ops. While this PR covers a lot of patterns, it's definitely not an exhaustive list. The MLIR canonicalization doc is already a great source of information; I just believe we would also benefit from the additional context of the TTIR dialect.

This PR should cover a big part of #1264; as mentioned above, the missing part is constant folding.

- added InvolutionTrait
- added IdempotentTrait
- TODO: address other folders
- TODO: refactor check into a separate function
- TODO: added -canonicalize into the pipeline
- TODO: add tests for involution and idempotence
- fixed some tests that failed due to canonicalization
- fixed test for fork-join: changed relu to gelu
- added traits on a few other ops as well
- TODO: tests for ReverseOp canonicalization
- a few op traits modified
- TODO: add tests for bitwise xor op canonicalization
- Broadcast noop folder
- op should be replaced instead of producer op
- tests for PermuteOp canonicalization
- improved comments in a few places
azecevicTT (Contributor, Author) left a comment:
A few comments regarding some changes that are not clear without context.

@@ -41,6 +43,28 @@ def TTIR_Dialect : Dialect {
//===----------------------------------------------------------------------===//

class TTIR_Op<string mnemonic, list<Trait> traits = []> :
- Op<TTIR_Dialect, mnemonic, !listconcat(traits, [Pure])>;
+ Op<TTIR_Dialect, mnemonic, !listconcat([Pure], traits)>;
azecevicTT (Contributor, Author):
I've changed the order of the listconcat arguments everywhere because of the dependent trait lists of traits like TTIR_Involution. Otherwise it fails long before compiling (the existence of these traits is checked while the .td file is still being parsed), so it doesn't see a trait/interface like DestinationStyleOpInterface on ops that have it, because the trait list is built in a bottom-up fashion ([trait_of_op, trait_of_ops_class, trait_of_ops_class_class, ...]).

From what I saw, it doesn't make any difference in the generated C++ code. @nsmithtt Have you had any experience with this?

@@ -1,18 +1,5 @@
// RUN: ttmlir-opt --ttir-to-ttnn-backend-pipeline %s | FileCheck %s
module {
func.func @linear_1d_1d(%arg0: tensor<128xbf16>, %arg1: tensor<128xbf16>) -> tensor<1xbf16> {
azecevicTT (Contributor, Author):
A few tests failed because canonicalization changed the graph. I removed them and wrote tests that specifically check the correctness of the rewriting.

@@ -27,15 +27,15 @@ module attributes {} {
// CHECK: %{{.*}} = "ttnn.relu"{{.*}} -> tensor<64x64xbf16, #[[LAYOUT_3]]>
%1 = "ttir.relu"(%arg0, %0) <{operandSegmentSizes = array<i32: 1, 1>}> : (tensor<64x64xbf16>, tensor<64x64xbf16>) -> tensor<64x64xbf16>
%2 = tensor.empty() : tensor<64x64xbf16>
%3 = "ttir.relu"(%1, %2) <{operandSegmentSizes = array<i32: 1, 1>}> : (tensor<64x64xbf16>, tensor<64x64xbf16>) -> tensor<64x64xbf16>
%3 = "ttir.gelu"(%1, %2) <{operandSegmentSizes = array<i32: 1, 1>}> : (tensor<64x64xbf16>, tensor<64x64xbf16>) -> tensor<64x64xbf16>
azecevicTT (Contributor, Author):
I've synced offline with @fbajraktariTT regarding this: since RELU is an idempotent op, two consecutive RELUs trigger the rewrite, so I've just replaced one RELU with GELU, as the concrete op wasn't a concern in this test.

odjuricicTT (Contributor) left a comment:
Optimizer test changes are fine :)
