Fixing reduction ops #1673
base: main
Conversation
Just curious about the reasoning: why did you decide to make this change in TTIR instead of Forge? You had to change a bunch of code and a bunch of tests because of this attribute renaming (and missed some). If we are already renaming, I would choose
@mrakitaTT When I was talking with someone from Forge a couple of days ago, they told me they are trying to mirror torch as closely as possible with their ops, so I decided not to do it in Forge. If we were to change it in Forge, we would have to either:
But regardless of the naming, we still have an issue with the attribute type, which is not 1-1 with what Forge can lower.
Personally, I thought the change was small and simplified things, like getting the reduce dims. Unfortunately, with this kind of change, test changes are inevitable. Regarding the tests, I fixed that; I didn't have the StableHLO flag on...
Yeah, I thought this too; not sure why the torch developers decided on that name.
Yeah, but this is not part of the Forge external interface (i.e. changing the Forge ops); this is part of the serialization from the Forge graph to the TTIR graph.
I am not too familiar with the Forge codebase, so please pardon my ignorance, but shouldn't this just be a simple serialization string change?
Edit: I've just checked the Forge repo and I see what you mean; Forge serialization is just matching attribute names from Forge.
I see this pattern often where we are focusing too much on Forge, but I would implore everyone to always keep in mind that there are also other frontends and dialects, and to check their definitions too when deciding on something like this. For example, we've named some op
I am not sure about this change either. Imagine if some other dialect used
For example, StableHLO uses the i64 type for dimensions, so we do this during the StableHLO->TTIR conversion:
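A minimal sketch of the kind of narrowing described, with illustrative names (not the actual tt-mlir conversion code):

```cpp
#include "mlir/IR/Builders.h"
#include "llvm/ADT/SmallVector.h"

// Illustrative only: StableHLO carries reduction dimensions as i64, while
// TTIR stores them as i32, so the lowering has to narrow each value.
mlir::ArrayAttr convertReduceDims(mlir::Builder &builder,
                                  llvm::ArrayRef<int64_t> stablehloDims) {
  llvm::SmallVector<int32_t> dims32;
  for (int64_t dim : stablehloDims)
    dims32.push_back(static_cast<int32_t>(dim));
  return builder.getI32ArrayAttr(dims32);
}
```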
Arguably we could've also changed TTIR to use i64 instead of i32, though I can't imagine a tensor with such a large rank ever existing :)
For this change I'm comfortable with changing the attribute name and a couple of tests. This is blocking
Well, this is a perfect example. If it has the same semantics as
If we want to standardize how we add an op in TTIR, I'm all for it (defining types/common names for attributes; I like this). But this would require some thought, which I don't see as P1 tbh. If you don't agree with the above, we have a Dialect team meeting on Tuesday and we can discuss this :)
```diff
@@ -578,10 +578,32 @@ class TTNN_ReductionOp<string mnemonic, list<Trait> traits = []> : TTNN_Op<mnemo
     let arguments = (ins AnyRankedTensor:$input,
                          BoolAttr:$keep_dim,
-                         OptionalAttr<I32ArrayAttr>:$dim_arg);
+                         OptionalAttr<AnyAttrOf<[SI32Attr, I32ArrayAttr]>>:$dim);
```
Consider changing SI32Attr to I32Attr.
@svuckovicTT Should we prefer I32 when TTNN uses a signed type? I'm okay with both, but for simplicity I agree that we should settle on one or the other. SI32 seems more convenient because the getter returns int32_t; if I remember correctly, the I32 getter will return uint32_t by default.
The DenseI32ArrayAttr suggestion applies here as well.
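To illustrate the getter point, a small standalone example (not tt-mlir code) of why the signed variant reads more naturally for torch-style negative dims:

```cpp
#include <cstdint>
#include <cstdio>

int main() {
  // dim = -1 means "the last axis" in torch-style reduce ops.
  int32_t signedDim = -1;                           // what an SI32Attr getter yields
  uint32_t unsignedDim = static_cast<uint32_t>(-1); // what a signless getter yields
  std::printf("signed: %d, unsigned: %u\n", signedDim, unsignedDim);
  // prints: signed: -1, unsigned: 4294967295
  return 0;
}
```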
Huh, now that you say it, I agree: SI32Attr does make more sense; we do want it to be obvious that it's signed. It still hurts that there's no signed i32 array attribute. @mtopalovicTT sorry for the noise.
```cpp
SmallVector<int64_t> dims = op.getReduceDims();
SmallVector<int32_t> dims32(dims.begin(), dims.end());
auto dimArg =
    op.getReduceDims().empty() ? 0 : toFlatbuffer<int32_t>(cache, dims32);
```
Setting to 0 here seems like good bug potential; can we leave it unset in the flatbuffer? Looking at the method signature, it's optional, so I think we shouldn't assume a default value:

```cpp
static Tensor invoke(
    const Tensor& input_tensor_arg,
    const std::optional<std::variant<int, ttnn::SmallVector<int>>>& dim_arg = std::nullopt,
    const bool keepdim = true,
    const std::optional<MemoryConfig>& memory_config_arg = std::nullopt,
    const std::optional<DeviceComputeKernelConfig>& compute_kernel_config = std::nullopt,
    float scalar = 1.0f);
```
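A minimal sketch of what that could look like on the runtime side, assuming a flatbuffer accessor named dim_arg() that returns nullptr when the field is unset (names and the call shape are illustrative, not the actual tt-mlir runtime code):

```cpp
// Translate an unset flatbuffer field into std::nullopt instead of a
// hard-coded 0, letting ttnn apply its own default (reduce over all dims).
std::optional<std::variant<int, ttnn::SmallVector<int>>> dimArg = std::nullopt;
if (op->dim_arg()) {
  dimArg = ttnn::SmallVector<int>(op->dim_arg()->begin(), op->dim_arg()->end());
}
auto output = ::ttnn::sum(input, dimArg, op->keep_dim());
```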
```diff
@@ -615,7 +615,7 @@ class TTIR_ReductionOp<string mnemonic, list<Trait> traits = []> :
     let arguments = (ins AnyRankedTensor:$input,
                          AnyRankedTensor:$output,
                          BoolAttr:$keep_dim,
-                         OptionalAttr<I32ArrayAttr>:$dim_arg);
+                         OptionalAttr<AnyAttrOf<[SI32Attr, I32ArrayAttr]>>:$dim);
```
Consider changing I32ArrayAttr to DenseI32ArrayAttr (the getter will give you llvm::ArrayRef, which is much nicer to use).
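A rough illustration of the difference (assumed helper names, not actual tt-mlir code): with I32ArrayAttr every element is a boxed mlir::Attribute that must be unwrapped, while DenseI32ArrayAttr hands back a plain llvm::ArrayRef<int32_t>:

```cpp
#include "mlir/IR/BuiltinAttributes.h"
#include "llvm/ADT/SmallVector.h"

// I32ArrayAttr: per-element casting through IntegerAttr is required.
llvm::SmallVector<int32_t> fromArrayAttr(mlir::ArrayAttr attr) {
  llvm::SmallVector<int32_t> dims;
  for (mlir::Attribute element : attr)
    dims.push_back(
        static_cast<int32_t>(mlir::cast<mlir::IntegerAttr>(element).getInt()));
  return dims;
}

// DenseI32ArrayAttr: the values are already a contiguous int32_t array.
llvm::ArrayRef<int32_t> fromDenseArrayAttr(mlir::DenseI32ArrayAttr attr) {
  return attr.asArrayRef();
}
```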
```diff
@@ -624,6 +624,26 @@ class TTIR_ReductionOp<string mnemonic, list<Trait> traits = []> :
     void buildGenericRegion(::mlir::OpBuilder &opBuilder, ::mlir::Block* block);

+    SmallVector<int64_t> getReduceDims() {
+      mlir::Attribute reduceDimsAttr = getDim().value_or(mlir::Attribute{});
```
I get what you wanted to achieve here, but using std::optional<mlir::Attribute> reduceDimsAttr and * to get the value when it exists seems like a more straightforward way of dealing with this.
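A sketch of what that suggestion could look like; since the original snippet above is truncated, the dispatch on the single-dim vs array case is an assumption:

```cpp
SmallVector<int64_t> getReduceDims() {
  SmallVector<int64_t> reduceDims;
  std::optional<mlir::Attribute> reduceDimsAttr = getDim();
  if (!reduceDimsAttr)
    return reduceDims;  // no dims specified: reduce over all dimensions
  if (auto intAttr = mlir::dyn_cast<mlir::IntegerAttr>(*reduceDimsAttr)) {
    reduceDims.push_back(intAttr.getInt());    // single si32 dim
  } else if (auto arrayAttr =
                 mlir::dyn_cast<mlir::ArrayAttr>(*reduceDimsAttr)) {
    for (mlir::Attribute element : arrayAttr)  // i32 array of dims
      reduceDims.push_back(mlir::cast<mlir::IntegerAttr>(element).getInt());
  }
  return reduceDims;
}
```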
Currently, all reduction ops lowered from Forge are failing. Reduction ops in TTIR/TTNN have an optional dim_arg attribute which can be used to specify the dimensions along which the reduce should be applied. Forge uses a dim attribute to specify reduction dims, so when lowered to TTIR it is completely ignored by our compiler.

This PR fixes the naming of this attribute and also widens its possible values, from OptionalAttr<I32ArrayAttr> to OptionalAttr<AnyAttrOf<[SI32Attr, I32ArrayAttr]>>.

IR before change:
IR after change:
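A reconstructed illustration of the shape of the change (not the exact IR from the PR), using a ttir.sum-style reduction op and made-up tensor types:

```mlir
// Before: attribute named dim_arg, always an i32 array.
%1 = "ttir.sum"(%0, %out) <{dim_arg = [1 : i32], keep_dim = true}> :
       (tensor<2x4xf32>, tensor<2x1xf32>) -> tensor<2x1xf32>

// After: attribute named dim, either a single si32 or an i32 array.
%1 = "ttir.sum"(%0, %out) <{dim = 1 : si32, keep_dim = true}> :
       (tensor<2x4xf32>, tensor<2x1xf32>) -> tensor<2x1xf32>
```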
fixes #1574