
Exclude Padding from Shape Validation in Concat Operation #15308 #15329

Closed
shwetankTT wants to merge 1 commit from shwetankTT/concat_shape_val

Conversation

shwetankTT
Contributor

@shwetankTT shwetankTT commented Nov 21, 2024

Ticket

#15308

Problem description

The concat operation validates the shapes of all input tensors to determine whether they can be concatenated. However, LegacyShape includes padding as part of the shape computation, and padding should not be considered during shape validation for the concat operation.
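
For illustration, here is a minimal sketch of the kind of check this change aims for: non-concat dimensions are compared using logical (unpadded) extents only, so tile or shard padding cannot cause a spurious validation failure. `TensorShapeInfo`, `logical_dims`, and `padded_dims` are hypothetical stand-ins, not the actual tt-metal types or accessors.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Illustrative stand-in for whatever shape information the concat validation sees.
struct TensorShapeInfo {
    std::vector<uint32_t> logical_dims;  // shape without padding
    std::vector<uint32_t> padded_dims;   // shape including tile/shard padding
};

// Require all inputs to match on every dimension except the concat dimension,
// comparing logical (unpadded) extents and deliberately ignoring padding.
void validate_concat_shapes(const std::vector<TensorShapeInfo>& inputs, std::size_t concat_dim) {
    if (inputs.empty()) {
        throw std::invalid_argument("concat requires at least one input");
    }
    const auto& ref = inputs.front().logical_dims;
    for (const auto& t : inputs) {
        if (t.logical_dims.size() != ref.size()) {
            throw std::invalid_argument("concat inputs must have the same rank");
        }
        for (std::size_t d = 0; d < ref.size(); ++d) {
            if (d == concat_dim) continue;
            if (t.logical_dims[d] != ref[d]) {
                throw std::invalid_argument("concat inputs differ in a non-concat dimension");
            }
        }
    }
}
```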

What's changed

CI Link: https://github.com/tenstorrent/tt-metal/actions/runs/11953331942

Checklist

  • Post commit CI passes
  • Blackhole Post commit (if applicable)
  • Model regression CI testing passes (if applicable)
  • Device performance regression CI testing passes (if applicable)
  • New/Existing tests provide coverage for changes

@shwetankTT
Contributor Author

shwetankTT commented Nov 21, 2024

@ntarafdar @sjameelTT @jaebaek and @yugi957 I have not tested this extensively, so please let me know if this PR does not make sense. I hit a failure caused by this validation error while optimizing the yolov4 model and decided to fix it.

@jaykru-tt
Contributor

Can you run the models perf CI to ensure that nothing broke?

This doesn't seem to me like it should work in general with arbitrary padding for on-device concat; this op isn't padding-aware yet, as far as I know. If the inputs to concat have the same logical shapes but different padding, we would very likely mess things up. On the other hand, we only have padding for tiled tensors right now, and the padding should be the same for tensors of the same logical shape, so this should actually be okay until that changes.

I think we will eventually have arbitrary padding including for RM tensors, so we would have to revisit this then. @sjameelTT any thoughts? I know you've looked at concat before.

@jaykru-tt
Contributor

@shwetankTT mentioned to me that he is obtaining tiled tensors with identical logical shapes but distinct padding when converting from sharded to interleaved (assuming a tilize is thrown in as well). This is even worse than what I was worrying about above.

input-tensor --> shape=Shape([1, 1, 400[448], 256]), dtype=DataType::BFLOAT16, layout=Layout::TILE)
MemoryConfig(memory_layout=TensorMemoryLayout::INTERLEAVED,buffer_type=BufferType::L1,shard_spec=std::nullopt)
input_tensor_2 --> shape=Shape([1, 1, 400[416], 256]), dtype=DataType::BFLOAT16, layout=Layout::TILE)
MemoryConfig(memory_layout=TensorMemoryLayout::INTERLEAVED,buffer_type=BufferType::DRAM,shard_spec=std::nullopt)
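
Here the bracketed value in each shape appears to be the padded extent of that dimension, so both tensors have logical extent 400 but padded extents 448 and 416. Plugging these shapes into the hypothetical `validate_concat_shapes` sketch above (concat dimension chosen arbitrarily for illustration), a logical-shape comparison accepts them, while a padded-shape comparison would not:

```cpp
std::vector<TensorShapeInfo> inputs = {
    {{1, 1, 400, 256}, {1, 1, 448, 256}},  // input-tensor: logical 400, padded 448
    {{1, 1, 400, 256}, {1, 1, 416, 256}},  // input_tensor_2: logical 400, padded 416
};
validate_concat_shapes(inputs, /*concat_dim=*/3);  // passes: logical extents match on dims 0-2
```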

@shwetankTT
Contributor Author

I am no longer seeing this issue on mainline; it seems to have been resolved. Closing this PR for that reason.

@shwetankTT shwetankTT closed this Nov 25, 2024
@shwetankTT shwetankTT deleted the shwetankTT/concat_shape_val branch December 4, 2024 06:38