#9709: Add optional transpose_a and transpose_b to ttnn matmul and linear #9836

TT-BrianLiu · 2024-06-28T21:55:34Z

Ticket

Problem description

In GGML, inputs a and b to matmul may be "pre-transposed", so to perform matmul properly we need a corresponding transpose a or b.

What's changed

Easiest solution is to add an optional transpose op before running matmul. I added transpose_a and transpose_b flags to specify whether to run the transpose on the inputs or not. This should work for most interleaved cases. As a side note, I also had to fix transpose to respect padding when swapping dims during compute_output_shapes.

There are two concerns that can be addressed down the line:

Perf: We can technically support the optional transpose directly in the matmuls, but this is a lot more work and requires more testing. But it will give us better perf.
Functionality: Using composite ops to achieve transpose will limit the input specs available to matmul to what's supported by our transpose op. We can either uplift transpose to support what matmul supports, or if we did the transposes internally in matmul this problem will automatically go away.

Checklist

Post commit CI passes
- All post-commit: https://github.com/tenstorrent/tt-metal/actions/runs/9764203272
- Nightly fast dispatch:https://github.com/tenstorrent/tt-metal/actions/runs/9764205780
- ttnn unit tests: https://github.com/tenstorrent/tt-metal/actions/runs/9764213369
Model regression CI testing passes (if applicable)
- Models post-commit: https://github.com/tenstorrent/tt-metal/actions/runs/9764208694
New/Existing tests provide coverage for changes

TT-BrianLiu requested review from eyonland, arakhmati, cfjchu, xanderchin and ayerofieiev-tt as code owners June 28, 2024 21:55

TT-BrianLiu mentioned this pull request Jun 28, 2024

[Feature Request] Support matmul with pre-transposed weights. #9709

Closed

ayerofieiev-tt approved these changes Jun 28, 2024

View reviewed changes

TT-BrianLiu force-pushed the jedi branch from bdc3d60 to 47738eb Compare July 2, 2024 15:35

TT-BrianLiu mentioned this pull request Jul 2, 2024

Correctly resolve both tensor dimension and padding when calculating transposed tensor sizes #9805

Closed

3 tasks

TT-BrianLiu force-pushed the jedi branch from 47738eb to ef7055f Compare July 2, 2024 15:46

TT-BrianLiu temporarily deployed to dev July 2, 2024 15:55 — with GitHub Actions Inactive

TT-BrianLiu temporarily deployed to dev July 2, 2024 15:56 — with GitHub Actions Inactive

TT-BrianLiu temporarily deployed to dev July 2, 2024 16:10 — with GitHub Actions Inactive

TT-BrianLiu temporarily deployed to dev July 2, 2024 16:17 — with GitHub Actions Inactive

TT-BrianLiu merged commit 2008386 into main Jul 2, 2024
5 checks passed

TT-BrianLiu deleted the jedi branch July 2, 2024 18:01

TT-BrianLiu temporarily deployed to dev July 2, 2024 18:01 — with GitHub Actions Inactive

tapspatel temporarily deployed to dev July 2, 2024 18:01 — with GitHub Actions Inactive

tapspatel temporarily deployed to dev July 2, 2024 18:03 — with GitHub Actions Inactive

TT-BrianLiu temporarily deployed to dev July 2, 2024 18:13 — with GitHub Actions Inactive

tapspatel temporarily deployed to dev July 2, 2024 18:15 — with GitHub Actions Inactive

TT-BrianLiu temporarily deployed to production July 2, 2024 18:30 — with GitHub Actions Inactive

tt-rkim temporarily deployed to dev July 2, 2024 19:05 — with GitHub Actions Inactive

tt-rkim had a problem deploying to dev July 2, 2024 19:11 — with GitHub Actions Failure

bbradelTT mentioned this pull request Jul 8, 2024

#9492: move matmul code to ttnn directory hierarchy #10015

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#9709: Add optional transpose_a and transpose_b to ttnn matmul and linear #9836

#9709: Add optional transpose_a and transpose_b to ttnn matmul and linear #9836

TT-BrianLiu commented Jun 28, 2024 •

edited

Loading

#9709: Add optional transpose_a and transpose_b to ttnn matmul and linear #9836

#9709: Add optional transpose_a and transpose_b to ttnn matmul and linear #9836

Conversation

TT-BrianLiu commented Jun 28, 2024 • edited Loading

Ticket

Problem description

What's changed

Checklist

TT-BrianLiu commented Jun 28, 2024 •

edited

Loading