Adding bf16 data type workaround for max_pool2d op and fixing embedding workaround tests #1657

sdjordjevicTT · 2024-12-23T13:52:40Z

Applying bf16 data type workaround on max_pool2d op.

With this change, I rewrote the workaround test, the silicon test, and the compiler test.

Closes #1389
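
As background for the bf16 cast this PR centers on, here is a minimal sketch of what converting float32 to bfloat16 and back does numerically. It is not code from this PR: the helper names `floatToBf16` and `bf16ToFloat` are hypothetical, and the sketch assumes round-to-nearest-even and ignores NaN handling.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helpers for illustration only; not part of tt-mlir.

// Convert float32 to bfloat16 by keeping the upper 16 bits, rounding the
// discarded lower 16 bits to nearest, ties to even. NaN inputs are not
// handled specially in this sketch.
inline std::uint16_t floatToBf16(float value) {
  std::uint32_t bits;
  std::memcpy(&bits, &value, sizeof(bits));
  std::uint32_t lsb = (bits >> 16) & 1u; // lowest bit that survives
  bits += 0x7FFFu + lsb;                 // round to nearest even
  return static_cast<std::uint16_t>(bits >> 16);
}

// Convert bfloat16 back to float32 by zero-filling the lower 16 bits.
inline float bf16ToFloat(std::uint16_t value) {
  std::uint32_t bits = static_cast<std::uint32_t>(value) << 16;
  float result;
  std::memcpy(&result, &bits, sizeof(result));
  return result;
}
```

Values with at most 8 significant bits, such as 1.0f or 3.0f, survive the round trip unchanged, while finer detail in the float32 mantissa is rounded away. That is the precision cost of any workaround that routes an op through bf16.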

runtime/lib/ttnn/operations/layout/typecast.cpp

sdjordjevicTT · 2024-12-23T16:59:30Z

Created the following issue to track the runtime workaround:
#1658

jnie-TT · 2024-12-23T17:50:17Z

runtime/include/tt/runtime/detail/workarounds.h

 private:
  constexpr Env(bool maxpool2dPreshard, bool swapBinaryOperands,
-                bool readUpdateIndexFromDeviceForKVCache)
+                bool readUpdateIndexFromDeviceForKVCache, bool typecastOnHost)


Maybe name this toDtypeOnHost or something similar, so that it's explicit that we're using to_dtype when typecasting on host.

Also, please add a ttrt command line option to toggle this in runtime/tools/python/ttrt/common/run.py; the procedure is the same as for the other workaround flags.

Renamed typecastOnHost to toDtypeOnHost. I added an option in run.py; please double-check if I implemented it correctly.
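
For readers following the rename, the flag pattern can be sketched roughly as below. This is a simplified stand-in, not the actual contents of workarounds.h. The four flag names come from the diff and the rename above. The `get()` factory, the default values, the `TypecastPath` enum, and `selectTypecastPath` are assumptions made for illustration.

```cpp
namespace sketch {

// Simplified stand-in for the workaround Env. The defaults (all workarounds
// enabled) are an assumption, not taken from the PR.
struct Env {
  static constexpr Env get(bool maxpool2dPreshard = true,
                           bool swapBinaryOperands = true,
                           bool readUpdateIndexFromDeviceForKVCache = true,
                           bool toDtypeOnHost = true) {
    return Env(maxpool2dPreshard, swapBinaryOperands,
               readUpdateIndexFromDeviceForKVCache, toDtypeOnHost);
  }

  bool maxpool2dPreshard;
  bool swapBinaryOperands;
  bool readUpdateIndexFromDeviceForKVCache;
  bool toDtypeOnHost; // renamed from typecastOnHost during review

private:
  constexpr Env(bool maxpool2dPreshard, bool swapBinaryOperands,
                bool readUpdateIndexFromDeviceForKVCache, bool toDtypeOnHost)
      : maxpool2dPreshard(maxpool2dPreshard),
        swapBinaryOperands(swapBinaryOperands),
        readUpdateIndexFromDeviceForKVCache(
            readUpdateIndexFromDeviceForKVCache),
        toDtypeOnHost(toDtypeOnHost) {}
};

// Hypothetical dispatch: which conversion path a typecast op would take.
enum class TypecastPath { HostToDtype, DeviceTypecast };

inline constexpr TypecastPath selectTypecastPath(const Env &env) {
  return env.toDtypeOnHost ? TypecastPath::HostToDtype
                           : TypecastPath::DeviceTypecast;
}

} // namespace sketch
```

With the assumed defaults, `sketch::selectTypecastPath(sketch::Env::get())` picks the host path. Passing `false` as the fourth argument, which is what a command line toggle would amount to, picks the device path.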

jnie-TT

Runtime changes look good, thanks Stefan!

jserbedzijaTT

Looks good, thanks stefi!

lib/Dialect/TTNN/IR/TTNNWorkarounds.cpp

sdjordjevicTT requested review from svuckovicTT, mtopalovicTT, nobradovictt, jserbedzijaTT, jnie-TT and azecevicTT as code owners December 23, 2024 13:52

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from 599bfb9 to ce775d3 Compare December 23, 2024 14:54

sdjordjevicTT requested review from kmabeeTT, AleksKnezevic and pilkicTT as code owners December 23, 2024 14:54

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from ce775d3 to ee54049 Compare December 23, 2024 15:36

jnie-TT reviewed Dec 23, 2024

runtime/lib/ttnn/operations/layout/typecast.cpp (outdated, resolved)

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from ee54049 to f3ec40c Compare December 23, 2024 16:58

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from f3ec40c to f88de78 Compare December 23, 2024 17:28

sdjordjevicTT requested review from tapspatel and nsmithtt as code owners December 23, 2024 17:28

jnie-TT reviewed Dec 23, 2024

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from f88de78 to e320d04 Compare December 23, 2024 18:26

jnie-TT approved these changes Dec 23, 2024

mtopalovicTT approved these changes Dec 25, 2024

jserbedzijaTT approved these changes Dec 25, 2024

lib/Dialect/TTNN/IR/TTNNWorkarounds.cpp (outdated, resolved)

sdjordjevicTT changed the title ~~Adding bf16 data type workaround for max_pool2d op~~ Adding bf16 data type workaround for max_pool2d op and fixing embedding workaround tests Dec 25, 2024

dgolubovicTT mentioned this pull request Dec 25, 2024

Add embedding weight cast from float32 to bfloat16 tenstorrent/tt-tvm#55

Merged

tapspatel approved these changes Dec 26, 2024

Adding bf16 data type workaround for max_pool2d op.

5ba32ac

sdjordjevicTT force-pushed the sdjordjevic/max_pool_2d_bf16_workaround branch from e320d04 to 5ba32ac Compare December 26, 2024 09:57

sdjordjevicTT merged commit cfc6f53 into main Dec 26, 2024
21 checks passed

sdjordjevicTT deleted the sdjordjevic/max_pool_2d_bf16_workaround branch December 26, 2024 10:52
