Add embedding weight cast from float32 to bfloat16 #55

Merged 1 commit into main from dgolubovic/add-embedding-weight-cast on Dec 25, 2024

Conversation

dgolubovicTT (Contributor) commented on Dec 25, 2024:

Since ttnn only supports bfloat16 weights for the embedding op, a cast is introduced as a temporary solution until tt-mlir handles casting the embedding weight in its workaround pass (tenstorrent/tt-mlir#1657). This enables the Llama embedding to work.
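A minimal sketch of what such a workaround cast can look like on the PyTorch side, assuming the cast is applied to the module before lowering; the function name and placement here are illustrative, not the actual code from this PR:

```python
# Hypothetical sketch: cast float32 embedding weights to bfloat16 before
# lowering, since ttnn's embedding op only accepts bfloat16 weights.
# Names (cast_embedding_weights_to_bfloat16) are illustrative.
import torch
import torch.nn as nn


def cast_embedding_weights_to_bfloat16(module: nn.Module) -> nn.Module:
    """Cast float32 embedding weights to bfloat16 in place."""
    for submodule in module.modules():
        if isinstance(submodule, nn.Embedding) and submodule.weight.dtype == torch.float32:
            submodule.weight.data = submodule.weight.data.to(torch.bfloat16)
    return module


# Example: a toy model with a float32 embedding table.
model = nn.Sequential(nn.Embedding(num_embeddings=32000, embedding_dim=128))
model = cast_embedding_weights_to_bfloat16(model)
assert model[0].weight.dtype == torch.bfloat16
```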

dgolubovicTT self-assigned this on Dec 25, 2024
…ch.py to cast weights to bfloat16 if they are float32.
dgolubovicTT force-pushed the dgolubovic/add-embedding-weight-cast branch from 63fcdb6 to 0a429cf on December 25, 2024 at 15:32
nvukobratTT (Contributor) left a comment:

Let's just link MLIR PR to this PR for ref :))

dgolubovicTT merged commit e405246 into main on Dec 25, 2024
7 checks passed