
ttnn.hypot_bw low PCC #13776

Open
Tracked by #13795
amalbasaTT opened this issue Oct 14, 2024 · 6 comments

amalbasaTT (Contributor) commented Oct 14, 2024

Describe the bug
ttnn.hypot_bw fails with low PCC when input_tensor_a and/or input_tensor_b have the bfloat8_b dtype.

To Reproduce
Steps to reproduce the behavior:
Sweep test for hypot_bw is located in 'tests/sweep_framework/sweeps/eltwise/binary_backward/hypot_bw/hypot_bw.py'

  1. Go to 'tests/sweep_framework/sweeps/eltwise/binary_backward/hypot_bw/hypot_bw.py'
  2. Generate new parameter vectors and run the sweep test:
python3 tests/sweep_framework/sweeps_parameter_generator.py --elastic cloud --module-name eltwise.binary_backward.hypot_bw.hypot_bw
python3 tests/sweep_framework/sweeps_runner.py --elastic cloud --module-name eltwise.binary_backward.hypot_bw.hypot_bw --suite-name xfail
  3. See the error. Results can be found on elastic cloud as explained here: https://github.com/tenstorrent/tt-metal/tree/main/tests/sweep_framework
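
For a quicker local check than the full sweep, a minimal standalone repro along these lines should surface the PCC drop. This is a sketch only: the tensor shape, the PCC helper, and the assumption that ttnn.hypot_bw returns [grad_a, grad_b] are illustrative, not taken from the sweep test.

    # Minimal repro sketch (shapes and return-format are assumptions; the real
    # sweep lives in tests/sweep_framework/sweeps/eltwise/binary_backward/hypot_bw/hypot_bw.py).
    import torch
    import ttnn

    def pcc(golden, actual):
        # Pearson correlation between flattened tensors.
        g = golden.flatten().to(torch.float32)
        a = actual.flatten().to(torch.float32)
        return torch.corrcoef(torch.stack([g, a]))[0, 1].item()

    device = ttnn.open_device(device_id=0)

    torch.manual_seed(0)
    shape = (1, 1, 64, 64)
    grad = torch.randn(shape)
    a = torch.randn(shape, requires_grad=True)
    b = torch.randn(shape, requires_grad=True)

    # Golden gradients from torch autograd.
    torch.hypot(a, b).backward(grad)

    # Device gradients with bfloat8_b inputs (bfloat8_b requires TILE_LAYOUT).
    tt_grad = ttnn.from_torch(grad, dtype=ttnn.bfloat8_b, layout=ttnn.TILE_LAYOUT, device=device)
    tt_a = ttnn.from_torch(a.detach(), dtype=ttnn.bfloat8_b, layout=ttnn.TILE_LAYOUT, device=device)
    tt_b = ttnn.from_torch(b.detach(), dtype=ttnn.bfloat8_b, layout=ttnn.TILE_LAYOUT, device=device)

    # Assumed to return the gradients w.r.t. input_tensor_a and input_tensor_b.
    tt_grads = ttnn.hypot_bw(tt_grad, tt_a, tt_b)

    print("PCC grad_a:", pcc(a.grad, ttnn.to_torch(tt_grads[0])))
    print("PCC grad_b:", pcc(b.grad, ttnn.to_torch(tt_grads[1])))

    ttnn.close_device(device)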
KalaivaniMCW (Contributor) commented Nov 5, 2024

ttnn.hypot_bw uses ttnn.reciprocal in its implementation, which has this issue: #14672.
This is what causes the PCC drop when using bfloat8_b.
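
For reference, the reciprocal shows up if the backward is written as a composite of elementwise ops; near-zero bfloat8_b values make 1/hypot(a, b) blow up, which is where the PCC is lost. The torch sketch below illustrates that formula only and is an assumption about the composite structure, not the actual ttnn kernel code.

    # Torch sketch of the composite backward formula (not the ttnn kernel).
    import torch

    def hypot_bw_reference(grad, a, b):
        # grad_a = grad * a * (1 / hypot(a, b)), grad_b = grad * b * (1 / hypot(a, b))
        recip = torch.reciprocal(torch.hypot(a, b))  # <- the ttnn.reciprocal step
        return grad * a * recip, grad * b * recip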

@KalaivaniMCW KalaivaniMCW self-assigned this Nov 5, 2024
@eyonland eyonland added the MCW label Dec 20, 2024
KalaivaniMCW (Contributor) commented:

Hi @amalbasaTT,
When using the bfloat8_b dtype, could you try non-zero inputs to see if these cases pass?
You can find more information on this here: #14672

For bfloat8_b and ttnn.reciprocal-related ops we need separate implementations for zero and non-zero inputs.
Handling this at the kernel level would make the op slower, so we will discuss the best approach, i.e. whether to handle it at the composite level instead. For now, could we test on non-zero inputs to identify any issues other than this one? (A sketch of one way to generate such inputs follows below.)
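
One possible way to generate non-zero test inputs; the value range here is an assumption, chosen so that the values stay non-zero after bfloat8_b quantization.

    # Sketch: draw inputs whose magnitude stays away from zero (range is illustrative).
    import torch

    def gen_nonzero(shape, low=0.1, high=100.0):
        magnitude = torch.empty(shape).uniform_(low, high)
        sign = torch.randint(0, 2, shape) * 2 - 1
        return magnitude * sign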

amalbasaTT (Contributor, Author) commented:

@KalaivaniMCW,
Of course, no problem. Do you want a unit test or a sweep test for each of the ttnn.reciprocal-related ops? Also, is there a list of ttnn.reciprocal-related ops somewhere?

KalaivaniMCW (Contributor) commented:

For now, the issues below seem to be related.
If these op sweeps pass for non-zero inputs, we can rule out bfloat8_b behaviour as their cause.
#13776
#13937
#13975

amalbasaTT (Contributor, Author) commented Dec 23, 2024

  • ttnn.hypot passes in all cases when both inputs (input_tensor_a and input_tensor_b) have non-zero elements and the bfloat8_b datatype
  • ttnn.rdiv_bw (issue) passes in 4.167% of cases when all inputs (grad_tensor, input_tensor_a and factor) have non-zero elements and grad_tensor and input_tensor_a have the bfloat8_b datatype

I will need some additional time to analyse div_bw since it has way more parameters.

Check out the branch amalbasaTT/reciprocal-related-nonzero-sweeps to see the tests (hypot_nonzero.py and rdiv_bw_nonzero.py).

amalbasaTT (Contributor, Author) commented Dec 24, 2024

@KalaivaniMCW

I have tested rdiv_bw, hypot_bw and div_bw with inputs outside of the range [-1, 1]. Before running the ops, I converted the inputs to ttnn tensors with the bfloat8_b dtype and back to torch, so that I could ensure all inputs remain non-zero after any cut-off the conversion to bfloat8_b may introduce (a sketch of this round-trip is at the end of this comment). The issue I found is that when input_tensor has bfloat8_b and a "certain" shape, the op produces an all-zero tensor (or tensors, for binary ops). You can check the details in the issues I filed below; I have provided a unit test and a sweep test for each op:

#16305
#16304
#16306

Besides the issue I reported, all three ops work fine.
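
For reference, a sketch of the round-trip described above; the device handling and the zero check are illustrative assumptions, not copied from the test branch.

    # Sketch: quantize to bfloat8_b on device, convert back to torch, and verify
    # nothing collapsed to zero before using the tensor as an op input.
    import torch
    import ttnn

    def roundtrip_bfloat8_b(torch_tensor, device):
        tt = ttnn.from_torch(torch_tensor, dtype=ttnn.bfloat8_b, layout=ttnn.TILE_LAYOUT, device=device)
        back = ttnn.to_torch(tt)
        assert torch.all(back != 0), "bfloat8_b quantization zeroed out some elements"
        return back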
