#6344: Update RoBERTa QA demo #8896
base: main
f712c92
to
519788f
Compare
Looking for tests with both these annotations...
@pytest.mark.models_device_performance_bare_metal
@pytest.mark.models_performance_bare_metal
Note that we document this here: https://tenstorrent.github.io/tt-metal/latest/ttnn/ttnn/demos.html
See for example:
tests/ttnn/integration_tests/resnet/test_performance.py
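For reference, a test carrying both annotations could look roughly like the sketch below. The test name and body are hypothetical placeholders and are not taken from this PR; only the two marker names come from the comment above.

```python
import pytest


# Both markers are stacked on the same test function.
@pytest.mark.models_device_performance_bare_metal
@pytest.mark.models_performance_bare_metal
def test_roberta_performance_placeholder():
    # Hypothetical body: a real test would run the model and compare the
    # measured timings against the expected ones.
    measured_iterations = 1
    assert measured_iterations > 0
```

Running `pytest -m models_performance_bare_metal` would then select this test through the marker.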
@eyonland,
519788f
to
72b75e3
Compare
@esmalTT, could you please review and approve this PR?
tt_output_start_logits = tt_output[..., :, 0]
tt_output_end_logits = tt_output[..., :, 1]

assert_with_pcc(torch_output_start_logits, tt_output_start_logits, 0.81 if is_grayskull else 0.89)
Do these PCC bounds need to be updated?
RoBERTa reuses ttnn_optimized_bert as its base model (models/demos/bert/tt/ttnn_optimized_bert.py). With that implementation, the RoBERTa model reaches a PCC of 0.81 on Grayskull (GS) and 0.89 on Wormhole (WH).
cc: @boris-drazic
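To make the thresholds concrete, here is a minimal sketch of what a PCC (Pearson correlation coefficient) check computes. It is written in plain Python for illustration and is not the repository's `assert_with_pcc`; the helper names `pcc`, `pcc_threshold`, and `check_pcc` are assumptions made up for this example.

```python
import math


def pcc(expected, actual):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(expected) != len(actual) or len(expected) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(expected)
    mean_e = sum(expected) / n
    mean_a = sum(actual) / n
    cov = sum((e - mean_e) * (a - mean_a) for e, a in zip(expected, actual))
    var_e = sum((e - mean_e) ** 2 for e in expected)
    var_a = sum((a - mean_a) ** 2 for a in actual)
    denom = math.sqrt(var_e * var_a)
    if denom == 0:
        raise ValueError("PCC is undefined when either input is constant")
    return cov / denom


def pcc_threshold(is_grayskull):
    # Thresholds quoted in this thread: 0.81 on Grayskull, 0.89 on Wormhole.
    return 0.81 if is_grayskull else 0.89


def check_pcc(expected, actual, threshold):
    """Fail if the measured PCC falls below the threshold; return it otherwise."""
    value = pcc(expected, actual)
    assert value >= threshold, f"PCC {value:.4f} is below threshold {threshold}"
    return value
```

A PCC of 1.0 means the two outputs are perfectly linearly correlated, so a bound of 0.81 or 0.89 tolerates a visible deviation between the torch and device logits.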
How do we know this model is correct? This PCC value seems pretty low to me.
8df21f2
to
60c9f3e
Compare
@kkeerthana0573 When you are ready, please update the description with the links to successful CI runs.
@kkeerthana0573, please add details on why the CIs are failing for RoBERTa.
The model works as expected on top of 019a5ccaed35c1864cc102895114657e6dff08b6. Here are the passing CI links:
However, with the changes introduced in commit 93fccc5e8880010e8b5739478b48c037e747cbb4, the ttnn.linear op using core_grids here throws an error. Here is the failure log from the Device Perf CI for reference: link
cb3678c
to
7d6dbe0
Compare
7d6dbe0
to
ec34d65
Compare
This was resolved by disabling packer L1 accumulation and math_approx_mode, and by setting math_fidelity to HiFi2.
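As a rough sketch of what such a configuration could look like on Wormhole, assuming the settings are applied through `ttnn.WormholeComputeKernelConfig`: the helper name `make_compute_kernel_config` and the `RESOLVED_SETTINGS` dictionary are hypothetical, and the actual change in this PR may be structured differently.

```python
# Settings named in the comment above, kept separate from the ttnn import so
# they can be inspected on machines without the library installed.
RESOLVED_SETTINGS = {
    "math_fidelity": "HiFi2",
    "math_approx_mode": False,
    "packer_l1_acc": False,
}


def make_compute_kernel_config(settings=RESOLVED_SETTINGS):
    """Build a Wormhole compute kernel config from the settings above."""
    import ttnn  # imported lazily; requires a tt-metal installation

    return ttnn.WormholeComputeKernelConfig(
        math_fidelity=getattr(ttnn.MathFidelity, settings["math_fidelity"]),
        math_approx_mode=settings["math_approx_mode"],
        packer_l1_acc=settings["packer_l1_acc"],
    )
```

The resulting object would typically be passed to an op such as `ttnn.linear` through its `compute_kernel_config` argument.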
@esmalTT,
ec34d65
to
3b3b37e
Compare
3b3b37e
to
bd30318
Compare
Re-running demo pipeline here: https://github.com/tenstorrent/tt-metal/actions/runs/12431869447
bd30318
to
d4bf99c
Compare
Ticket
Problem description
What's changed
Checklist
Passing CI links - 09.12.2024: