-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#14732: add bert-tiny test_performance using trace and 2cq-WIP #14799
base: main
Are you sure you want to change the base?
Conversation
2ea75aa
to
5b8d6d6
Compare
@mbahnasTT , |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
approving for codeowners only, not reviewing this PR
Is this a superset of #13471? If so, should we just combine the PRs into just one? |
@esmalTT can you review the 2CQ implementation? |
@tt-rkim, We didn't want to trigger without any approval on this PR, specifically. Is that fine? |
You won't get my approval without passing CI. This is fine, but then I will report back that CI is not the blocker. |
ttnn.record_event(1, write_event) | ||
ttnn.wait_for_event(0, write_event) | ||
# TODO: Add in place support to ttnn to_memory_config | ||
# input_tensor = ttnn.reshard(tt_input_ids, input_mem_config, input_tensor) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs to be uncommented to work properly
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In our first attempt on trace+2cq implementation, we have refered to existing work. Bert-Tiny is not sharded, whereas all the reference models are, and they are image models that process only a single input, whereas Bert-Tiny process multiple inputs.
We have modified the code to make it compatible with Bert-Tiny as an unsharded model.
Do we require the above lines as the inputs are not sharded?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In our first attempt on trace+2cq implementation, we have refered to existing work. Bert-Tiny is not sharded, whereas all the reference models are, and they are image models that process only a single input, whereas Bert-Tiny process multiple inputs.
We have modified the code to make it compatible with Bert-Tiny as an unsharded model. Do we require the above lines as the inputs are not sharded?
@esmalTT awaiting your approval on this PR and could you add your suggestion on this as well?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
From what I can tell, my comments haven't been addressed. Please address the comments, and re-run pipelines before requesting re-review.
signpost(header="stop") | ||
ttnn.dump_device_profiler(device) | ||
|
||
ttnn.release_trace(device, tid) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You should probably have some asserts (like checking PCC) in here to make sure things are working as expected.
@vigneshkeerthivasanx Please fill out the checklist section with links to CI runs before requesting reviews. There is no point in reviewing broken code. |
Did you run the posted code at least locally and confirmed that it works? If so, it is safe to launch the CI, see that your tests pass, then post the passing CI links. Then, we review the code. As Evan pointed out, it will waste of time to review untested broken code. |
Will update the code and trigger CIs. |
5b8d6d6
to
f2e62d6
Compare
b6d3abb
to
58785ba
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please address Evan's requests
ea5bc80
to
d2ac87e
Compare
d2ac87e
to
75d639b
Compare
Ticket
Link to Github Issue
Problem description
Add trace+2cq test perf file for BERT-Tiny model.
What's changed
Added test perf using trace+2cq file for BERT-Tiny model.
Checklist
Passing CI Links: