We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cross-posting this issue from ipex, in case the torch-ccl team is not aware of it.
ipex
torch-ccl
Key issues:
The pytorch profiler traces highlight the issues (copied from the other thread):
Non-blocking kernel launch and comms/compute overlap.
Blocking kernel launch and no comms/compute overlap.
See the other thread for more details.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Cross-posting this issue from
ipex
, in case thetorch-ccl
team is not aware of it.Key issues:
The pytorch profiler traces highlight the issues (copied from the other thread):
A100 Trace
Non-blocking kernel launch and comms/compute overlap.
Intel Max 1550 Trace
Blocking kernel launch and no comms/compute overlap.
See the other thread for more details.
The text was updated successfully, but these errors were encountered: