-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] unsupportedoperators.csv shows stageID=-1 for certain unsupported operator #1156
Comments
After taking a look at the eventlog. There is no clear path to work around this. We can try adding some heuristics that link an exec to a stage based on the neighboring expressions, but then we need to come up with a well defined strategy for that. Otherwise, it will be come a big mess of heuristics that's hard to understand. |
We need to investigate further by checking the SHS code that parses the RDD information inside a stage. |
Bug identified in the tools code
The suggested plan to fix this bug is:
CC: @mattahrens |
Signed-off-by: Ahmed Hussein (amahussein) <[email protected]> Fixes NVIDIA#1156 This adds logic to walk the SparkGraph in order to assign execs to stages. For nodes that have no AccumIDs, the clusterization processes relies on adjacent nodes.
Describe the bug
unsupportedoperators.csv shows stageID=-1 for certain unsupported operator.
Does it mean Qual tool could not figure out which stage is associated with certain unsupported operators?
As a result, Qual tool thinks the % of unsupported duration is very low which could be wrong.
The text was updated successfully, but these errors were encountered: