Bug fix: source node evaluation #801

courtneyholcomb · 2023-10-10T02:53:04Z

Description

Fixes a bug in how we determine which source node to use. Previously, we were choosing a plan based on source node cost. Source nodes can only be ReadSqlSourceNode or MetricTimeDimensionTransformNode, so they all have DefaultCost(num_joins=0, num_aggregations=0). That means we were just arbitrarily choosing a source node that could satisfy the query. Here, the logic changes to use the number of joins in each node's LinkableInstanceSatisfiabilityEvaluation, so we choose the dataflow plan with the lowest number of joins.
Note that this fixes a bug that came up in the process of enabling querying dimensions without metrics.

Also: we have a log that says Not evaluating other nodes since we found one that doesn't require joins, but we don't stop evaluating nodes at that point. I added a break to the loop to fix that, too.

github-actions · 2023-10-10T02:53:20Z

Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.

tlento

Honestly, I'm not sure how you figured this out, most impressive. It makes sense to me, though! You might want to get @plypaul to take a quick look just to make sure I'm not missing something, this is an incredibly dense and unruly part of the codebase.

If this does go through I think it might be time to remove this whole DefaultCost visitor (in a separate PR, of course), as I believe it has outlived its usefulness.

tlento · 2023-10-10T04:20:57Z

metricflow/dataflow/builder/dataflow_plan_builder.py

@@ -553,6 +553,7 @@ def _find_measure_recipe(
            # this is going to be the lowest cost solution.
            if len(evaluation.join_recipes) == 0:
                logger.info("Not evaluating other nodes since we found one that doesn't require joins")
+                break


tlento · 2023-10-10T04:28:03Z

metricflow/dataflow/builder/dataflow_plan_builder.py

+            # All source nodes cost the same. Find evaluation with lowest number of joins.
+            node_with_lowest_cost_plan = min(
+                node_to_evaluation, key=lambda node: len(node_to_evaluation[node].join_recipes)
+            )
+            evaluation = node_to_evaluation[node_with_lowest_cost_plan]
            logger.info(
-                "Lowest cost node is:\n"
+                "Lowest cost plan is:\n"
                + pformat_big_objects(
-                    lowest_cost_node=dataflow_dag_as_text(node_with_lowest_cost),
+                    node=dataflow_dag_as_text(node_with_lowest_cost_plan),
                    evaluation=evaluation,
-                    cost=cost_function.calculate_cost(node_with_lowest_cost),
+                    joins=len(node_to_evaluation[node_with_lowest_cost_plan].join_recipes),


Nice find!

We have Yet Another Inscrutable Visitor doing this complex cost calculation, but it can only go from leaf node to root node, not the other way around. Since measure nodes are currently ONLY source nodes the cost comparison is pointless.

Using minimum join count is probably the right answer here. Later on it'll matter more what the user requests, because eventually the measure nodes could be on the right. In theory, this kind of graph walk cost computation will be useful then, but my worry about this block of logic was always that it the results might diverge as the join layout changes. However, since we have now committed to always picking the shortest join paths I think this gets us closer to where we need to be.

plypaul · 2023-10-10T22:05:52Z

@courtneyholcomb It would be preferable to encapsulate all costing using a centralized class, but that might be a more involved fix. If this is blocking you, we could merge this and do a follow up later.

tlento · 2023-10-11T00:03:28Z

@plypaul at the moment we don't do any costing at all. I say we just remove _sort_by_suitability and all of the DataflowPlanCost stuff and drop the pretense. If we find a use for a more involved centralized cost computation setup we can add one then.

courtneyholcomb · 2023-10-11T00:39:23Z

@plypaul at the moment we don't do any costing at all. I say we just remove _sort_by_suitability and all of the DataflowPlanCost stuff and drop the pretense. If we find a use for a more involved centralized cost computation setup we can add one then.

@tlento we do use _sort_by_suitability, but only to determine which node can satisfy the most linkable specs. All other costing appears to be unused.

courtneyholcomb · 2023-10-11T00:43:05Z

@courtneyholcomb It would be preferable to encapsulate all costing using a centralized class, but that might be a more involved fix. If this is blocking you, we could merge this and do a follow up later.

@plypaul _sort_by_suitability is the only costing function that's used currently, and it is actually on DataFlowPlanBuilder instead of a costing class.
With that in mind, maybe we merge this as-is and then put up a separate task to remove all other costing code?

tlento · 2023-10-11T01:07:38Z

@tlento we do use _sort_by_suitability, but only to determine which node can satisfy the most linkable specs. All other costing appears to be unused.

I should learn to read, missed the tuple return in there.....

plypaul · 2023-10-11T22:13:33Z

@plypaul at the moment we don't do any costing at all. I say we just remove _sort_by_suitability and all of the DataflowPlanCost stuff and drop the pretense. If we find a use for a more involved centralized cost computation setup we can add one then.

Yeah, fine to remove as well actually. The original use case was for the internal caching implementation, but that's no longer relevant.

cla-bot bot added the cla:yes label Oct 10, 2023

courtneyholcomb force-pushed the court/source-node-sort branch from 87f82b8 to ea5072c Compare October 10, 2023 03:08

Changelog

06d08ce

courtneyholcomb force-pushed the court/source-node-sort branch from cc64a47 to 06d08ce Compare October 10, 2023 03:20

courtneyholcomb added 2 commits October 9, 2023 20:21

Bug fix: prioritize nodes based on evaluation cost

c274045

Stop evaluating nodees if you find one with 0 joins

34f4d4f

courtneyholcomb requested review from tlento and plypaul October 10, 2023 03:36

courtneyholcomb marked this pull request as ready for review October 10, 2023 03:37

tlento approved these changes Oct 10, 2023

View reviewed changes

courtneyholcomb merged commit df0ae70 into main Oct 11, 2023
9 checks passed

courtneyholcomb deleted the court/source-node-sort branch October 11, 2023 00:44

This was referenced Oct 30, 2023

Delete unused costing code #826

Closed

Remove unused costing code #829

Merged

sarbmeetka pushed a commit to sarbmeetka/metricflow that referenced this pull request Nov 13, 2023

Bug fix: source node evaluation (dbt-labs#801)

f97e5b7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fix: source node evaluation #801

Bug fix: source node evaluation #801

courtneyholcomb commented Oct 10, 2023 •

edited

Loading

github-actions bot commented Oct 10, 2023

tlento left a comment •

edited

Loading

tlento Oct 10, 2023

tlento Oct 10, 2023

plypaul commented Oct 10, 2023

tlento commented Oct 11, 2023

courtneyholcomb commented Oct 11, 2023

courtneyholcomb commented Oct 11, 2023 •

edited

Loading

tlento commented Oct 11, 2023

plypaul commented Oct 11, 2023

Bug fix: source node evaluation #801

Bug fix: source node evaluation #801

Conversation

courtneyholcomb commented Oct 10, 2023 • edited Loading

Description

github-actions bot commented Oct 10, 2023

tlento left a comment • edited Loading

Choose a reason for hiding this comment

tlento Oct 10, 2023

Choose a reason for hiding this comment

tlento Oct 10, 2023

Choose a reason for hiding this comment

plypaul commented Oct 10, 2023

tlento commented Oct 11, 2023

courtneyholcomb commented Oct 11, 2023

courtneyholcomb commented Oct 11, 2023 • edited Loading

tlento commented Oct 11, 2023

plypaul commented Oct 11, 2023

courtneyholcomb commented Oct 10, 2023 •

edited

Loading

tlento left a comment •

edited

Loading

courtneyholcomb commented Oct 11, 2023 •

edited

Loading