Precompute / Cache Outputs for Nodes in SourceNodeSet
#1030
LGTM!
@@ -294,8 +295,10 @@ def explain_get_dimension_values(  # noqa: D
 """Returns the SQL query for get_dimension_values.

 Args:
-metric_name: Names of metrics that contain the group_by.
+metric_names: Names of metrics that contain the group_by.
+metrics: Similar to `metric_names`, but specified via parameter objects.,
nit: remove comma at the end of this line
Removed.
Description
When building a `DataflowPlan`, nodes from `SourceNodeSet` are used as common building blocks for different queries using the same semantic manifest. The output for a `DataflowPlanNode` is computed lazily and then cached, since the same output is required many times for the same nodes (e.g. between queries, and between common join candidates). The output contains generated IDs, so precomputing the output for nodes from `SourceNodeSet` will have the following effects: `DataflowPlan` should be a little more consistent between queries at the expense of initialization time. `DataflowPlanBuilder`.
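
The trade-off described above can be illustrated with a minimal sketch. It is not the code in this PR: `IdGenerator`, `NodeOutputResolver`, `precompute`, and the node names are all hypothetical stand-ins, and a node's "output" is reduced to a single generated ID string. Under those assumptions, it shows why lazily cached outputs can carry different IDs depending on which query touches a node first, while warming the cache in a fixed order up front makes them consistent.

```python
import itertools
from typing import Dict, Iterable, List


class IdGenerator:
    """Hypothetical stand-in for whatever generates the IDs in node outputs."""

    def __init__(self) -> None:
        self._counter = itertools.count()

    def next_id(self, prefix: str) -> str:
        return f"{prefix}_{next(self._counter)}"


class NodeOutputResolver:
    """Hypothetical resolver that computes a node's output lazily and caches it."""

    def __init__(self, id_generator: IdGenerator) -> None:
        self._id_generator = id_generator
        self._cache: Dict[str, str] = {}

    def get_output(self, node_name: str) -> str:
        # Compute on first request only; later requests reuse the cached output.
        if node_name not in self._cache:
            self._cache[node_name] = self._id_generator.next_id(node_name)
        return self._cache[node_name]

    def precompute(self, node_names: Iterable[str]) -> None:
        # Warm the cache in a fixed order so IDs do not depend on query order.
        for node_name in node_names:
            self.get_output(node_name)


# Assumed source nodes; the real set comes from the semantic manifest.
SOURCE_NODES = ["bookings", "listings", "users"]


def ids_seen_by_query(query_nodes: List[str], precompute: bool) -> Dict[str, str]:
    """Build a fresh resolver and return the IDs a query would observe."""
    resolver = NodeOutputResolver(IdGenerator())
    if precompute:
        resolver.precompute(SOURCE_NODES)
    return {name: resolver.get_output(name) for name in query_nodes}


lazy_a = ids_seen_by_query(["users", "bookings"], precompute=False)
lazy_b = ids_seen_by_query(["bookings", "users"], precompute=False)
eager_a = ids_seen_by_query(["users", "bookings"], precompute=True)
eager_b = ids_seen_by_query(["bookings", "users"], precompute=True)
```

In this sketch `lazy_a` and `lazy_b` disagree on the IDs for the same nodes, whereas `eager_a` and `eager_b` match. The cost is that `precompute` does work for every source node at construction time, including nodes (here `listings`) that a given query never uses.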