[ADAP-900] [Bug] Nested loop in query planner execution of relations.sql #609

slin30 · 2023-09-17T15:20:57Z

Is this a new bug in dbt-redshift?

I believe this is a new bug in dbt-redshift
I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

When running the statement in relations.sql, our Redshift cluster query planner will, seemingly at random, expect to use a nested loop join. This reflects in extended execution times, adding anywhere from 1-6 minutes to the start of a run/step.

Expected Behavior

Under normal circumstances, this query takes almost no time to run.

Steps To Reproduce

This appears to be a Redshift query planner decision, and I unfortuantely have no idea how to trigger it, nor how to resolve. I do have some thoughts on what might be causing it and a potential tweak to push the planner towards a more efficient path under more conditions (see additional context section)

Relevant log output

##### TYPICAL RUN #####
2023-09-17 01:02:56.585525 (MainThread): 01:02:56  Using redshift connection "master"
2023-09-17 01:02:56.586693 (MainThread): 01:02:56  On master: BEGIN
2023-09-17 01:02:56.587958 (MainThread): 01:02:56  Opening a new connection, currently in state init
2023-09-17 01:02:56.589196 (MainThread): 01:02:56  Redshift adapter: Establishing connection using ssl with `sslmode` set to 'prefer'.To connect without ssl, set `sslmode` to 'disable'.
2023-09-17 01:02:56.590227 (MainThread): 01:02:56  Redshift adapter: Connecting to redshift with username/password based auth...
2023-09-17 01:02:56.777927 (MainThread): 01:02:56  SQL status: SUCCESS in 0.0 seconds
2023-09-17 01:02:56.778793 (MainThread): 01:02:56  Using redshift connection "master"
2023-09-17 01:02:56.779460 (MainThread): 01:02:56  On master: /* {"app": "dbt", "dbt_version": "1.6.2", "profile_name": "user", "target_name": "prod", "connection_name": "master"} */
with
    relation as (
        select
            pg_class.oid as relation_id,
            pg_class.relname as relation_name,
            pg_class.relnamespace as schema_id,
            pg_namespace.nspname as schema_name,
            pg_class.relkind as relation_type
        from pg_class
        join pg_namespace
          on pg_class.relnamespace = pg_namespace.oid
        where pg_namespace.nspname != 'information_schema'
          and pg_namespace.nspname not like 'pg\_%'
    ),
    dependency as (
        select distinct
            coalesce(pg_rewrite.ev_class, pg_depend.objid) as dep_relation_id,
            pg_depend.refobjid as ref_relation_id,
            pg_depend.refclassid as ref_class_id
        from pg_depend
        left join pg_rewrite
          on pg_depend.objid = pg_rewrite.oid
    )

select distinct
    dep.schema_name as dependent_schema,
    dep.relation_name as dependent_name,
    ref.schema_name as referenced_schema,
    ref.relation_name as referenced_name
from dependency
join relation ref
    on dependency.ref_relation_id = ref.relation_id
join relation dep
    on dependency.dep_relation_id = dep.relation_id
where ref.relation_name != dep.relation_name
2023-09-17 01:02:57.249035 (MainThread): 01:02:57  SQL status: SUCCESS in 0.0 seconds


##### PROBLEMATIC RUN #####
2023-09-17 14:01:22.689569 (MainThread): 14:01:22  Using redshift connection "master"
2023-09-17 14:01:22.690397 (MainThread): 14:01:22  On master: BEGIN
2023-09-17 14:01:22.690997 (MainThread): 14:01:22  Opening a new connection, currently in state init
2023-09-17 14:01:22.691895 (MainThread): 14:01:22  Redshift adapter: Establishing connection using ssl with `sslmode` set to 'prefer'.To connect without ssl, set `sslmode` to 'disable'.
2023-09-17 14:01:22.692645 (MainThread): 14:01:22  Redshift adapter: Connecting to redshift with username/password based auth...
2023-09-17 14:01:22.927402 (MainThread): 14:01:22  SQL status: SUCCESS in 0.0 seconds
2023-09-17 14:01:22.928290 (MainThread): 14:01:22  Using redshift connection "master"
2023-09-17 14:01:22.929056 (MainThread): 14:01:22  On master: /* {"app": "dbt", "dbt_version": "1.6.2", "profile_name": "user", "target_name": "prod", "connection_name": "master"} */
with
    relation as (
        select
            pg_class.oid as relation_id,
            pg_class.relname as relation_name,
            pg_class.relnamespace as schema_id,
            pg_namespace.nspname as schema_name,
            pg_class.relkind as relation_type
        from pg_class
        join pg_namespace
          on pg_class.relnamespace = pg_namespace.oid
        where pg_namespace.nspname != 'information_schema'
          and pg_namespace.nspname not like 'pg\_%'
    ),
    dependency as (
        select distinct
            coalesce(pg_rewrite.ev_class, pg_depend.objid) as dep_relation_id,
            pg_depend.refobjid as ref_relation_id,
            pg_depend.refclassid as ref_class_id
        from pg_depend
        left join pg_rewrite
          on pg_depend.objid = pg_rewrite.oid
    )

select distinct
    dep.schema_name as dependent_schema,
    dep.relation_name as dependent_name,
    ref.schema_name as referenced_schema,
    ref.relation_name as referenced_name
from dependency
join relation ref
    on dependency.ref_relation_id = ref.relation_id
join relation dep
    on dependency.dep_relation_id = dep.relation_id
where ref.relation_name != dep.relation_name
2023-09-17 14:04:55.615962 (MainThread): 14:04:55  SQL status: SUCCESS in 213.0 seconds

Environment

- OS: Not sure if applicable here -- we see this in dbt cloud for runs. Also see this locally (I am on Windows 11 x64 if that matters)
- Python: Again not sure if applicable -- locally am on Python `3.11.1`
- dbt-core: `1.6.2`
- dbt-redshift: `1.6.1`

Additional Context

This is a fairly difficult anomaly to document because it seems to be related to the Redshift query planner. Our first encounter was about a week ago. The issue lasted for about 14 hours, and then went away. It just manifested again, and this time I was ready to log some of the results as well as test an alternative query I'd prepped in case this recurred.

My best assessment is that the original query is almost always optimized by the query planner to avoid a nested loop, but that edge cases remain where the planner chooses a less optimal path. My approach was to understand the core intent, and then see if I could tweak the statement a bit to hopefully push the query planner towards a more efficient path under a wider range of condtions.

The original logic that threw the planner for a loop (no pun intended) was the WHERE != in the outer statement. Removing this or setting it to =, or pushing it into the select as an = flag, works fine. For the latter, if I subsequently attempt to filter on the flag (with NOT), this predictably ends up with the same nested loop execution plan -- it's the attempt to filter on not equals in the outer select that triggers the nested loop (when the anomaly manifests).

My adjustment simply pushes the evaluation up to the dependency CTE. My thinking was that in doing so, the planner should choose a more efficient path under more conditions.

Because the conditions to test the difference between these two approaches are entirely unclear to me/a black box (i.e. I have no idea how to make the query planner decide to make an inefficient choice on demand), I just made a note to wait until the next time this happened, and if so, to capture the explain as well as the actual results, comparing runtimes and results.

The alternative CTE is simply:

dependency_alt as (
    select distinct
        coalesce(pg_rewrite.ev_class, pg_depend.objid) as dep_relation_id,
        pg_depend.refobjid as ref_relation_id,
        pg_depend.refclassid as ref_class_id
    from pg_depend
    left join pg_rewrite
      on pg_depend.objid = pg_rewrite.oid
    WHERE coalesce(pg_rewrite.ev_class, pg_depend.objid) != pg_depend.refobjid -- why not constrain here?
)

...with the only additional change being removal of the outer statement WHERE (where ref.relation_name != dep.relation_name).

Thus, the restated full statement would be:

with
    relation as (
        select
            pg_class.oid as relation_id,
            pg_class.relname as relation_name,
            pg_class.relnamespace as schema_id,
            pg_namespace.nspname as schema_name,
            pg_class.relkind as relation_type
        from pg_class
        join pg_namespace
          on pg_class.relnamespace = pg_namespace.oid
        where pg_namespace.nspname != 'information_schema'
          and pg_namespace.nspname not like 'pg\_%'
    ),
    dependency as (
        select distinct
            coalesce(pg_rewrite.ev_class, pg_depend.objid) as dep_relation_id,
            pg_depend.refobjid as ref_relation_id,
            pg_depend.refclassid as ref_class_id
        from pg_depend
        left join pg_rewrite
          on pg_depend.objid = pg_rewrite.oid
        WHERE coalesce(pg_rewrite.ev_class, pg_depend.objid) != pg_depend.refobjid -- why not constrain here?
    )    
select DISTINCT
    dep.schema_name as dependent_schema,
    dep.relation_name as dependent_name,
    ref.schema_name as referenced_schema,
    ref.relation_name as referenced_name
from dependency
join relation ref
    on dependency.ref_relation_id = ref.relation_id
join relation dep
    on dependency.dep_relation_id = dep.relation_id;

I've included the orig_query_with_explain.txt and alt_query_with_explain.txt. Note: the FALSE AS is_eq in the outer select of the alt version text file is a testing artifact I forgot to delete -- please ignore.

Finally, I've tested the outputs on our end and can confirm:

The two (original and alt) produce identical results.
The two are effectively identical performance-wise under normal conditions. But when the anomaly arises, the alternative version continues to be effectively instant, while the orignal version takes several minutes.
At the moment, the full scope of results is only a bit over 1000 records -- and roughly 400 of those are not self-references.
I've also documented some informal findings/musings in dbt slack

The text was updated successfully, but these errors were encountered:

dbeatty10 · 2023-09-18T18:58:43Z

Wow, incredible write-up @slin30 🏆

We'd be open to a potential tweak to push the planner towards a more efficient path. As long as it yields the same results either way and the only side effect is more consistent performance, we should be good to go 👍

Would you be interested in raising a PR with your modifications?

The diff with redshift__get_relations() looks simple (after removing comments, lowercasing keywords, and removing the semi-colon):

22a23
>         where coalesce(pg_rewrite.ev_class, pg_depend.objid) != pg_depend.refobjid
35d35
< where ref.relation_name != dep.relation_name

slin30 · 2023-09-18T19:27:59Z

Thanks, @dbeatty10 -- happy to raise a PR. Should be able to do so within 24 hours.

While I don't see any difference in the results, nor do I see any reason (looking at the logic) for there to be a difference, I also can't directly speak to the much wider range of scenarios in the wild. I'm keen to see how this does during testing/QA on your end.

slin30 · 2023-09-22T14:07:26Z

@dbeatty10 While my PR is in the queue, is there a mechanism for us to temporarily override the adapter macro for our projects with the adjusted one I am proposing? Would it be as simple as e.g. creating a macro named relations.sql in /macros and pasting in my suggested logic, preserving the macro name itself as well?

Asking because this has again shown up on our end.

dbeatty10 · 2023-09-22T14:59:29Z

@slin30 Thanks for opening that PR! 🤩

Yes, it would be exactly as simple as you described 👍

i.e., adding this file to your project will make it take priority over the definition in dbt-redshift:

macros/any_folder_structure_you_want/any_file_name_you_want.sql

{% macro redshift__get_relations() -%}

...

{%- endcall -%}

You can see it take effect for yourself by adding a logging statement to both the log file and stdout like this:

{{ log("Using my version of redshift__get_relations", True) }}

slin30 · 2023-09-22T15:21:18Z

Thank you @dbeatty10! That worked perfectly and I learned something new. Will deploy this for our environments as this should give us some real-world evidence to see how this works, and whether we see the anomaly again.

[0m11:13:23.599816 [debug] [MainThread]: On master: /* {"app": "dbt", "dbt_version": "1.6.1", "profile_name": "convertkit", "target_name": "dev", "connection_name": "master"} */
with
    relation as (
        select
            pg_class.oid as relation_id,
            pg_class.relname as relation_name,
            pg_class.relnamespace as schema_id,
            pg_namespace.nspname as schema_name,
            pg_class.relkind as relation_type
        from pg_class
        join pg_namespace
          on pg_class.relnamespace = pg_namespace.oid
        where pg_namespace.nspname != 'information_schema'
          and pg_namespace.nspname not like 'pg\_%'
    ),
    dependency as (
        select distinct
            coalesce(pg_rewrite.ev_class, pg_depend.objid) as dep_relation_id,
            pg_depend.refobjid as ref_relation_id,
            pg_depend.refclassid as ref_class_id
        from pg_depend
        left join pg_rewrite
          on pg_depend.objid = pg_rewrite.oid
        where coalesce(pg_rewrite.ev_class, pg_depend.objid) != pg_depend.refobjid
    )

select distinct
    dep.schema_name as dependent_schema,
    dep.relation_name as dependent_name,
    ref.schema_name as referenced_schema,
    ref.relation_name as referenced_name
from dependency
join relation ref
    on dependency.ref_relation_id = ref.relation_id
join relation dep
    on dependency.dep_relation_id = dep.relation_id
[0m11:13:23.834033 [debug] [MainThread]: SQL status: SUCCESS in 0.0 seconds
[0m11:13:23.837065 [info ] [MainThread]: Using local experimental version of redshift__get_relations

slin30 · 2023-09-22T16:27:25Z

Confirming this works. While I don't have an A/B test, I can at least confirm that the explain for the original version still returns a nested loop warning, so I expect that we'd still have issues with the runtimes if I hadn't pushed the local override. All tests, docs, etc. look as expected.

First time we see the anomaly today:

2023-09-22 08:15:52.849364 (MainThread): 08:15:52  SQL status: SUCCESS in 85.0 seconds

Last run before pushing the local override version:

2023-09-22 15:05:18.065994 (MainThread): 15:05:18  SQL status: SUCCESS in 245.0 seconds

First run after pushing the local override version:

2023-09-22 16:01:25.554237 (MainThread): 16:01:25  SQL status: SUCCESS in 0.0 seconds
2023-09-22 16:01:25.563034 (MainThread): 16:01:25  Using local experimental version of redshift__get_relations

slin30 added bug Something isn't working triage labels Sep 17, 2023

github-actions bot changed the title ~~[Bug] Nested loop in query planner execution of relations.sql~~ [ADAP-900] [Bug] Nested loop in query planner execution of relations.sql Sep 17, 2023

dbeatty10 removed the triage label Sep 18, 2023

slin30 mentioned this issue Sep 18, 2023

Avoid nested loop in query planner execution #612

Merged

4 tasks

colin-rogers-dbt closed this as completed in #612 Sep 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ADAP-900] [Bug] Nested loop in query planner execution of relations.sql #609

[ADAP-900] [Bug] Nested loop in query planner execution of relations.sql #609

slin30 commented Sep 17, 2023 •

edited

Loading

dbeatty10 commented Sep 18, 2023

slin30 commented Sep 18, 2023

slin30 commented Sep 22, 2023

dbeatty10 commented Sep 22, 2023

slin30 commented Sep 22, 2023

slin30 commented Sep 22, 2023 •

edited

Loading

[ADAP-900] [Bug] Nested loop in query planner execution of relations.sql #609

[ADAP-900] [Bug] Nested loop in query planner execution of relations.sql #609

Comments

slin30 commented Sep 17, 2023 • edited Loading

Is this a new bug in dbt-redshift?

Current Behavior

Expected Behavior

Steps To Reproduce

Relevant log output

Environment

Additional Context

dbeatty10 commented Sep 18, 2023

slin30 commented Sep 18, 2023

slin30 commented Sep 22, 2023

dbeatty10 commented Sep 22, 2023

slin30 commented Sep 22, 2023

slin30 commented Sep 22, 2023 • edited Loading

slin30 commented Sep 17, 2023 •

edited

Loading

slin30 commented Sep 22, 2023 •

edited

Loading