[CT-1197] [Bug] Schema Files not parsed for models at the right time #5869

emmyoop · 2022-09-16T15:59:14Z

Is this a new bug in dbt-core?

I believe this is a new bug in dbt-core
I have searched the existing issues, and I could not find an existing issue for this bug

Current Behavior

When we parse a model node, any configs set in the schema yaml file do not get incorporated until after the fact.

Specifically all nodes are considered enabled (if only disabled in the schema yaml file) while we parse through the sql files until we parse the schema file. So they end up as nodes in the manifest to be processed and are not in the disabled dict.

Specifically the value of node.config.enabled will always be True when the node was set to disabled only in the yaml file. It will probably be incorrectly set to False if it is set to False at the project level and overridden at the yaml level to be True.

dbt-core/core/dbt/parser/base.py

Lines 370 to 374 in f5a94fc

    
           def add_result_node(self, block: FileBlock, node: ManifestNodes): 
        
               if node.config.enabled: 
        
                   self.manifest.add_node(block.file, node) 
        
               else: 
        
                   self.manifest.add_disabled(block.file, node)

There is also a bit of hacky logic as follows to ensure we don't resolve refs on disabled nodes even though the node is in the manifest.

TODO: add link to process_refs in parser/manifest.py after #5868 gets merged

            # if the node is disabled, no need to resolve the refs
            if node.config.enabled:
                _process_refs_for_node(self.manifest, current_project, node)

The above conditional should be removed when this gets resolved since we would no longer be resolving refs for disabled nodes.

Expected Behavior

Account for configs set in schema files.

Steps To Reproduce

A barebones project with 3 models:

-- my_model.sql
select 1 as user

-- my_model_2.sql
select * from {{ ref('my_model') }}

-- my_model_3.sql
select * from {{ ref('my_model_2') }}

A schema.yml like so:

version: 2
models:
  - name: my_model
  - name: my_model_2
    config:
      enabled: false
  - name: my_model_3
    config:
      enabled: false

run dbt run and examine manifest.nodes and manifest.disabled

Additional Context

#5868 fixed the bugs users are experiencing, this is to fix the underlying issue

The text was updated successfully, but these errors were encountered:

jtcohen6 · 2022-09-16T17:42:35Z

@emmyoop Thanks for the thorough writeup!

The fundamental issue in the parsing order that you've identified here feels related to #4000 as well. We instantiate snapshot nodes after parsing the .sql file, perform dataclass validation, check for specific configs, if they're not supplied we raise an error, before actually parsing the .yml file that might contain those configs.

emmyoop · 2022-10-07T14:34:04Z

This really comes down to the order we have to process the files. "Fixing" this really means reworking parsing order, possibly saving files to parse over again? Not sure if it would really decrease complexity because we re-process the files.

emmyoop added bug Something isn't working triage labels Sep 16, 2022

github-actions bot changed the title ~~[Bug] Schema Files not parsed for models~~ [CT-1197] [Bug] Schema Files not parsed for models Sep 16, 2022

emmyoop changed the title ~~[CT-1197] [Bug] Schema Files not parsed for models~~ [CT-1197] [Bug] Schema Files not parsed for models in time Sep 16, 2022

emmyoop changed the title ~~[CT-1197] [Bug] Schema Files not parsed for models in time~~ [CT-1197] [Bug] Schema Files not parsed for models at the right time Sep 16, 2022

emmyoop mentioned this issue Sep 16, 2022

Disabled Models in schema files #5868

Merged

6 tasks

jtcohen6 mentioned this issue Sep 16, 2022

Snapshot config doesn't work in schema.yml #4000

Closed

jtcohen6 added Team:Language and removed triage labels Sep 16, 2022

leahwicz added the tech_debt Behind-the-scenes changes, with little direct impact on end-user functionality label Sep 22, 2022

jtcohen6 removed the Team:Language label Jul 19, 2023

gshank closed this as completed Aug 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CT-1197] [Bug] Schema Files not parsed for models at the right time #5869

[CT-1197] [Bug] Schema Files not parsed for models at the right time #5869

emmyoop commented Sep 16, 2022 •

edited

Loading

jtcohen6 commented Sep 16, 2022

emmyoop commented Oct 7, 2022

[CT-1197] [Bug] Schema Files not parsed for models at the right time #5869

[CT-1197] [Bug] Schema Files not parsed for models at the right time #5869

Comments

emmyoop commented Sep 16, 2022 • edited Loading

Is this a new bug in dbt-core?

Current Behavior

Expected Behavior

Steps To Reproduce

Additional Context

jtcohen6 commented Sep 16, 2022

emmyoop commented Oct 7, 2022

emmyoop commented Sep 16, 2022 •

edited

Loading