Data Quality Daily Row Check Mart Model #416

kengodleskidot · 2024-10-04T20:21:40Z

The data model created in this PR can be used to build data quality visualizations and reports. The model aggregates the daily number of detectors for four data models whose counts should all match. Below is an example in which two of the models match but the subsequent models do not, along with a PeMS screenshot for comparison:

I expect the Good and Bad counts to update after the full data refresh. This should help with #413, #397, and #398.

JamesSLogan · 2024-10-07T21:38:52Z

transform/models/marts/quality/quality__row_count_summary.sql

@@ -0,0 +1,67 @@
+{{ config(
+    materialized="table"


This is minor, but this config could be left out, since models in the marts directory are already configured (in dbt_project.yml) to be materialized as tables.

kengodleskidot · 2024-10-07T23:01:22Z

The data model created in this PR can be used to build data quality visualizations and reports. The model aggregates the daily number of detectors for four data models whose counts should all match. Below is an example in which two of the models match but the subsequent models do not, along with a PeMS screenshot for comparison:

I expect the Good and Bad counts to update after the full data refresh. This should help with #413, #397, and #398.

@mmmiah pointed out that we should compare detector counts only for the ML and HV station types, since the imputed and performance data models cover only those station types. The mart data model has been updated to filter on these station types, and the row counts now match. Great catch, @mmmiah!

We will still include the good/bad/total detector count from the detector_status model for QC usage.
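For anyone reading along, here is a rough sketch of the comparison logic described above, written in plain Python rather than the SQL in this PR. It is only an illustration under stated assumptions: the row shape `(sample_date, detector_id, station_type)`, the function names, and the model labels are all hypothetical and are not taken from `quality__row_count_summary.sql`.

```python
from collections import defaultdict

# Station types kept in the comparison, per the comment above.
STATION_TYPES = {"ML", "HV"}


def daily_detector_counts(rows):
    """Count distinct detectors per day, keeping only ML and HV stations.

    `rows` is an iterable of (sample_date, detector_id, station_type)
    tuples. This row shape is an assumption made for illustration.
    """
    seen = defaultdict(set)
    for sample_date, detector_id, station_type in rows:
        if station_type in STATION_TYPES:
            seen[sample_date].add(detector_id)
    return {day: len(ids) for day, ids in seen.items()}


def mismatched_days(counts_by_model):
    """Return the days on which the models disagree on the detector count.

    `counts_by_model` maps a (hypothetical) model label to the dict
    produced by daily_detector_counts. A day missing from a model
    counts as zero detectors for that model.
    """
    all_days = set()
    for counts in counts_by_model.values():
        all_days.update(counts)

    mismatches = {}
    for day in sorted(all_days):
        per_model = {
            name: counts.get(day, 0)
            for name, counts in counts_by_model.items()
        }
        if len(set(per_model.values())) > 1:
            mismatches[day] = per_model
    return mismatches
```

An empty result from `mismatched_days` would correspond to the "row counts that match" outcome reported above; any day that shows up is a candidate for a QC follow-up.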

mmmiah · 2024-10-08T16:43:34Z

transform/models/marts/quality/quality__row_count_summary.sql

@kengodleskidot, why are all the GOOD_STATUS_COUNT values zero? Does that mean all detectors are down?

I am guessing this will change once the model goes through the nightly job in production. Is that the case here?

mmmiah

Please merge! I will use this data from analytical_prd for QC visualization.

kengodleskidot added 2 commits October 4, 2024 19:57

Created a data quality data model that returns daily row counts for d…

2d9ac84

…ata models that should match. This data model can be used for a data quality visualization using kibana or another other tools.

updated column name for clarity

b703f8a

kengodleskidot requested review from ian-r-rose, JamesSLogan, thehanggit and mmmiah October 4, 2024 20:21

kengodleskidot self-assigned this Oct 4, 2024

kengodleskidot added 3 commits October 4, 2024 20:26

updated comments

72e598a

Merge branch 'main' of https://github.com/cagov/caldata-mdsa-caltrans…

b003c16

…-pems into qc_mart_table

created a yml file for the mart quality folder and updated the dbt_pr…

693be6d

…oject.yml file

JamesSLogan reviewed Oct 7, 2024

updated data model to compare HV and ML stations along with all stati…

b628c05

…ons for the detector status model

kengodleskidot marked this pull request as ready for review October 7, 2024 23:32

mmmiah reviewed Oct 8, 2024

mmmiah approved these changes Oct 10, 2024

kengodleskidot merged commit 98fd5c1 into main Oct 10, 2024
3 checks passed
