
Ele 1453 implement metrics collection #447

Merged
merged 30 commits into master from ele-1453-implement-metrics-collection
Aug 6, 2023

Conversation

elongl
Member

@elongl elongl commented Aug 1, 2023

No description provided.

@linear

linear bot commented Aug 1, 2023

ELE-1453 Implement metrics collection

Definition of done:

@github-actions
Contributor

github-actions bot commented Aug 1, 2023

👋 @elongl
Thank you for raising your pull request.
Please make sure to add tests and document all user-facing changes.
You can do this by editing the docs files in the elementary repository.

@elongl elongl marked this pull request as draft August 2, 2023 12:09
@@ -0,0 +1,29 @@
{% macro query_table_metrics() %}
{% set query %}
select count(*) as row_count
Member Author

@elongl elongl Aug 2, 2023


The freshness metrics (model run timestamp) can be calculated from this metric.
Should we add an empty freshness or build metric for clarity?

Collaborator


A couple of thoughts about this:

  1. Yes, I definitely think it's good to have an explicit row for clarity :)
  2. When we move the test to rely on the metrics, we'll need to actually compute the freshness metric (as a diff between timestamps) - but maybe that's OK. For the recommendation, anyway, that's enough.
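The "diff between timestamps" computation mentioned above could look roughly like this (a sketch only: the table and column names are illustrative assumptions, and datediff syntax varies by warehouse):

```sql
-- Illustrative sketch: freshness as the gap between "now" and the last
-- recorded run timestamp. Names here are assumptions, not the PR's schema.
select
  datediff('second', max(generated_at), current_timestamp) as freshness_seconds
from data_monitoring_metrics
where metric_name = 'row_count'
```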


macros/edr/tests/on_run_end/insert_metrics.sql (outdated; resolved)
macros/edr/materializations/model/metrics.sql (resolved)
@@ -0,0 +1,29 @@
{% macro query_table_metrics() %}
Collaborator


It may be nice to have a mapping of metric -> SQL so this macro won't become bloated (I think this is part of what makes the existing metric-collection macros hard to read).
But I'm fine with deferring until there are more metrics here.
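A minimal sketch of the metric -> SQL mapping idea (the macro name `get_table_metric_exprs` is hypothetical, not part of this PR):

```jinja
{% macro get_table_metric_exprs() %}
  {# Hypothetical registry: metric name -> SQL expression #}
  {% do return({"row_count": "count(*)"}) %}
{% endmacro %}

{% macro query_table_metrics() %}
  {% set select_items = [] %}
  {% for metric_name, expr in elementary.get_table_metric_exprs().items() %}
    {% do select_items.append(expr ~ " as " ~ metric_name) %}
  {% endfor %}
  {% set query %}
    select {{ select_items | join(", ") }}
    from {{ this }}
  {% endset %}
  ...
{% endmacro %}
```

New metrics would then only touch the registry, keeping the query-building macro unchanged.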

Collaborator


Also:

  • Should this file perhaps be somewhere else rather than under "materializations"?
  • Related to the previous comment, but do we actually want to reuse table_monitoring_query? Not sure if we do, but it is going to create some duplication otherwise.

Member Author


I prefer the duplication for now; table_monitoring_query is extremely complex in my opinion.
This version is simple and straightforward, and I'd rather keep it that way in spite of the duplication.

Regarding the first point, maybe I don't fully understand what you meant, but I don't find:

{% macro get_row_count_metric_expr() %}
count(*) as row_count
{% endmacro %}


{% macro query_table_metrics() %}
  {% set query %}
    select
    {{ elementary.get_row_count_metric_expr() }}
    from {{ this }}
  {% endset %}
...

More readable than:

{% macro query_table_metrics() %}
  {% set query %}
    select
      count(*) as row_count
    from {{ this }}
  {% endset %}
...

I like how all the metrics are centralized in one clear query.
Though if the expressions were more complicated than count(*), then I could understand.

{% macro insert_metrics() %}
{% set metrics = elementary.get_cache("tables").get("metrics") %}
{% set database_name, schema_name = elementary.get_package_database_and_schema() %}
{%- set target_relation = adapter.get_relation(database=database_name, schema=schema_name, identifier='data_monitoring_metrics') -%}
Collaborator


Could writing to data_monitoring_metrics somehow affect existing tests?
I think it should be fine, but just verifying.

Member Author


Those metrics aren't being used in the tests for now because they don't have metric properties.
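For context, the kind of row the on-run-end insert would write into data_monitoring_metrics might look like this (purely illustrative: the actual table schema and values are not shown in this PR):

```sql
-- Illustrative only: database, column names, and values are assumptions.
insert into analytics.elementary.data_monitoring_metrics
  (full_table_name, metric_name, metric_value, updated_at)
values
  ('analytics.prod.orders', 'row_count', 1024, current_timestamp);
```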

@elongl elongl marked this pull request as ready for review August 3, 2023 14:19
@elongl elongl merged commit a6a8401 into master Aug 6, 2023
@elongl elongl deleted the ele-1453-implement-metrics-collection branch August 6, 2023 07:08