-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(): Add usage_reporting template to glean_usage sql_generator #6581
base: main
Are you sure you want to change the base?
Conversation
0b452ba
to
26dcc56
Compare
require_partition_filter: true | ||
clustering: | ||
fields: | ||
- normalized_channel |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@irrationalagent So far it appears most of the values in the new ping for normalized_channel are null, is this expected?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We plan to fix this.
-- client_info.app_build, | ||
normalized_os, | ||
normalized_os_version, | ||
-- client_info.locale, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@irrationalagent all of the client_info.*
fields are missing in the new ping. Are we planning on adding those or should I just delete these from the code?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some of these will be added back, not under client_info but likely as "normal" metrics e.g. string metrics. Specifically, we can expect these fields to be added as metrics:
App_version
App_build
first_run_date
reason
OS
Duration
Is default browser
Distribution_id
Channel
I will follow up about where exactly in the ping we will expect to see those.
normalized_os, | ||
normalized_os_version, | ||
-- client_info.locale, | ||
normalized_country_code, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@irrationalagent Are we ok to include the country code instead of locale
(no longer available due to it coming from the client_info)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
country code should stay, this is not reported in the ping itself but assigned on the backend
CAST(TRUE AS INT64) & CAST(is_active AS INT64) AS days_active_bits, | ||
udf.days_since_created_profile_as_28_bits( | ||
DATE_DIFF(submission_date, first_run_date, DAY) | ||
) AS days_created_profile_bits, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@irrationalagent Is the days_created_profile_bits
field important? Right now it seems first_run_date
is not available in the new ping. Is this intended?
8281806
to
cc26ab9
Compare
This comment has been minimized.
This comment has been minimized.
cc26ab9
to
0a92d91
Compare
This comment has been minimized.
This comment has been minimized.
eecc977
to
ace64de
Compare
Waiting for the missing fields to be added to the usage_reporting ping. |
…ed in the usage_reporting derived tables
ace64de
to
303a4bf
Compare
Integration report for "feat: remove dau_reporting ping as we decided it should not be included in the usage_reporting derived tables"
|
feat(): Add dau_reporting template to glean_usage sql_generator
Description
This is a replacement to to PR##6399
The main difference being a clean git history and all resource being renamed from
dau_reportin_*
tousage_reporting_*
.Also, we still need to account for data sent via the old
dau_reporting
ping. This is handled here via theusage_reporting_clients_daily_v1
template.Original description:
** Note: modified to use the updated naming**
Adding templates to set up ELT for the usage_reporting ping. This is a new minimal ping which will be used specifically for reporting the number of users of our applications. The templates mimic our existing ETL setup for the baseline ping and will result in the following artifacts being generated in BigQuery for each GLEAN app sending us the
usage_reporting
ping:Tables:
[app_name]_derived.usage_reporting_clients_daily_v1
- Keeping a record of each client that sent us the usage_reporting ping and limited selected numbers of associated with them.[app_name]_derived.usage_reporting_clients_first_seen_v1
- Letting us know when we observed a specific client for the first time in our warehouse.[app_name]_derived.usage_reporting_clients_last_seen_v1
- Keeping track of when is the last time we saw each client, and when is the last time they were active, and how long it's been since the profile has been created.Views:
[app_name].usage_reporting_clients_daily
[app_name].usage_reporting_clients_first_seen
[app_name].usage_reporting_clients_last_seen