Add `DateutilTimePeriodAdjuster` #1233

plypaul · 2024-05-30T01:24:09Z

Description

This adds an implementation for adjusting time periods using dateutil instead
of pandas.

tlento

Nice, thanks! nitty nitty inline but the only really important thing is to sync up with @courtneyholcomb about the sub-daily granularity transition and whether exceptions are preferable to silent date assumptions.

tlento · 2024-05-30T23:00:46Z

metricflow-semantics/metricflow_semantics/time/dateutil_adjuster.py

+        if time_granularity is TimeGranularity.DAY:
+            return date_to_adjust


This assumes all input has a minimum granularity of daily and silently returns whatever is passed in. The Pandas implementation raises an exception in this case, which might be preferable for testing scenarios.

I'm fine with either approach since we're going to update this within the next couple of weeks, but @courtneyholcomb is going to be doing the work on opening up granularity so maybe check in with her about what she prefers before merging.

Pinged her, but preferring to fix later if there are issues to unblock merging of the stack.

tlento · 2024-05-30T23:04:44Z

metricflow-semantics/metricflow_semantics/time/dateutil_adjuster.py

+        elif time_granularity is TimeGranularity.WEEK:
+            return date_to_adjust + relativedelta(weekday=dateutil.relativedelta.MO(-1))
+        elif time_granularity is TimeGranularity.MONTH:
+            return date_to_adjust + relativedelta(day=1)


This API.... TIL day=1 and days=1 are vastly different things.

Oh well, it works, just read very carefully is all.....

Improved documentation and more examples of that API would have helped - had to do play around with the methods in the REPL.

tlento · 2024-05-30T23:06:28Z

metricflow-semantics/metricflow_semantics/time/dateutil_adjuster.py

+        elif time_granularity is TimeGranularity.WEEK:
+            return date_to_adjust + relativedelta(weekday=dateutil.relativedelta.SU(1))
+        elif time_granularity is TimeGranularity.MONTH:
+            return date_to_adjust + relativedelta(day=31)


Thanks for putting that docstring note on the class, I would've wondered about this.

tlento · 2024-05-30T23:09:21Z

metricflow-semantics/tests_metricflow_semantics/time/test_time_adjuster.py

+def date_times_to_check() -> Sequence[datetime.datetime]:  # noqa: D103
+    date_times = []
+    # Cover regular and leap years.
+    start_date_time = datetime.datetime(year=2020, month=1, day=1)


Should we check the year 2000, which looks like a leap year and is? And also 1900, which looks like a leap year, but isn't?

I hate date/time operations.....

Both 1900 and 2000 work, but 1900 exceeds the limit of supported times, so that was left as a comment.

tlento · 2024-05-30T23:13:33Z

metricflow-semantics/tests_metricflow_semantics/time/test_time_adjuster.py

+        ), f"Expansion mismatch: {pandas_adjuster_result=} {dateutil_adjuster_result=} {time_granularity=}"
+        finished_count += 1
+        if finished_count % 100000 == 0 or finished_count == test_case_count:
+            logger.info(f"Progress {finished_count / test_case_count * 100:.0f}%")


How many INFO rows does this print? If it's a lot we might want to put it in debug just so it's not filling up CI logs, but I assume it' snot that many since it's one every 100k.

tlento · 2024-05-30T23:14:54Z

metricflow-semantics/tests_metricflow_semantics/time/test_time_adjuster.py

+        for time_granularity in TimeGranularity
+        for end_time in (
+            start_time + datetime.timedelta(days=day_offset)
+            for day_offset in range(grain_to_count_in_year[time_granularity] + 2)


+2? Why plus two? To guarantee a spillover across the year boundary?

Yeah, added a comment about it.

tlento · 2024-05-30T23:19:16Z

...ts/test_time_adjuster.py/str/test_start_and_end_periods__start_and_end_of_period_results.txt

@@ -0,0 +1,3652 @@
+                     Date                 Period Grain         Period Start         Period End


This looks like a leftover from an earlier version of the test. Remove?

This adds an implementation for adjusting time periods using `dateutil` instead of `pandas`. Later, `DateutilTimePeriodAdjuster` will replace the `pandas` implementation.

…ons.

plypaul added the Skip Changelog label May 30, 2024

cla-bot bot added the cla:yes label May 30, 2024

plypaul force-pushed the p--py312--01 branch from 583ec19 to d6bdd8e Compare May 30, 2024 01:32

plypaul force-pushed the p--py312--02 branch from b250af2 to de09dfc Compare May 30, 2024 01:32

plypaul force-pushed the p--py312--01 branch from d6bdd8e to c9216fa Compare May 30, 2024 01:42

plypaul force-pushed the p--py312--02 branch from de09dfc to 6d8c9c1 Compare May 30, 2024 01:42

plypaul marked this pull request as ready for review May 30, 2024 01:51

tlento approved these changes May 30, 2024

View reviewed changes

plypaul force-pushed the p--py312--02 branch 2 times, most recently from fa6d0a0 to 2f50de9 Compare May 31, 2024 01:09

Base automatically changed from p--py312--01 to main May 31, 2024 01:12

plypaul force-pushed the p--py312--02 branch from 2f50de9 to 0fbe07a Compare May 31, 2024 01:20

/* PR_START p--py312 02 */ Add DateutilTimePeriodAdjuster.

cfb3f52

This adds an implementation for adjusting time periods using `dateutil` instead of `pandas`. Later, `DateutilTimePeriodAdjuster` will replace the `pandas` implementation.

plypaul force-pushed the p--py312--02 branch from 0fbe07a to b1e19e4 Compare May 31, 2024 01:24

plypaul added 3 commits May 30, 2024 18:26

Add tests that compare the output of the time adjustment implemetnati…

f8d699e

…ons.

Update snapshots.

020b169

Migrate existing use cases to DateutilTimePeriodAdjuster.

7d872da

plypaul force-pushed the p--py312--02 branch from b1e19e4 to 7d872da Compare May 31, 2024 01:26

plypaul merged commit 4bbccce into main May 31, 2024
15 checks passed

plypaul deleted the p--py312--02 branch May 31, 2024 01:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `DateutilTimePeriodAdjuster` #1233

Add `DateutilTimePeriodAdjuster` #1233

plypaul commented May 30, 2024 •

edited

Loading

tlento left a comment

tlento May 30, 2024

plypaul May 31, 2024

tlento May 30, 2024

plypaul May 31, 2024

tlento May 30, 2024

tlento May 30, 2024

plypaul May 31, 2024

tlento May 30, 2024

plypaul May 31, 2024

tlento May 30, 2024

plypaul May 31, 2024

tlento May 30, 2024

plypaul May 31, 2024

		if time_granularity is TimeGranularity.DAY:
		return date_to_adjust

Add DateutilTimePeriodAdjuster #1233

Add DateutilTimePeriodAdjuster #1233

Conversation

plypaul commented May 30, 2024 • edited Loading

Description

tlento left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Add `DateutilTimePeriodAdjuster` #1233

Add `DateutilTimePeriodAdjuster` #1233

plypaul commented May 30, 2024 •

edited

Loading