Add root mean squared scaled error to metrics #2031
Conversation
It looks interesting, but wouldn't it be simpler to just add an argument to mse? Or at least reuse it in rmsse?
Thanks for having a look @madtoinou. I think you are right about reusing rmse() in rmsse(), and I changed that in the PR. Adding an argument to mse() would make mse() more complicated than it needs to be. I think the way rmsse() is now included is in line with how mase is included, as well as with how rmse() relates to mse(). What do you think?
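For concreteness, a minimal numpy sketch of that reuse (not the actual darts implementation with its decorators and TimeSeries handling):

import numpy as np

def rmse(y_true, y_pred):
    return np.sqrt(np.mean((np.asarray(y_true) - np.asarray(y_pred)) ** 2))

def rmsse(y_true, y_pred, y_insample, m=1):
    y_insample = np.asarray(y_insample)
    # scale by the in-sample RMSE of the (seasonal) naive forecast
    # (the darts version additionally guards against a scale close to 0)
    return rmse(y_true, y_pred) / rmse(y_insample[m:], y_insample[:-m])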
Hi @Beerstabr and thanks for this, looks like a great start 🚀
You can go ahead with the PR!
My main suggestion was to refactor MASE and RMSSE so that they both use the same functions.
Thanks @Beerstabr for this contribution, and sorry for the late review.
I think we can indeed add this to Darts (keeping in mind that RMSSE also has its cons, e.g. when the naive score is bad).
As @madtoinou mentioned, we should try to simplify this a bit.
It uses similar logic to MASE; can we refactor it so that both functions make use of the same logic?
@dennisbader I refactored the code so that mase and rmsse are wrapped similarly to the other metrics. It might be possible to refactor further, for example by combining multi_ts_support() and multi_ts_support_insample(), but I'm not sure that is worth the effort. Let me know what you think.
I also just copied the mase tests for rmsse. I will refactor them if you like the other changes I've made so far.
I would also like to add the Scaled Quantile Loss metric, if you agree that it would be a worthwhile addition.
This looks very neat now, great job and thanks @Beerstabr 🚀
Apart from some minor suggestions, I think it makes sense to combine the multi_ts_support for the non-scaled and scaled logic. I left a comment there with a suggestion.
@@ -277,11 +282,9 @@ def test_smape(self):
        self.helper_test_nan(metrics.smape)

    def test_mase(self):
We can parametrize this test with pytest to check both metrics:

@pytest.mark.parametrize("metric", [metrics.mase, metrics.rmsse])
def test_scaled_metrics(self, metric):

Then we can replace metrics.mase with metric and remove the test_rmsse test.
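For concreteness, a hedged sketch of what the combined test could look like (the series construction is illustrative rather than the actual darts test fixtures, and it assumes rmsse shares mase's signature):

import numpy as np
import pandas as pd
import pytest
from darts import TimeSeries
import darts.metrics as metrics


class TestScaledMetrics:
    @pytest.mark.parametrize("metric", [metrics.mase, metrics.rmsse])
    def test_scaled_metrics(self, metric):
        # in-sample (training) series followed immediately by the evaluated part
        insample = TimeSeries.from_times_and_values(
            pd.RangeIndex(0, 20), np.arange(20, dtype=float)
        )
        actual = TimeSeries.from_times_and_values(
            pd.RangeIndex(20, 30), np.arange(20, 30, dtype=float)
        )
        # a perfect forecast should give a score of 0 for both scaled metrics
        assert np.isclose(metric(actual, actual, insample, m=1), 0.0)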
Can we add a check for MASE and RMSSE that verifies an expected metric score for a prediction that is not equal to the target (e.g. MASE/RMSSE != 0)?
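A hedged sketch of such a check (illustrative series built with darts' linear_timeseries utility; again assuming rmsse mirrors mase's signature):

import darts.metrics as metrics
from darts.utils.timeseries_generation import linear_timeseries

series = linear_timeseries(length=30)
insample, actual = series[:20], series[20:]
pred = actual + 1.0  # deliberately off-target forecast

for metric in (metrics.mase, metrics.rmsse):
    # an imperfect forecast must give a strictly positive score
    assert metric(actual, pred, insample, m=1) > 0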
@@ -107,6 +107,104 @@ def wrapper_multi_ts_support(*args, **kwargs):
    return wrapper_multi_ts_support


def multi_ts_support_insample(func):
Nice job with refactoring the scaled logic!
It would indeed be nice to have the multi ts support handled by the same wrapper, as only the insample handling differs between the two. You could check, for example, whether "insample" is part of func's signature to know which logic to use.
What do you think?
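For illustration, a minimal self-contained sketch of that dispatch idea with toy numpy metrics (the simplified multi_ts_support and the toy metrics are assumptions, not the actual darts wrapper or its TimeSeries handling):

from functools import wraps
from inspect import signature
import numpy as np

def multi_ts_support(func):
    # decide once, at decoration time, whether the wrapped metric is scaled
    needs_insample = "insample" in signature(func).parameters

    @wraps(func)
    def wrapper(actual, pred, insample=None, **kwargs):
        actual, pred = np.atleast_2d(actual), np.atleast_2d(pred)
        if needs_insample:
            insample = np.atleast_2d(insample)
            return np.mean(
                [func(a, p, i, **kwargs) for a, p, i in zip(actual, pred, insample)]
            )
        return np.mean([func(a, p, **kwargs) for a, p in zip(actual, pred)])

    return wrapper

@multi_ts_support
def toy_rmse(actual, pred):
    return np.sqrt(np.mean((actual - pred) ** 2))

@multi_ts_support
def toy_rmsse(actual, pred, insample, m=1):
    scale = np.mean((insample[m:] - insample[:-m]) ** 2)
    return np.sqrt(np.mean((actual - pred) ** 2) / scale)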
        elif len(args) > 3:
            m = args[3]
        else:
            m = 1
We should use the func's default parameters here and for intersect:

-            m = 1
+            m = signature(func).parameters["m"].default
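For illustration only, how inspect.signature exposes a parameter's default (a hypothetical metric signature, not the exact darts one):

from inspect import signature

def scaled_metric(y_true, y_hat, x_t, m=1, intersect=True):
    ...

print(signature(scaled_metric).parameters["m"].default)          # 1
print(signature(scaled_metric).parameters["intersect"].default)  # True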
            )
            m = 1

        value_list.append(func(y_true, y_hat, x_t, m, *args[3:], **kwargs))
If m is passed in args, then this will give it twice to func (once through m, and another time through *args[3:]). Do we have to exclude it from args when we retrieve it at the top?
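A toy illustration of the issue and one possible fix (hypothetical signature and wrapper, not the darts code):

def metric(y_true, y_hat, x_t, m=1, intersect=True):
    return m, intersect

def wrapper_buggy(*args, **kwargs):
    m = args[3] if len(args) > 3 else 1
    # m is forwarded explicitly and then again via *args[3:]
    return metric(args[0], args[1], args[2], m, *args[3:], **kwargs)

def wrapper_fixed(*args, **kwargs):
    m = args[3] if len(args) > 3 else 1
    # slice past m so it is only forwarded once
    return metric(args[0], args[1], args[2], m, *args[4:], **kwargs)

print(wrapper_buggy([1], [1], [1], 7))  # (7, 7)  -- intersect silently receives m
print(wrapper_fixed([1], [1], [1], 7))  # (7, True)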
        raise_if_not(
            not np.isclose(scale, 0),
Below might be more interpretable:

-        raise_if_not(
-            not np.isclose(scale, 0),
+        raise_if(
+            np.isclose(scale, 0),
        raise_if_not(
            not np.isclose(scale, 0),
Same suggestion here:

-        raise_if_not(
-            not np.isclose(scale, 0),
+        raise_if(
+            np.isclose(scale, 0),
Hi @Beerstabr, just wanted to check in on whether you're still working on this PR? :)
Added the root mean squared scaled error (RMSSE), as used during the M5 competition, to the metrics.
See also this link for an explanation.
Is this something you are interested in? If so, I'll add some tests.
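For reference, the usual definition, with seasonal period m (the M5 competition uses m = 1), n the length of the in-sample series and h the forecast horizon:

\mathrm{RMSSE} = \sqrt{\frac{\frac{1}{h}\sum_{t=n+1}^{n+h}\left(y_t - \hat{y}_t\right)^2}{\frac{1}{n-m}\sum_{t=m+1}^{n}\left(y_t - y_{t-m}\right)^2}}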