
Varying calibration / prediction results #22

Closed
GitHunter0 opened this issue Mar 17, 2021 · 4 comments

@GitHunter0

Every calibration / prediction run for the DeepAR model gives a slightly different result. In the code below, forecast1 and forecast2 are never exactly the same.

library(modeltime.gluonts)
library(tidymodels)
library(tidyverse)

# The splits were not shown in the original report; they are assumed to be
# created along these lines (24-month assessment to match prediction_length):
m750_splits <- timetk::time_series_split(m750, assess = "2 years", cumulative = TRUE)

model_fit <- modeltime.gluonts::deep_ar(
    id                    = "id",
    freq                  = "M",
    prediction_length     = 24,
    lookback_length       = 36,
    epochs                = 10,
    num_batches_per_epoch = 50,
    learn_rate            = 0.001,
    num_layers            = 2,
    dropout               = 0.10
  ) %>%
  set_engine("gluonts_deepar") %>%
  fit(formula = value ~ ., data = training(m750_splits))
  
forecast1 <-
  modeltime_table(model_fit) %>%
  modeltime_forecast(new_data    = testing(m750_splits),
                     actual_data = m750) %>%
  filter(.key == "prediction") %>%
  select(.index, .value)

forecast2 <-
  modeltime_table(model_fit) %>%
  modeltime_forecast(new_data    = testing(m750_splits),
                     actual_data = m750) %>%
  filter(.key == "prediction") %>%
  select(.index, .value)
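
A quick way to confirm the two runs differ (a small check added for illustration; not part of the original report):

# Both runs share the same fitted model, yet the point forecasts differ slightly
all.equal(forecast1$.value, forecast2$.value)
max(abs(forecast1$.value - forecast2$.value))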

I was aware that deep learning model estimates cannot be exactly replicated, but I did not know the forecast itself would vary each time too, even with the same parameters and estimates. Do you know the reason?
A disclaimer about this in the documentation might be useful for non-experts in deep learning like myself.

Thanks again

I saw that you just launched modeltime.h2o, very cool stuff, I will explore it now.

@mdancho84
Contributor

mdancho84 commented Mar 17, 2021

This variation is actually a feature of GluonTS. But there is a solution: set the MXNet seed (see below).

Why is variation a feature?

Because GluonTS bills itself as "Probabilistic Forecasting Software". The specifics vary by algorithm, but for DeepAR you get a probabilistic forecast.

What's happening under the hood that makes it probabilistic?

We are actually making a mean prediction from the many sample paths that DeepAR generates. Modeltime only uses the mean, but if you forecast with GluonTS DeepAR in Python you can also get quantiles around the forecast.

We don't use that feature since it doesn't fit into the Modeltime framework (currently we use quantiles around a calibration forecast; refer to modeltime_calibrate()), but there may be benefits to exposing extras from the GluonTS model, like the quantiles it can return.

That's for another day... for now, just recognize that the variance in your forecast is a result of how DeepAR forecasts and not an error.
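
To make that concrete, here is a toy R sketch (an illustration of the idea, not the GluonTS internals): the point forecast is the mean over many simulated sample paths, and quantiles of those same paths give the prediction interval.

set.seed(123)

horizon <- 24   # forecast horizon (e.g. 24 months)
n_paths <- 100  # number of Monte Carlo sample paths

# Each row stands in for one simulated future path from the model
paths <- matrix(rnorm(n_paths * horizon, mean = 10, sd = 2),
                nrow = n_paths, ncol = horizon)

point_forecast <- colMeans(paths)                    # the mean that Modeltime returns
lower_80 <- apply(paths, 2, quantile, probs = 0.1)   # quantiles GluonTS can expose
upper_80 <- apply(paths, 2, quantile, probs = 0.9)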

Solution

GluonTS internally uses the random number generator from the MXNet library. We can set its seed via reticulate (note that seed() is a function call, not a field to assign):

mxnet <- reticulate::import("mxnet")
mxnet$random$seed(123L)
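
Since DeepAR draws fresh sample paths at prediction time, the seed likely needs to be re-set immediately before each forecast call, not just once after import. A minimal sketch, assuming the MXNet seed alone governs GluonTS's sampling:

mxnet <- reticulate::import("mxnet")

# Re-seed right before each forecast so both draws start from the same RNG state
mxnet$random$seed(123L)
forecast1 <- modeltime_table(model_fit) %>%
  modeltime_forecast(new_data = testing(m750_splits), actual_data = m750)

mxnet$random$seed(123L)
forecast2 <- modeltime_table(model_fit) %>%
  modeltime_forecast(new_data = testing(m750_splits), actual_data = m750)

all.equal(forecast1, forecast2)  # TRUE if the seeding assumption holds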

@GitHunter0
Author

Hey Matt, I was surprised to see that even the forecast is probabilistic. I tried the MXNet seed solution, but it still gives different forecasts (forecast1 != forecast2). Since the variation is always small, it is not a big deal. Feel free to close this issue. Thanks, man.

@mdancho84
Contributor

OK, that's odd. I would have thought the MXNet random seed would solve it. I'll close this for now, but I can reopen it if something is amiss.

@mesdi

mesdi commented Sep 23, 2023

I think the inability to retrieve reproducible results will pose a significant obstacle to using this package.
