Skip to content

Univariate Prophet

Bases: BaseProphetForecaster

Univariate Prophetverse forecaster, with support for multiple likelihoods.

Differences to facebook's prophet:

  • logistic trend. Here, another parametrization is considered, and the capacity is not passed as input, but inferred from the data.

  • the users can pass arbitrary sktime transformers as feature_transformer, for instance FourierFeatures or HolidayFeatures.

  • no default weekly_seasonality/yearly_seasonality, this is left to the user via the feature_transformer parameter

  • Uses changepoint_interval instead of n_changepoints to set changepoints.

  • accepts configurations where each exogenous variable has a different function relating it to its additive effect on the time series. One can, for example, set different priors for a group of feature, or use a Hill function to model the effect of a feature.

Parameters:

Name Type Description Default
trend Union[str, BaseEffect], optional, one of "linear" (default) or "logistic"

Type of trend to use. Can also be a custom effect object.

'linear'
changepoint_interval int

Number of potential changepoints to sample in the history.

25
changepoint_range float or int

Proportion of the history in which trend changepoints will be estimated.

  • if float, must be between 0 and 1. The range will be that proportion of the training history.

  • if int, can be positive or negative. Absolute value must be less than number of training points. The range will be that number of points. A negative int indicates number of points counting from the end of the history, a positive int from the beginning.

0.8
changepoint_prior_scale float

Regularization parameter controlling the flexibility of the automatic changepoint selection.

0.001
offset_prior_scale float

Scale parameter for the prior distribution of the offset. The offset is the constant term in the piecewise trend equation.

0.1
capacity_prior_scale float

Scale parameter for the prior distribution of the capacity.

0.2
capacity_prior_loc float

Location parameter for the prior distribution of the capacity.

1.1
feature_transformer sktime transformer, BaseTransformer

Transformer object to generate Fourier terms, holiday or other features. If None, no additional features are used. For multiple features, pass a FeatureUnion object with the transformers.

None
noise_scale float

Scale parameter for the observation noise.

0.05
mcmc_samples int

Number of MCMC samples to draw.

2000
mcmc_warmup int

Number of MCMC warmup steps. Also known as burn-in.

200
mcmc_chains int

Number of MCMC chains to run in parallel.

4
inference_method str, optional, one of "mcmc" or "map"

Inference method to use. Can be "mcmc" or "map".

"map"
optimizer_name str

Name of the numpyro optimizer to use for variational inference.

"Adam"
optimizer_kwargs dict

Additional keyword arguments to pass to the numpyro optimizer.

{}
optimizer_steps int

Number of optimization steps to perform for variational inference.

100_000
exogenous_effects List[AbstractEffect]

A list of prophetverse AbstractEffect objects defining the exogenous effects to be used in the model.

None
likelihood str

Likelihood to use for the model. Can be "normal", "gamma" or "negbinomial".

"normal"
default_effect AbstractEffectm optional, defalut=None

The default effect to be used when no effect is specified for a variable.

None
default_exogenous_prior tuple

Default prior distribution for exogenous effects.

None
rng_key jax.random.PRNGKey or None (default

Random number generator key.

None
Source code in src/prophetverse/sktime/univariate.py
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
class Prophetverse(BaseProphetForecaster):
    """Univariate ``Prophetverse`` forecaster, with support for multiple likelihoods.

    Differences to facebook's prophet:

    * logistic trend. Here, another parametrization is considered,
      and the capacity is not passed as input, but inferred from the data.

    * the users can pass arbitrary ``sktime`` transformers as ``feature_transformer``,
      for instance ``FourierFeatures`` or ``HolidayFeatures``.

    * no default weekly_seasonality/yearly_seasonality, this is left to the user
      via the ``feature_transformer`` parameter

    * Uses ``changepoint_interval`` instead of ``n_changepoints`` to set changepoints.

    * accepts configurations where each exogenous variable has a different function
      relating it to its additive effect on the time series.
      One can, for example, set different priors for a group of feature,
      or use a Hill function to model the effect of a feature.

    Parameters
    ----------
    trend : Union[str, BaseEffect], optional, one of "linear" (default) or "logistic"
        Type of trend to use. Can also be a custom effect object.

    changepoint_interval : int, optional, default=25
        Number of potential changepoints to sample in the history.

    changepoint_range : float or int, optional, default=0.8
        Proportion of the history in which trend changepoints will be estimated.

        * if float, must be between 0 and 1.
          The range will be that proportion of the training history.

        * if int, can be positive or negative.
          Absolute value must be less than number of training points.
          The range will be that number of points.
          A negative int indicates number of points
          counting from the end of the history, a positive int from the beginning.

    changepoint_prior_scale : float, optional, default=0.001
        Regularization parameter controlling the flexibility
        of the automatic changepoint selection.

    offset_prior_scale : float, optional, default=0.1
        Scale parameter for the prior distribution of the offset.
        The offset is the constant term in the piecewise trend equation.

    capacity_prior_scale : float, optional, default=0.2
        Scale parameter for the prior distribution of the capacity.

    capacity_prior_loc : float, optional, default=1.1
        Location parameter for the prior distribution of the capacity.

    feature_transformer : sktime transformer, BaseTransformer, optional, default=None
        Transformer object to generate Fourier terms, holiday or other features.
        If None, no additional features are used.
        For multiple features, pass a ``FeatureUnion`` object with the transformers.

    noise_scale : float, optional, default=0.05
        Scale parameter for the observation noise.

    mcmc_samples : int, optional, default=2000
        Number of MCMC samples to draw.

    mcmc_warmup : int, optional, default=200
        Number of MCMC warmup steps. Also known as burn-in.

    mcmc_chains : int, optional, default=4
        Number of MCMC chains to run in parallel.

    inference_method : str, optional, one of "mcmc" or "map", default="map"
        Inference method to use. Can be "mcmc" or "map".

    optimizer_name : str, optional, default="Adam"
        Name of the numpyro optimizer to use for variational inference.

    optimizer_kwargs : dict, optional, default={}
        Additional keyword arguments to pass to the numpyro optimizer.

    optimizer_steps : int, optional, default=100_000
        Number of optimization steps to perform for variational inference.

    exogenous_effects : List[AbstractEffect], optional, default=None
        A list of ``prophetverse`` ``AbstractEffect`` objects
        defining the exogenous effects to be used in the model.

    likelihood : str, optional, default="normal"
        Likelihood to use for the model. Can be "normal", "gamma" or "negbinomial".

    default_effect : AbstractEffectm optional, defalut=None
        The default effect to be used when no effect is specified for a variable.

    default_exogenous_prior : tuple, default=None
        Default prior distribution for exogenous effects.

    rng_key : jax.random.PRNGKey or None (default
        Random number generator key.
    """

    _tags = {
        # packaging info
        # --------------
        "authors": "felipeangelimvieira",
        "maintainers": "felipeangelimvieira",
        "python_dependencies": "prophetverse",
        # estimator type
        # --------------
        "capability:pred_int": True,
        "capability:pred_int:insample": True,
        "enforce_index_type": [pd.Period, pd.DatetimeIndex],
        "requires-fh-in-fit": False,
        "y_inner_mtype": "pd.DataFrame",
    }

    def __init__(
        self,
        trend: Union[BaseEffect, str] = "linear",
        changepoint_interval: int = 25,
        changepoint_range: Union[float, int] = 0.8,
        changepoint_prior_scale: float = 0.001,
        offset_prior_scale: float = 0.1,
        capacity_prior_scale=0.2,
        capacity_prior_loc=1.1,
        exogenous_effects: Optional[List[BaseEffect]] = None,
        default_effect: Optional[BaseEffect] = None,
        feature_transformer=None,
        noise_scale=0.05,
        mcmc_samples=2000,
        mcmc_warmup=200,
        mcmc_chains=4,
        inference_method="map",
        optimizer_name="Adam",
        optimizer_kwargs=None,
        optimizer_steps=100_000,
        likelihood="normal",
        scale=None,
        rng_key=None,
    ):
        """Initialize the Prophet model."""
        self.noise_scale = noise_scale
        self.feature_transformer = feature_transformer

        self.likelihood = likelihood

        super().__init__(
            rng_key=rng_key,
            # Trend
            trend=trend,
            changepoint_interval=changepoint_interval,
            changepoint_range=changepoint_range,
            changepoint_prior_scale=changepoint_prior_scale,
            offset_prior_scale=offset_prior_scale,
            capacity_prior_scale=capacity_prior_scale,
            capacity_prior_loc=capacity_prior_loc,
            # Exog
            default_effect=default_effect,
            exogenous_effects=exogenous_effects,
            # BaseBayesianForecaster
            inference_method=inference_method,
            mcmc_samples=mcmc_samples,
            mcmc_warmup=mcmc_warmup,
            mcmc_chains=mcmc_chains,
            optimizer_name=optimizer_name,
            optimizer_kwargs=optimizer_kwargs,
            optimizer_steps=optimizer_steps,
            scale=scale,
        )

        self._validate_hyperparams()

    @property
    def model(self):
        """Return the model function.

        Returns
        -------
        Callable
            A function that can be passed to Numpyro samplers.
        """
        return _LIKELIHOOD_MODEL_MAP[self.likelihood]

    @property
    def _likelihood_is_discrete(self) -> bool:
        """Skip scaling if the likelihood is discrete.

        In the case of discrete likelihoods, the data is not scaled since this can
        create non-integer values.
        """
        return self.likelihood in _DISCRETE_LIKELIHOODS

    def _validate_hyperparams(self):
        """Validate the hyperparameters."""
        super()._validate_hyperparams()

        if self.noise_scale <= 0:
            raise ValueError("noise_scale must be greater than 0.")

        if self.likelihood not in _LIKELIHOOD_MODEL_MAP:
            raise ValueError(
                f"likelihood must be one of {list(_LIKELIHOOD_MODEL_MAP.keys())}"
                + "Got {self.likelihood}."
            )

    def _get_fit_data(self, y, X, fh):
        """
        Prepare the data for the Numpyro model.

        Parameters
        ----------
        y: pd.DataFrame
            Time series data.
        X: pd.DataFrame
            Exogenous variables.
        fh: ForecastingHorizon
            Forecasting horizon.

        Returns
        -------
        dict
            Dictionary of data for the Numpyro model.
        """
        fh = y.index.get_level_values(-1).unique()

        self.trend_model_ = self._get_trend_model()

        if self._likelihood_is_discrete:
            # Scale the data, since _get_fit_data receives
            # a non-scaled y for discrete likelihoods
            self.trend_model_.fit(X=X, y=y / self._scale)
        else:
            self.trend_model_.fit(X=X, y=y)

        # Exogenous features

        if X is None:
            X = pd.DataFrame(index=y.index)

        if self.feature_transformer is not None:

            X = self.feature_transformer.fit_transform(X)

        self._has_exogenous = ~X.columns.empty
        X = X.loc[y.index]

        trend_data = self.trend_model_.transform(X=X, fh=fh)

        self._fit_effects(X, y)
        exogenous_data = self._transform_effects(X, fh=fh)

        y_array = jnp.array(y.values.flatten()).reshape((-1, 1))

        # Inputs that also are used in predict
        self.fit_and_predict_data_ = {
            "trend_model": self.trend_model_,
            "noise_scale": self.noise_scale,
            "scale": self._scale,
            "exogenous_effects": (
                self.non_skipped_exogenous_effect if self._has_exogenous else None
            ),
        }

        inputs = {
            "y": y_array,
            "data": exogenous_data,
            "trend_data": trend_data,
            **self.fit_and_predict_data_,
        }

        return inputs

    def _get_predict_data(
        self, X: Union[pd.DataFrame, None], fh: ForecastingHorizon
    ) -> dict:
        """
        Prepare the data for making predictions.

        Parameters
        ----------
        X: pd.DataFrame
            Exogenous variables.
        fh: ForecastingHorizon
            Forecasting horizon.

        Returns
        -------
        dict
            Dictionary of data for the Numpyro model.
        """
        fh_dates = self.fh_to_index(fh)
        fh_as_index = pd.Index(list(fh_dates.to_numpy()))

        if X is None:
            X = pd.DataFrame(index=fh_as_index)

        if self.feature_transformer is not None:
            X = self.feature_transformer.transform(X)

        trend_data = self.trend_model_.transform(X=X, fh=fh_as_index)

        exogenous_data = (
            self._transform_effects(X, fh_as_index) if self._has_exogenous else None
        )

        return dict(
            y=None,
            data=exogenous_data,
            trend_data=trend_data,
            **self.fit_and_predict_data_,
        )

    @classmethod
    def get_test_params(cls, parameter_set="default"):  # pragma: no cover
        """Params to be used in sktime unit tests.

        Parameters
        ----------
        parameter_set : str, optional
            The parameter set to be used (ignored in this implementation)

        Returns
        -------
        List[dict[str, int]]
            A list of dictionaries containing the test parameters.
        """
        params = []
        for likelihood in _LIKELIHOOD_MODEL_MAP.keys():
            params.append(
                {
                    "likelihood": likelihood,
                    "optimizer_steps": 10,
                }
            )
        return params

model property

Return the model function.

Returns:

Type Description
Callable

A function that can be passed to Numpyro samplers.

__init__(trend='linear', changepoint_interval=25, changepoint_range=0.8, changepoint_prior_scale=0.001, offset_prior_scale=0.1, capacity_prior_scale=0.2, capacity_prior_loc=1.1, exogenous_effects=None, default_effect=None, feature_transformer=None, noise_scale=0.05, mcmc_samples=2000, mcmc_warmup=200, mcmc_chains=4, inference_method='map', optimizer_name='Adam', optimizer_kwargs=None, optimizer_steps=100000, likelihood='normal', scale=None, rng_key=None)

Initialize the Prophet model.

Source code in src/prophetverse/sktime/univariate.py
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
def __init__(
    self,
    trend: Union[BaseEffect, str] = "linear",
    changepoint_interval: int = 25,
    changepoint_range: Union[float, int] = 0.8,
    changepoint_prior_scale: float = 0.001,
    offset_prior_scale: float = 0.1,
    capacity_prior_scale=0.2,
    capacity_prior_loc=1.1,
    exogenous_effects: Optional[List[BaseEffect]] = None,
    default_effect: Optional[BaseEffect] = None,
    feature_transformer=None,
    noise_scale=0.05,
    mcmc_samples=2000,
    mcmc_warmup=200,
    mcmc_chains=4,
    inference_method="map",
    optimizer_name="Adam",
    optimizer_kwargs=None,
    optimizer_steps=100_000,
    likelihood="normal",
    scale=None,
    rng_key=None,
):
    """Initialize the Prophet model."""
    self.noise_scale = noise_scale
    self.feature_transformer = feature_transformer

    self.likelihood = likelihood

    super().__init__(
        rng_key=rng_key,
        # Trend
        trend=trend,
        changepoint_interval=changepoint_interval,
        changepoint_range=changepoint_range,
        changepoint_prior_scale=changepoint_prior_scale,
        offset_prior_scale=offset_prior_scale,
        capacity_prior_scale=capacity_prior_scale,
        capacity_prior_loc=capacity_prior_loc,
        # Exog
        default_effect=default_effect,
        exogenous_effects=exogenous_effects,
        # BaseBayesianForecaster
        inference_method=inference_method,
        mcmc_samples=mcmc_samples,
        mcmc_warmup=mcmc_warmup,
        mcmc_chains=mcmc_chains,
        optimizer_name=optimizer_name,
        optimizer_kwargs=optimizer_kwargs,
        optimizer_steps=optimizer_steps,
        scale=scale,
    )

    self._validate_hyperparams()

get_test_params(parameter_set='default') classmethod

Params to be used in sktime unit tests.

Parameters:

Name Type Description Default
parameter_set str

The parameter set to be used (ignored in this implementation)

'default'

Returns:

Type Description
List[dict[str, int]]

A list of dictionaries containing the test parameters.

Source code in src/prophetverse/sktime/univariate.py
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
@classmethod
def get_test_params(cls, parameter_set="default"):  # pragma: no cover
    """Params to be used in sktime unit tests.

    Parameters
    ----------
    parameter_set : str, optional
        The parameter set to be used (ignored in this implementation)

    Returns
    -------
    List[dict[str, int]]
        A list of dictionaries containing the test parameters.
    """
    params = []
    for likelihood in _LIKELIHOOD_MODEL_MAP.keys():
        params.append(
            {
                "likelihood": likelihood,
                "optimizer_steps": 10,
            }
        )
    return params