LGBMLinearForecaster#

class openstef_models.models.forecasting.lgbmlinear_forecaster.LGBMLinearForecaster(**data: Any) → None[source]

Bases: Forecaster, ExplainableForecaster, ContributionsMixin

LGBMLinear-based forecaster for probabilistic energy forecasting.

Implements gradient boosting trees with linear leaves using LightGBM for multi-quantile forecasting. Optimized for time series prediction with specialized loss functions and comprehensive hyperparameter control suitable for production energy forecasting.

The forecaster uses a multi-output strategy where each quantile is predicted by separate trees within the same boosting ensemble. This approach provides well-calibrated uncertainty estimates while maintaining computational efficiency.

Invariants

fit() must be called before predict() to train the model
Configuration quantiles determine the number of prediction outputs
Model state is preserved across predict() calls after fitting
Input features must match training data structure during prediction

Example

Basic forecasting workflow

>>> from datetime import timedelta
>>> from openstef_core.types import LeadTime, Quantile
>>> forecaster = LGBMLinearForecaster(
...     quantiles=[Quantile(0.1), Quantile(0.5), Quantile(0.9)],
...     horizons=[LeadTime(timedelta(hours=1))],
...     hyperparams=LGBMLinearHyperParams(n_estimators=100, max_depth=6),
... )
>>> forecaster.fit(training_data)
>>> predictions = forecaster.predict(test_data)

Note

LightGBM dependency is optional and must be installed separately. The model automatically handles multi-quantile output and uses magnitude-weighted pinball loss by default for better forecasting performance.

See also

LGBMLinearHyperParams: Detailed hyperparameter configuration options. Forecaster: Base interface for all forecasting models. GBLinearForecaster: Alternative linear model using XGBoost.

Parameters:: data (Any)

HyperParams: alias of LGBMLinearHyperParams

hyperparams: LGBMLinearHyperParams

device: str

n_jobs: int

verbosity: Literal[-1, 0, 1, 2, 3]

random_state: int | None

early_stopping_rounds: int | None

property hparams: LGBMLinearHyperParams

Model hyperparameters for training and prediction.

Concrete forecasters implement this by returning their narrowed hyperparams field, giving callers a polymorphic read-only view while each subclass keeps full type safety on its own field.

model_post_init(_context: object, /) → None[source]

Initialize the underlying LightGBM linear model from configuration.

Raises:: MissingExtraError – If lightgbm is not installed.
Parameters:: _context (object)
Return type:: None

property is_fitted: bool: Check if the predictor has been fitted.

fit(data: ForecastInputDataset, data_val: ForecastInputDataset | None = None) → None[source]

Fit the predictor to the input data.

This method should be called before generating predictions. It allows the predictor to learn parameters from the training data.

Parameters:

data (ForecastInputDataset) – The training data to fit the predictor on.
data_val (ForecastInputDataset | None) – The validation data to evaluate and tune the predictor on (optional).
data
data_val

Return type:

None

predict(data: ForecastInputDataset) → ForecastDataset[source]

Generate predictions for the input data.

This method should use the fitted parameters to generate predictions.

Parameters:

data (ForecastInputDataset) – The input data to generate predictions for.
data

Returns:

Predictions for the input data.

Raises:

NotFittedError – If the predictor has not been fitted yet.

Return type:

ForecastDataset

predict_contributions(data: ForecastInputDataset) → TimeSeriesDataset[source]

Compute SHAP feature contributions for the median quantile.

Parameters:

data (ForecastInputDataset) – Input dataset for which to compute feature contributions.
data

Returns:

TimeSeriesDataset with per-feature SHAP values plus a bias column.

Raises:

NotFittedError – If the model has not been fitted.

Return type:

TimeSeriesDataset

property feature_importances: DataFrame

Get feature importance scores for this model.

Returns DataFrame with feature names as index and quantiles as columns. Each quantile represents the importance distribution across multiple model training runs or folds.

Returns:: DataFrame with feature names as index and quantile columns. Values represent normalized importance scores summing to 1.0.

Note

The returned DataFrame must have feature names as index and quantile columns in format ‘quantile_PXX’ (e.g., ‘quantile_P50’, ‘quantile_P95’). All quantile values must be between 0 and 1.

model_config: ClassVar[ConfigDict] = {'arbitrary_types_allowed': True, 'extra': 'ignore', 'protected_namespaces': ()}: Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].