OpenSTEF Meta V0.1 #771

Lars800 · 2025-11-28T14:34:32Z

Introducing OpenSTEF-Meta, a one-stop shop for all things meta-learning.

This sub-package introduces four common meta learning algorithms:

Residual Forecaster

Two-stage forecasting model
Primary model is fitted as usual
Secondary model fitted on residuals
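
As a rough illustration of the two-stage idea, here is a minimal sketch with toy stand-in models in plain Python. The class names (`MeanModel`, `LinearModel`, `ResidualForecaster` as written here) are hypothetical and do not reflect the actual OpenSTEF-Meta API; the sketch also fits the secondary model on in-sample residuals purely for brevity.

```python
from statistics import fmean


class MeanModel:
    """Toy primary model (hypothetical): always predicts the training mean."""

    def fit(self, X, y):
        self.mean_ = fmean(y)
        return self

    def predict(self, X):
        return [self.mean_ for _ in X]


class LinearModel:
    """Toy secondary model (hypothetical): 1-D least-squares line."""

    def fit(self, X, y):
        x_mean, y_mean = fmean(X), fmean(y)
        var = sum((x - x_mean) ** 2 for x in X)
        cov = sum((x - x_mean) * (t - y_mean) for x, t in zip(X, y))
        self.slope_ = cov / var if var else 0.0
        self.intercept_ = y_mean - self.slope_ * x_mean
        return self

    def predict(self, X):
        return [self.intercept_ + self.slope_ * x for x in X]


class ResidualForecaster:
    """Fit the primary model as usual, then fit the secondary model on its residuals."""

    def __init__(self, primary, secondary):
        self.primary = primary
        self.secondary = secondary

    def fit(self, X, y):
        self.primary.fit(X, y)
        # Residual = what the primary model failed to explain.
        residuals = [t - p for t, p in zip(y, self.primary.predict(X))]
        self.secondary.fit(X, residuals)
        return self

    def predict(self, X):
        base = self.primary.predict(X)
        correction = self.secondary.predict(X)
        return [b + c for b, c in zip(base, correction)]


X = [0.0, 1.0, 2.0, 3.0]
y = [1.0, 3.0, 5.0, 7.0]  # y = 2x + 1
model = ResidualForecaster(MeanModel(), LinearModel()).fit(X, y)
prediction = model.predict([4.0])[0]
```

The final prediction is the sum of the primary prediction and the predicted residual, which is also why the contribution of each stage can be reported separately.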

Stacking Forecaster

Multiple Base Forecasters
Single Regressor is fitted on Base Predictions
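
The stacking idea can be sketched in plain Python as follows. All class names are hypothetical toy stand-ins, not the OpenSTEF-Meta classes, and the final regressor is fitted on in-sample base predictions for brevity; whether the real implementation uses held-out predictions is not covered here.

```python
from statistics import fmean


class ScaleModel:
    """Toy base forecaster (hypothetical): least-squares line through the origin."""

    def fit(self, X, y):
        self.slope_ = sum(x * t for x, t in zip(X, y)) / sum(x * x for x in X)
        return self

    def predict(self, X):
        return [self.slope_ * x for x in X]


class MeanModel:
    """Toy base forecaster (hypothetical): always predicts the training mean."""

    def fit(self, X, y):
        self.mean_ = fmean(y)
        return self

    def predict(self, X):
        return [self.mean_ for _ in X]


class TwoInputRegressor:
    """Toy final regressor (hypothetical): least squares on two inputs, no intercept."""

    def fit(self, P, y):
        # Solve the 2x2 normal equations (P^T P) w = P^T y directly.
        a11 = sum(p[0] * p[0] for p in P)
        a12 = sum(p[0] * p[1] for p in P)
        a22 = sum(p[1] * p[1] for p in P)
        b1 = sum(p[0] * t for p, t in zip(P, y))
        b2 = sum(p[1] * t for p, t in zip(P, y))
        det = a11 * a22 - a12 * a12
        self.weights_ = ((b1 * a22 - b2 * a12) / det, (a11 * b2 - a12 * b1) / det)
        return self

    def predict(self, P):
        w1, w2 = self.weights_
        return [w1 * p[0] + w2 * p[1] for p in P]


class StackingForecaster:
    """Fit several base forecasters, then one regressor on their predictions."""

    def __init__(self, base_models, final_regressor):
        self.base_models = base_models
        self.final_regressor = final_regressor

    def _base_predictions(self, X):
        columns = [m.predict(X) for m in self.base_models]
        return list(zip(*columns))  # one row of base predictions per sample

    def fit(self, X, y):
        for m in self.base_models:
            m.fit(X, y)
        self.final_regressor.fit(self._base_predictions(X), y)
        return self

    def predict(self, X):
        return self.final_regressor.predict(self._base_predictions(X))


X = [1.0, 2.0, 3.0, 4.0]
y = [3.0, 5.0, 7.0, 9.0]  # y = 2x + 1
stack = StackingForecaster([ScaleModel(), MeanModel()], TwoInputRegressor()).fit(X, y)
prediction = stack.predict([5.0])[0]
```

Neither toy base model can represent `y = 2x + 1` on its own, but a linear combination of the two can, which is the kind of gain stacking aims for.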

Learned Weights Forecaster

Multiple Base Forecasters
Classification model learns optimal model weights
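
One way to read "a classification model learns the weights" is sketched below: each training sample is labelled with the base forecaster that was closest to the target, a classifier is fitted on those labels, and its class probabilities are used as combination weights. This is an assumption about the general technique, not a description of the code in this PR; `BucketClassifier` and the helper functions are hypothetical.

```python
from collections import Counter, defaultdict


def best_model_labels(base_predictions, y):
    """Label each sample with the index of the base forecaster with the smallest absolute error."""
    return [
        min(range(len(preds)), key=lambda i: abs(preds[i] - target))
        for preds, target in zip(base_predictions, y)
    ]


class BucketClassifier:
    """Toy classifier (hypothetical): class frequencies per discrete feature value."""

    def fit(self, features, labels, n_classes):
        counts = defaultdict(Counter)
        for feature, label in zip(features, labels):
            counts[feature][label] += 1
        self.n_classes_ = n_classes
        self.proba_ = {
            feature: [c[k] / sum(c.values()) for k in range(n_classes)]
            for feature, c in counts.items()
        }
        return self

    def predict_proba(self, features):
        # Fall back to uniform weights for feature values never seen in training.
        uniform = [1.0 / self.n_classes_] * self.n_classes_
        return [self.proba_.get(feature, uniform) for feature in features]


def combine(base_predictions, weights):
    """Weighted sum of base predictions, one weight vector per sample."""
    return [sum(w * p for w, p in zip(ws, preds)) for preds, ws in zip(base_predictions, weights)]


# Two base forecasters: model 0 is better during the day, model 1 at night.
train_features = ["day", "day", "night", "night"]
train_base_predictions = [(10.0, 14.0), (11.0, 15.0), (6.0, 3.0), (7.0, 2.0)]
train_y = [10.0, 11.0, 3.0, 2.0]

labels = best_model_labels(train_base_predictions, train_y)
classifier = BucketClassifier().fit(train_features, labels, n_classes=2)

weights = classifier.predict_proba(["day", "night"])
combined = combine([(12.0, 20.0), (9.0, 4.0)], weights)
```

Because the weights are probabilities, they sum to one per sample, so the combined forecast stays within the range spanned by the base forecasts.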

Rules Forecaster

Multiple Base Forecasters
Pre-defined rules on how to combine base methods.
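
A rule-based combiner could look roughly like the sketch below. The `Rule` class, the `select_prediction` helper, and the example rules and feature names are all hypothetical and only illustrate the idea of fixed, hand-written selection rules.

```python
from dataclasses import dataclass
from typing import Callable, Mapping, Sequence


@dataclass(frozen=True)
class Rule:
    """Use `model_name` whenever `condition` holds for the sample's features."""

    condition: Callable[[Mapping[str, float]], bool]
    model_name: str


def select_prediction(
    rules: Sequence[Rule],
    default_model: str,
    features: Mapping[str, float],
    base_predictions: Mapping[str, float],
) -> float:
    """Return the prediction of the first matching rule, else the default model's."""
    for rule in rules:
        if rule.condition(features):
            return base_predictions[rule.model_name]
    return base_predictions[default_model]


# Hypothetical rules: prefer the linear model at night and at very high wind speeds.
rules = [
    Rule(lambda f: f["hour"] < 6, "gblinear"),
    Rule(lambda f: f["wind_speed"] > 20, "gblinear"),
]

night = select_prediction(
    rules, "lgbm", {"hour": 3, "wind_speed": 5}, {"lgbm": 41.0, "gblinear": 38.5}
)
noon = select_prediction(
    rules, "lgbm", {"hour": 12, "wind_speed": 5}, {"lgbm": 55.0, "gblinear": 60.0}
)
```

Rules are evaluated in order, so more specific rules should be listed before more general ones.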

This is an initial implementation

The methods have been tested on the Liander 2024 Huggingface dataset.
The results are accurate and improve over existing LGBM, XGBoost and GBLinear models
The code can still be further optimized for efficiency


commit 37089b8
Author: Egor Dmitriev <[email protected]>
Date: Mon Nov 17 15:29:59 2025 +0100

fix(#728): Fixed parallelism stability issues, and gblinear feature pipeline. (#752)

* fix(STEF-2475): Added loky as default option for parallelism since fork causes instabilities for xgboost results. Signed-off-by: Egor Dmitriev <[email protected]>
* fix(STEF-2475): Added better support for flatliners and predicting when data is sparse. Signed-off-by: Egor Dmitriev <[email protected]>
* fix(STEF-2475): Feature handing improvements for gblinear. Like imputation, nan dropping, and checking if features are available. Signed-off-by: Egor Dmitriev <[email protected]>
* fix(#728): Added checks on metrics to gracefully handle empty data. Added flatline filtering during evalution. Signed-off-by: Egor Dmitriev <[email protected]>
* fix(#728): Updated xgboost to skip scaling on empty prediction. Signed-off-by: Egor Dmitriev <[email protected]>
* fix(STEF-2475): Added parallelism parameters. Signed-off-by: Egor Dmitriev <[email protected]>
---------
Signed-off-by: Egor Dmitriev <[email protected]>

commit a85a3f7
Author: Egor Dmitriev <[email protected]>
Date: Fri Nov 14 14:31:34 2025 +0100

fix(STEF-2475): Fixed rolling aggregate adder by adding forward filling and stating support for only one horizon. (#750)

Signed-off-by: Egor Dmitriev <[email protected]>

commit 4f0c664
Author: Egor Dmitriev <[email protected]>
Date: Thu Nov 13 16:54:15 2025 +0100

feature: Disabled data cutoff by default to be consistent with openstef 3. And other minor improvements. (#748)

commit 493126e
Author: Egor Dmitriev <[email protected]>
Date: Thu Nov 13 16:12:35 2025 +0100

fix(STEF-2475) fix and refactor backtesting iction in context of backtestforecasting config for clarity. Added more colors. Fixed data split function to handle 0.0 splits. (#747)

* fix: Fixed data collation during backtesting. Renamed horizon to prediction in context of backtestforecasting config for clarity. Added more colors. Fixed data split function to handle 0.0 splits.
* fix: Formatting. Signed-off-by: Egor Dmitriev <[email protected]>
* fix: Formatting. Signed-off-by: Egor Dmitriev <[email protected]>
---------
Signed-off-by: Egor Dmitriev <[email protected]>

commit 6b1da44
Author: Egor Dmitriev <[email protected]>
Date: Thu Nov 13 16:05:32 2025 +0100

feature: forecaster hyperparams and eval metrics (#746)

* feature(#729) Removed to_state and from_state methods in favor of builtin python state saving functions. Signed-off-by: Egor Dmitriev <[email protected]>
* feature(#729): Fixed issue where generic transform pipeline could not be serialized. Signed-off-by: Egor Dmitriev <[email protected]>
* feature(#729): Added more state saving tests Signed-off-by: Egor Dmitriev <[email protected]>
* feature(#729): Added more state saving tests Signed-off-by: Egor Dmitriev <[email protected]>
* feature(#729): Added more state saving tests Signed-off-by: Egor Dmitriev <[email protected]>
* feature: standardized objective function. Added custom evaluation functions for forecasters.
* fix: Formatting. Signed-off-by: Egor Dmitriev <[email protected]>
---------
Signed-off-by: Egor Dmitriev <[email protected]>


Residual Forecaster and Stacking Forecaster can now predict model contributions. Regular forecasters (EXCEPT LGBM Linear) can predict feature contributions

Signed-off-by: Lars van Someren <[email protected]>

commit 6f88d72
Author: Lars van Someren <[email protected]>
Date: Mon Dec 8 09:46:57 2025 +0100

Bugfixes

Signed-off-by: Lars van Someren <[email protected]>

commit b44fd92
Author: Lars van Someren <[email protected]>
Date: Thu Dec 4 14:39:31 2025 +0100

bug fixes

Signed-off-by: Lars van Someren <[email protected]>

commit e212448
Author: Lars van Someren <[email protected]>
Date: Thu Dec 4 12:38:24 2025 +0100

fixes

Signed-off-by: Lars van Someren <[email protected]>

commit eb775e4
Author: Lars van Someren <[email protected]>
Date: Thu Dec 4 11:40:44 2025 +0100

BugFix

Signed-off-by: Lars van Someren <[email protected]>

commit c33ce93
Author: Lars van Someren <[email protected]>
Date: Wed Dec 3 14:15:06 2025 +0100

Made PR Compliant

Signed-off-by: Lars van Someren <[email protected]>


egordm

Awesome improvements! It looks mostly release-ready.

I have a few small comments / nitpicks, but overall great work!

Quick note, we did make some small changes/fixes in the current release branch. So you might need to rebase.

I think after this is merged, we can open a PR to merge the research branch into release if it's free of research artifacts / temporary scripts.

egordm · 2025-12-16T09:11:32Z

packages/openstef-models/src/openstef_models/integrations/mlflow/mlflow_storage_callback.py

+        if isinstance(context.workflow.model, EnsembleForecastingModel):
+            raise NotImplementedError(
+                "MLFlowStorageCallback does not yet support EnsembleForecastingWorkflow model storage."
+            )
+
        # Create a new run
        run = self.storage.create_run(
            model_id=context.workflow.model_id,
            tags=context.workflow.model.tags,
-            hyperparams=context.workflow.model.forecaster.hyperparams,
+            hyperparams=context.workflow.model.forecaster.hyperparams,  # type: ignore TODO Make MLFlow compatible with OpenSTEF Meta


We should probably address this before merging.

It's mostly in hyperparams if I understand it correctly? Since the rest is pickling.

Yes, this is required for use in production as I understand it. I left this integration unresolved as I do not fully understand this part of the package.

The primary issue here is that EnsembleForecastingModel does not have a forecaster attribute, so forecaster.hyperparams is unavailable. Depending on what the hyperparams are used for downstream, we can either:

Pass combiner.hyperparams

Pass a dictionary of hyperparams, something like

{
    'forecaster_model_1': Hyperparams(),
    'forecaster_model_2': Hyperparams(),
    'combiner': Hyperparams(),
}

This way everything is neatly saved, but it does require additional changes.
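
To make the second option concrete, here is a small sketch of how such a dictionary could be built and flattened into key/value pairs for logging. The `Hyperparams` dataclass and both helper functions are hypothetical stand-ins; the real hyperparameter classes and the storage callback may need a different shape.

```python
from dataclasses import asdict, dataclass


@dataclass
class Hyperparams:
    """Hypothetical stand-in for a forecaster's hyperparameter object."""

    learning_rate: float = 0.1
    n_estimators: int = 100


def collect_hyperparams(forecasters, combiner):
    """Build {name: params} for every base forecaster plus the combiner."""
    if "combiner" in forecasters:
        raise ValueError("'combiner' is reserved for the combiner's hyperparams.")
    collected = {name: asdict(params) for name, params in forecasters.items()}
    collected["combiner"] = asdict(combiner)
    return collected


def flatten(nested, sep="."):
    """Turn {'model': {'param': value}} into {'model.param': value}."""
    return {
        f"{outer}{sep}{inner}": value
        for outer, params in nested.items()
        for inner, value in params.items()
    }


nested = collect_hyperparams(
    {
        "forecaster_model_1": Hyperparams(),
        "forecaster_model_2": Hyperparams(learning_rate=0.05),
    },
    combiner=Hyperparams(n_estimators=10),
)
flat = flatten(nested)
```

Prefixing each key with the model name keeps parameters of different base forecasters from overwriting each other when they share a name.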

packages/openstef-models/src/openstef_models/explainability/mixins.py

packages/openstef-models/src/openstef_models/models/forecasting/flatliner_forecaster.py

packages/openstef-models/src/openstef_models/models/forecasting/gblinear_forecaster.py

egordm · 2025-12-16T09:22:01Z

packages/openstef-models/src/openstef_models/workflows/custom_forecasting_workflow.py

+            if isinstance(result, EnsembleModelFitResult):
+                self._logger.info("Discarding EnsembleModelFitResult for compatibility.")
+                result = result.combiner_fit_result


Nitpick. But I think using log level info might be too verbose. debug would be more appropriate, or no logging at all, since it's not really actionable.

I will change the log level. I do think it would be nice to keep the full EnsembleModelFitResult in the future, to really evaluate the setup.

Current setup:

EnsembleModelFitResults:
    forecaster_results: ModelFitResult[]  // Performance of base forecasters
    combiner_results: ModelFitResult      // Performance of forecasters + combiner

Right now the combiner fit result gives the total performance after applying the combiner models. Ideally we would separate this. We would get something like this:

EnsembleModelFitResults:
    forecaster_results: ModelFitResult[]  // Performance of base forecasters
    combiner_results: ModelFitResult      // Added performance of combiner
    full_results: ModelFitResult          // Performance of forecasters + combiner

This of course implies extra work for the callbacks etc. to make everything compatible, so for now we discard the larger structure and keep only the total result.
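
The proposed three-part structure, together with the debug-level logging suggested in the review, could be sketched like this. The dataclasses below are simplified, hypothetical stand-ins (the real `ModelFitResult` carries far more than a single metric), and `to_single_result` is an illustrative helper name.

```python
import logging
from dataclasses import dataclass

logger = logging.getLogger(__name__)


@dataclass
class ModelFitResult:
    """Hypothetical stand-in: a name plus one error metric."""

    name: str
    error: float


@dataclass
class EnsembleModelFitResult:
    """Sketch of the proposed structure, not the class in this PR."""

    forecaster_results: list[ModelFitResult]  # performance of base forecasters
    combiner_results: ModelFitResult  # added performance of the combiner
    full_results: ModelFitResult  # performance of forecasters + combiner


def to_single_result(result):
    """Reduce an ensemble result to one ModelFitResult for existing callbacks."""
    if isinstance(result, EnsembleModelFitResult):
        # Debug level: the message is informative but not actionable.
        logger.debug("Discarding EnsembleModelFitResult for compatibility.")
        return result.full_results
    return result


ensemble_result = EnsembleModelFitResult(
    forecaster_results=[ModelFitResult("lgbm", 0.12), ModelFitResult("gblinear", 0.15)],
    combiner_results=ModelFitResult("combiner", 0.02),
    full_results=ModelFitResult("ensemble", 0.10),
)
kept = to_single_result(ensemble_result)
```

Plain `ModelFitResult` objects pass through unchanged, so callers would not need to know whether the workflow used an ensemble.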

packages/openstef-models/src/openstef_models/presets/forecasting_workflow.py

packages/openstef-models/src/openstef_models/transforms/general/selector.py

Signed-off-by: Lars van Someren <[email protected]>

...stef-beam/src/openstef_beam/backtesting/backtest_forecaster/openstef4_backtest_forecaster.py

MvLieshout · 2025-12-16T09:18:00Z

...stef-beam/src/openstef_beam/backtesting/backtest_forecaster/openstef4_backtest_forecaster.py

        # Extract quantiles from the workflow's model
+
+        if isinstance(self._workflow.model, EnsembleForecastingModel):
+            # Assuming all ensemble members have the same quantiles


Do we also enforce this?

Then we do not have to assume I guess

We do not enforce it explicitly, but if the Forecasting Workflow (Config) is used, it is always the case. I have removed the inline comment. The if statement still applies, as the ensemble forecasting model does not have a .forecaster property. Alternatively, we can define a .forecaster property on EnsembleForecastingModel that returns self.forecasters[0], to ensure compatibility with beam etc. without if statements.
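
A sketch of that alternative, combined with an explicit check so the quantiles no longer have to be assumed equal. `ToyForecaster` and `ToyEnsembleModel` are hypothetical stand-ins and not the classes in this PR.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyForecaster:
    """Hypothetical stand-in for a base forecaster."""

    name: str
    quantiles: tuple[float, ...]


class ToyEnsembleModel:
    """Hypothetical ensemble that validates quantiles and exposes `.forecaster`."""

    def __init__(self, forecasters):
        if not forecasters:
            raise ValueError("At least one base forecaster is required.")
        # Enforce, rather than assume, that all members share the same quantiles.
        if len({f.quantiles for f in forecasters}) > 1:
            raise ValueError("All base forecasters must use the same quantiles.")
        self.forecasters = list(forecasters)

    @property
    def forecaster(self):
        """Compatibility accessor for callers that expect a single forecaster."""
        return self.forecasters[0]


ensemble = ToyEnsembleModel(
    [
        ToyForecaster("lgbm", (0.1, 0.5, 0.9)),
        ToyForecaster("gblinear", (0.1, 0.5, 0.9)),
    ]
)
quantiles = ensemble.forecaster.quantiles
```

With the check in the constructor, reading the quantiles from the first member is safe by construction.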

packages/openstef-meta/src/openstef_meta/models/ensemble_forecasting_model.py

packages/openstef-meta/src/openstef_meta/presets/forecasting_workflow.py

packages/openstef-meta/src/openstef_meta/utils/datasets.py

Signed-off-by: Lars van Someren <[email protected]>

MvLieshout · 2025-12-16T11:08:34Z

Really nice additions to OpenSTEF, I have left some comments. Mostly small nitpicks.

Signed-off-by: Lars van Someren <[email protected]>

…ybridForecaster2.0 Signed-off-by: Lars van Someren <[email protected]>

Signed-off-by: Lars van Someren <[email protected]>

Lars800 added 30 commits November 7, 2025 15:48

Added Lightgbm, LightGBM Linear Trees and Hybrid Stacking Forecasters

f69b8d2

Fixed small issues

6fcd632

Ruff compliance

7523987

fixed quality checks

c680aa1

Fixed last issues, Signed-off-by: Lars van Someren <lars.vansomeren@s…

9c1e3d3

…ia-partners.com>

fixed comments

4394895

Refactor LightGBM to LGBM

5745212

Update LGBM and LGBMLinear defaults, fixed comments

a2538b6

Merge branch 'release/v4.0.0' into Openstef4.0.0_Additional_forecasters

1788759

Fixed comments

3d54604

Added SkopsModelSerializer

34fc3e5

Fixed issues

bad4c44

Gitignore optimization and dev sandbox

99c9bc5

Added MultiQuantileAdapter Class

4027de7

small fix

064a92d

Hybrid V2

ed83b3a

Small fix

bfa2e2f

set silence

8be453a

Merge branch 'release/v4.0.0' into research/v4.1.0_Additional_Forecas…

8c5743b

…ters

small fix

ea90239

Fix final learner

93baa03

fixed lgbm efficiency

4f8ea8f

updated lgbm linear params

b4bdbdc

Fixed type and quality issues

ea1f5f7

First Version Sample Weighting Approach

22688e0

Signed-off-by: Lars van Someren <[email protected]>

MetaForecasterClass

9b971d3

Signed-off-by: Lars van Someren <[email protected]>

Merge remote-tracking branch 'origin/research/v4.1.0' into research/H…

5a54c4f

…ybridForecaster2.0

fix merge issue

72b1ca7

Signed-off-by: Lars van Someren <[email protected]>

Fixed type Issues

553e2fd

Signed-off-by: Lars van Someren <[email protected]>

Lars800 and others added 18 commits December 3, 2025 13:29

Prepared TODOs for Florian

e18ce5a

Signed-off-by: Lars van Someren <[email protected]>

Small fix

ece5d18

Signed-off-by: Lars van Someren <[email protected]>

Made PR Compliant

c33ce93

Signed-off-by: Lars van Someren <[email protected]>

BugFix

eb775e4

Signed-off-by: Lars van Someren <[email protected]>

fixes

e212448

Signed-off-by: Lars van Someren <[email protected]>

bug fixes

b44fd92

Signed-off-by: Lars van Someren <[email protected]>

added learned weights contributions

51579d0

Added Feature Contributions

2899baf

Residual Forecaster and Stacking Forecaster can now predict model contributions. Regular forecasters (EXCEPT LGBM Linear) can predict feature contributions

Bugfixes

6f88d72

Signed-off-by: Lars van Someren <[email protected]>

fixes

20edf2d

Signed-off-by: Lars van Someren <[email protected]>

Fixes

e6bc447

Signed-off-by: Lars van Someren <[email protected]>

fixed tests

bedf6af

Signed-off-by: Lars van Someren <[email protected]>

small fix

c9f135f

Signed-off-by: Lars van Someren <[email protected]>

Merge remote-tracking branch 'origin/research/ExplainableModelContrib…

b10d02c

…utionsWFunctions' into research/HybridForecaster2.0 Signed-off-by: Lars van Someren <[email protected]>

Stacking Bugfix

845e384

Signed-off-by: Lars van Someren <[email protected]>

Added hard Forecast Selection

780e012

Signed-off-by: Lars van Someren <[email protected]>

Improved data handling in EnsembleForecasting model, correct data spl…

682ae2f

…itting and Model Fit Result. Validation and test data can now be fully used Signed-off-by: Lars van Someren <[email protected]>

egordm requested changes Dec 16, 2025


Migrated Flagger and Selector to OpenSTEF Models transforms

619c271

Signed-off-by: Lars van Someren <[email protected]>

MvLieshout reviewed Dec 16, 2025


Fixed restore target Forecast Combiner

3b6587a

Signed-off-by: Lars van Someren <[email protected]>

Lars800 added 6 commits December 16, 2025 12:10

Streamline logging statements, Fix quality

ede0908

Signed-off-by: Lars van Someren <[email protected]>

Resolved comments, fixed bug

ab13581

Signed-off-by: Lars van Someren <[email protected]>

Moved example

b5a3737

Signed-off-by: Lars van Someren <[email protected]>

Merge remote-tracking branch 'origin/research/v4.1.0' into research/H…

c650bb8

…ybridForecaster2.0 Signed-off-by: Lars van Someren <[email protected]>

Integrated changes to beam structure

297f186

Signed-off-by: Lars van Someren <[email protected]>

make PR compliant

0ac62c8

Signed-off-by: Lars van Someren <[email protected]>

egordm mentioned this pull request Dec 19, 2025

[OpenSTEF 4.0] Adding shap explainability support for forecasters #792

Open

1 task
