
[ENH] Notebook and Template For Global Forecasting API #6699

Merged: 128 commits, Aug 10, 2024

Conversation

@XinyuWuu (Member) commented Jul 1, 2024

To close #6575 and #6684.

The notebook is originally from #6551.

Copying some discussion:


fkiraly:

Great! FYI @benHeid, have you seen and reviewed this?

I mainly have comments about the notebook.

  • could you separate this into another PR? The notebook is great, but there is some iteration needed regarding location and integration with the other notebooks. I think we can merge the forecasters already and don't need to delay.

Some minor comments on the notebook content:

  • there are a lot of printouts that confuse the reader. These should be silenced so you can focus on the didactic content.
  • the markdown text is nice, I would just format it so the lines are not too long, and I would also use shorter telegram style, like on ppt slides

Regarding location, I would actually add the content to notebook 01c, which has some content already. There is some minor confusion about terminology: the notebook also uses the term "global", but in a different way.

  • in the M5 paper, what the 01c notebook does is called "cross-learning"
  • the "global" in current 01d is more of a pre-training on other instances

In any case, we need to disambiguate terminology and perhaps adopt clearer distinctions here. @benHeid, what do you suggest on how we handle the terminology clash between 01c and 01d? And, should this go in the same notebook, so the "multiple instances" cases can be explained easily?

Originally posted by @fkiraly in #6551 (review)


shlok191:

@fkiraly, @Xinyu-Wu-0000. Yes, I have some minor input! Both the 01d_forecasting_global_forecast.ipynb and 01c_forecasting_hierarchical_global.ipynb notebooks use the term "global learning". I agree that instead of "global learning", we can use "pretrained" and "cross-learning" as replacements.

Here is how I'd differentiate them:

  • To the best of my knowledge, pre-trained models do not require the time series in the training dataset to be correlated with each other.

  • Referencing this paper on the M5 competition, I believe "cross-learning" is the term used for training models on time series that are strongly correlated. I think M5 was referenced in 01c, so it might be good to use "cross-learning" there instead of "global learning"!

Originally posted by @shlok191 in #6551 (comment)


@geetu040 (Contributor) commented:
@Xinyu-Wu-0000, may I ask why you created the split like this (1)

# import assumed from sktime's internal testing utilities
from sktime.utils._testing.hierarchical import _make_hierarchical

data = _make_hierarchical(
    (5, 100),  # 5 outer groups x 100 instances
    n_columns=2,
    max_timepoints=10,
    min_timepoints=10,
)
# extract the numeric part of the instance label at index level 1
l1 = data.index.get_level_values(1).map(lambda x: int(x[3:]))
X_train = data.loc[l1 < 90, "c0"].to_frame()
y_train = data.loc[l1 < 90, "c1"].to_frame()
X_test = data.loc[l1 >= 80, "c0"].to_frame()
y_test = data.loc[l1 >= 80, "c1"].to_frame()
# drop the last 3 timepoints of each test series (the horizon to predict)
y_test = y_test.groupby(level=[0, 1]).apply(lambda x: x.droplevel([0, 1]).iloc[:-3])

instead of doing it like this? (2)

y_train, y_test, X_train, X_test = temporal_train_test_split(y, X)

The latter approach splits at the innermost (time) level and the former approach splits at the 2nd (instance) level. Is there a reason for choosing this level? Is it related to global forecasting in some way?

@XinyuWuu (Member, Author) commented Jul 12, 2024

Is there a reason for choosing this level? is it related to global forecasting in some way?

Yes, global forecasting is the reason. The second approach splits the data at the time index level, but the first approach splits the data at the instance level (the 1st or 2nd index level). Global forecasting is about fitting and predicting on different instances.
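As an illustration (a minimal sketch with plain pandas; the instance names and the `instance`/`time` level names are made up here, not sktime's `_make_hierarchical` output), the two split styles differ as follows:

```python
import numpy as np
import pandas as pd

# Toy panel: 3 instances x 4 monthly time points
idx = pd.MultiIndex.from_product(
    [["i0", "i1", "i2"], pd.period_range("2024-01", periods=4, freq="M")],
    names=["instance", "time"],
)
y = pd.DataFrame({"c1": np.arange(12.0)}, index=idx)

# (2) temporal split: each instance contributes its early timepoints to
# train and its late timepoints to test
t = y.index.get_level_values("time")
y_train_t, y_test_t = y[t <= "2024-02"], y[t > "2024-02"]

# (1) instance-level split: whole instances go to train or test, so the
# forecaster must predict for instances it never saw during fit
inst = y.index.get_level_values("instance")
y_train_i, y_test_i = y[inst != "i2"], y[inst == "i2"]

print(len(y_train_t), len(y_test_t))  # 6 6
print(len(y_train_i), len(y_test_i))  # 8 4
```

The temporal split keeps every instance visible at fit time; the instance-level split is what makes the setting "global".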

@yarnabrina (Member) left a comment:

Made two comments on the extension template. For some reason, I cannot see the notebook in the GitHub UI; it says "Unable to render rich display".

I have an off-topic question about global forecasting. Suppose we train a model for series 1 and 2; global forecasting should be able to predict for series 3, 4, etc. as well. If I pass no y in predict, versus (just 1), (just 2), or (1 and 2 together), are we expecting the same predictions in all cases, if random_state-like parameters are set?

}

# todo: add any hyper-parameters and components to constructor
def __init__(self, parama, paramb="default", paramc=None, broadcasting=True):
Member:

Are you suggesting that broadcasting must be a parameter, or do you want to enforce that it must be the last parameter? Currently it may suggest that all model-specific parameters must be added before mandatorily adding broadcasting.

Member Author:

I am not suggesting to enforce it.

# (for_global)
self.broadcasting = broadcasting
if self.broadcasting:
    self.set_tags(
        **{
            "y_inner_mtype": "pd.Series",
            "X_inner_mtype": "pd.DataFrame",
            "capability:global_forecasting": False,
        }
    )
# If you are extending an existing forecaster to global mode, you might
# need to use the broadcasting parameter to preserve the original behavior.
# You can use a deprecation cycle to switch the default behavior.
# How deprecation works in sktime: https://www.sktime.net/en/stable/developer_guide/deprecation.html
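As a hedged illustration of the mechanics above (the class and its tag handling are simplified stand-ins, not actual sktime internals), a broadcasting flag can downgrade the capability tag at construction time:

```python
# Illustrative sketch of toggling the global-forecasting capability tag
class MyGlobalForecaster:
    # class-level default tags; the class natively supports global forecasting
    _tags = {"capability:global_forecasting": True}

    def __init__(self, broadcasting=True):
        self.broadcasting = broadcasting
        self._tags_dynamic = dict(self._tags)
        if self.broadcasting:
            # broadcast per series: inner fit/predict see a single series,
            # so the global-forecasting capability is switched off
            self._tags_dynamic["capability:global_forecasting"] = False

    def get_tag(self, tag_name):
        return self._tags_dynamic[tag_name]

print(MyGlobalForecaster(broadcasting=True).get_tag("capability:global_forecasting"))   # False
print(MyGlobalForecaster(broadcasting=False).get_tag("capability:global_forecasting"))  # True
```

With `broadcasting=True` as the default, an existing forecaster keeps its original per-series behavior until a deprecation cycle flips the default.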

descriptive explanation of paramb
paramc : boolean, optional (default= whether paramb is not the default)
descriptive explanation of paramc
and so on
Member:

I would suggest adding broadcasting here, and mentioning the role of that parameter to new contributors.

Question: when we make changes to an existing estimator, adding a broadcasting control makes sense. If we get a new estimator, built specifically for global forecasting, do we still enforce this?

Member Author:

I would suggest adding broadcasting here, and mentioning the role of that parameter to new contributors.

Yeah, I will add some explanation here.

Question: when we make changes to an existing estimator, adding a broadcasting control makes sense. If we get a new estimator, built specifically for global forecasting, do we still enforce this?

I think it depends on the specific forecaster; it is not enforced.

@XinyuWuu (Member, Author) commented Aug 2, 2024

For some reason, I cannot see the notebook in the GitHub UI; it says "Unable to render rich display".

You may want to use the "View file" option.

[screenshot: the "View file" option in the GitHub diff menu]

@XinyuWuu (Member, Author) commented Aug 2, 2024

I have an off-topic question about global forecasting. Suppose we train a model for series 1 and 2; global forecasting should be able to predict for series 3, 4, etc. as well. If I pass no y in predict, versus (just 1), (just 2), or (1 and 2 together), are we expecting the same predictions in all cases, if random_state-like parameters are set?

I think it depends on the forecaster. If it is a simple fully connected network, I would expect it to give the same predictions. If there are dropout layers or other random layers, I am not sure it will give the same output. Even some convolution layers, RNN layers, or simple operations like torch.Tensor.index_add_() can be nondeterministic. According to the PyTorch documentation, it seems to be quite complicated. Even if random_state-like parameters are set, I am not sure we can achieve deterministic output. The order in which series are predicted could also affect the output: if we first predict on series 1, the state of the random generator changes, so the following predictions are affected by the first prediction. And if we pass multiple series together, how they are batched could be really complicated.
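The order effect can be sketched with plain numpy (illustrative only; `noisy_predict` is a made-up stand-in for a forecaster with a stochastic layer, not sktime or PyTorch code):

```python
import numpy as np

def noisy_predict(series, rng):
    # stand-in for a forecaster with a stochastic layer (e.g. dropout):
    # each call consumes one draw from the shared generator
    return float(np.mean(series)) + rng.normal(scale=0.1)

s1, s2 = [1.0, 2.0, 3.0], [4.0, 5.0, 6.0]

rng = np.random.default_rng(0)
first_s1 = noisy_predict(s1, rng)   # s1 predicted first

rng = np.random.default_rng(0)      # same seed, different order
_ = noisy_predict(s2, rng)          # s2 consumes the first draw
second_s1 = noisy_predict(s1, rng)  # s1 now sees a different generator state

print(first_s1 == second_s1)  # False: order changed the prediction for s1
```

Even with the seed fixed, the prediction for series 1 depends on whether another series was processed before it, because both share one generator state.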

@fkiraly (Collaborator) left a comment:

Made some minor changes related to terminology and formatting, hope this is ok.

@fkiraly fkiraly merged commit b1d951d into sktime:main Aug 10, 2024
16 of 17 checks passed
Labels: none
Project status: Done
Linked issue this PR may close: [DOC] Global Forecast Example
4 participants