I have one year of wind speed data, collected at hourly intervals. I am trying to train a forecasting model using this data. I want to be able to generate rolling forecasts up to 7 days in the future. How do I decide the amount of data to use in my training set. Should I use the whole one year data or should I use a subset of it. How can one determine this amount of data to be used while training. I understand that having more data points than the seasonality of the data is always good in time series forecasting. But the data is rather noisy and no seasonality is observed other than diurnal seasonality.

Similar questions and discussions