I even read in a book by Giuseppe Bonaccorso that the training set can sometimes be extended to 98% of the total data. Does this depend on the size of the dataset? If I have 2000+ daily observations and intend to forecast 30 days ahead, does it make sense to use 98% of the data for training?
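To make the numbers concrete, here is a minimal sketch of the split in question. It assumes exactly 2000 daily observations and a chronological (unshuffled) split; the helper name `chronological_split` is only illustrative and does not come from any library or from the book.

```python
def chronological_split(n_obs, train_pct):
    """Return (n_train, n_test) for a time-ordered split with no shuffling.

    train_pct is an integer percentage, which avoids floating-point rounding.
    """
    n_train = n_obs * train_pct // 100
    return n_train, n_obs - n_train


n_obs = 2000    # assumption: exactly 2000 daily observations
horizon = 30    # forecast horizon from the question, in days

n_train, n_test = chronological_split(n_obs, 98)

# Does the hold-out set span at least one full forecast horizon?
covers_horizon = n_test >= horizon

print(n_train, n_test, covers_horizon)
```

Under these assumptions the hold-out set has 40 observations, which is only slightly longer than the 30-day horizon I want to forecast.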

Your answers are highly appreciated!
