Recall & Review
beginner
What is the main difference between train-test split for time series data and for regular data?
In time series, the data is ordered by time, so the train-test split must keep this order to avoid using future data to predict the past. Regular data can be shuffled before splitting.
Click to reveal answer
beginner
Why should you never randomly shuffle time series data before splitting into train and test sets?
Random shuffling breaks the time order and can cause the model to learn from future information, which is unrealistic and leads to over-optimistic results.
Click to reveal answer
beginner
What is a common method to split time series data into train and test sets?
Use the earliest part of the data for training and the later part for testing, preserving the time order.
Click to reveal answer
intermediate
How does the size of the test set affect time series model evaluation?
A larger test set gives a better estimate of future performance but reduces training data size. A balance is needed to train well and evaluate reliably.
Click to reveal answer
intermediate
What is the purpose of using a rolling or expanding window approach in time series train-test splitting?
These approaches simulate real forecasting by repeatedly training on past data and testing on the next time step, helping to evaluate model stability over time.
Click to reveal answer
Why can't you randomly shuffle time series data before splitting into train and test sets?
✗ Incorrect
Random shuffling breaks the time order and causes the model to learn from future data, which is not realistic.
What is the typical way to split time series data for training and testing?
✗ Incorrect
Time series data must keep time order, so training uses earlier data and testing uses later data.
What does a rolling window approach do in time series model evaluation?
✗ Incorrect
Rolling window trains on a fixed-size past window and tests on the next time step repeatedly.
What is a risk of using too small a training set in time series?
✗ Incorrect
A small training set may not provide enough information for the model to learn well.
Which of these is NOT a valid reason to keep time order in train-test split for time series?
✗ Incorrect
Increasing randomness by shuffling is not valid for time series splitting.
Explain why preserving time order is important when splitting time series data into train and test sets.
Think about how time flows and why using future data to predict past is a problem.
You got /3 concepts.
Describe how a rolling window approach works for training and testing time series models.
Imagine sliding a small window over your data to train and test repeatedly.
You got /4 concepts.