Overview - Train/val/test split
What is it?
Train/val/test split is the process of dividing a dataset into three parts: training, validation, and testing. The training set is used to teach the model, the validation set helps tune the model's settings, and the test set checks how well the model works on new data. This helps ensure the model learns well and can make good predictions on data it hasn't seen before.
Why it matters
Without splitting data properly, a model might just memorize the examples it sees and fail to work on new data. This would make it useless in real life, like a student who only memorizes answers but can't solve new problems. Splitting data helps us build models that truly understand patterns and perform reliably.
Where it fits
Before this, you should understand what datasets and models are. After learning this, you will explore model training, tuning hyperparameters, and evaluating model performance.