Experiment - Why regularization controls overfitting
Problem:We want to train a neural network to classify handwritten digits from the MNIST dataset. The current model fits the training data very well but performs poorly on new data.
Current Metrics:Training accuracy: 98%, Validation accuracy: 82%, Training loss: 0.05, Validation loss: 0.45
Issue:The model is overfitting: it learns the training data too well but does not generalize to validation data.
