Experiment - Weight decay (L2 regularization)
Problem:Train a neural network to classify handwritten digits from the MNIST dataset. The current model achieves 99% training accuracy but only 85% validation accuracy.
Current Metrics:Training accuracy: 99%, Validation accuracy: 85%, Training loss: 0.02, Validation loss: 0.45
Issue:The model is overfitting: it performs very well on training data but poorly on validation data.
