Experiment - Why architecture design impacts performance
Problem:We want to classify images of handwritten digits (0-9) using a neural network. The current model is a simple neural network with one hidden layer.
Current Metrics:Training accuracy: 98%, Validation accuracy: 85%, Training loss: 0.05, Validation loss: 0.45
Issue:The model overfits the training data. It learns training data very well but does not generalize well to new images.
