Experiment - Image understanding and description
Problem:We want to build a model that looks at an image and writes a short sentence describing what it sees. Currently, the model is very good at describing training images but makes many mistakes on new images it has never seen.
Current Metrics:Training accuracy: 95%, Validation accuracy: 65%, Validation loss: 1.2
Issue:The model is overfitting. It performs very well on training images but poorly on validation images, showing it does not generalize well.