Experiment - Answer span extraction
Problem: We want to build a model that, given a question and a paragraph, finds the exact answer span in the paragraph. Currently, the model predicts the start and end positions of the answer in the text.
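To make the start/end formulation concrete, here is a minimal sketch of how a span might be decoded from per-token scores. It is not the experiment's actual code: the function name `best_span`, the `max_answer_len` cap, and the toy scores are assumptions for illustration. It picks the pair of positions with the highest combined score, subject to the end not preceding the start.

```python
from typing import List, Tuple


def best_span(
    start_scores: List[float],
    end_scores: List[float],
    max_answer_len: int = 30,  # hypothetical cap on span length, in tokens
) -> Tuple[int, int]:
    """Return inclusive (start, end) token indices that maximize
    start_scores[start] + end_scores[end] with start <= end and a span
    of at most max_answer_len tokens."""
    if not start_scores or len(start_scores) != len(end_scores):
        raise ValueError("score lists must be non-empty and of equal length")
    if max_answer_len < 1:
        raise ValueError("max_answer_len must be at least 1")

    n = len(start_scores)
    best = (0, 0)
    best_score = float("-inf")
    for s in range(n):
        # Only consider ends at or after the start, within the length cap.
        for e in range(s, min(n, s + max_answer_len)):
            score = start_scores[s] + end_scores[e]
            if score > best_score:
                best_score = score
                best = (s, e)
    return best


# Toy scores (made up): independent argmax would give start=1, end=0,
# which is not a valid span. The constrained search avoids that.
start = [0.1, 2.0, 0.3, 0.0]
end = [3.0, 0.2, 1.5, 0.1]
print(best_span(start, end))  # (1, 2)
```

The double loop is quadratic in paragraph length when the cap is large; it is kept simple here because the point is the validity constraint, not speed.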
Current Metrics: training loss 0.15, training accuracy (exact match) 85%; validation loss 0.40, validation accuracy (exact match) 65%.
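For reference, exact-match accuracy over predicted spans can be computed roughly as follows. This is a sketch under the assumption that each prediction and each gold answer is a single inclusive (start, end) index pair; the function name `exact_match_accuracy` and the sample spans are hypothetical, and the experiment may instead compare normalized answer strings.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # inclusive (start, end) token indices


def exact_match_accuracy(predicted: List[Span], gold: List[Span]) -> float:
    """Fraction of examples whose predicted span equals the gold span
    on both the start and the end index."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        raise ValueError("need at least one example")
    hits = sum(1 for p, g in zip(predicted, gold) if p == g)
    return hits / len(gold)


# Made-up spans: the second prediction has the right start but the wrong end,
# so it counts as a miss under exact match.
predicted = [(1, 2), (0, 0), (3, 5), (4, 4)]
gold = [(1, 2), (0, 1), (3, 5), (4, 4)]
print(exact_match_accuracy(predicted, gold))  # 0.75
```

Exact match gives no partial credit, so a span that is off by one token scores the same as a completely wrong one.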
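Issue: The model is overfitting. Training exact-match accuracy is 85% but validation exact-match accuracy is only 65%, a gap of 20 percentage points, and validation loss (0.40) is well above training loss (0.15).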
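The size of the train/validation gap can be worked out directly from the reported metrics. The snippet below is only an arithmetic sketch; the dictionary layout and the helper name `generalization_gaps` are assumptions, and the numbers are the ones quoted in this write-up, expressed as fractions.

```python
from typing import Dict

# Metrics as reported in this write-up.
train = {"loss": 0.15, "exact_match": 0.85}
val = {"loss": 0.40, "exact_match": 0.65}


def generalization_gaps(train: Dict[str, float], val: Dict[str, float]) -> Dict[str, float]:
    """Return how much exact match drops and how much loss rises
    when moving from the training set to the validation set."""
    return {
        # Rounded to hide floating-point noise in the subtraction.
        "exact_match_drop": round(train["exact_match"] - val["exact_match"], 4),
        "loss_increase": round(val["loss"] - train["loss"], 4),
    }


print(generalization_gaps(train, val))
# {'exact_match_drop': 0.2, 'loss_increase': 0.25}
```

Tracking these two gaps per epoch, rather than only at the end of training, would show when the model starts to fit the training set at the expense of validation performance.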