Experiment - Why QA systems extract answers
Problem:We want to build a Question Answering (QA) system that reads a paragraph and extracts the exact answer to a question. Currently, the model gives long, vague answers that are not precise.
Current Metrics:Exact match accuracy: 55%, F1 score: 60%
Issue:The model is not extracting precise answers but generating longer, less accurate responses. This reduces usefulness in real applications.
