Experiment - Fallback and error handling
Problem:You have a text generation AI model that sometimes produces irrelevant or nonsensical answers when given unusual or ambiguous questions.
Current Metrics:On a test set of 100 queries, 15% of the outputs are irrelevant or incorrect, causing poor user experience.
Issue:The model lacks fallback and error handling mechanisms to detect and correct bad outputs.