A. Divide by len(retrieved_docs) instead of len(relevant_docs)
B. Use union instead of intersection in numerator
C. Convert lists to tuples before set operations
D. No bug, code is correct
Solution
Step 1: Understand precision formula
Precision = relevant retrieved / total retrieved, so denominator must be retrieved docs count.
Step 2: Identify denominator mistake
Code divides by len(relevant_docs), which is recall formula denominator.
Step 3: Fix denominator
Change denominator to len(retrieved_docs) to compute precision correctly.
Final Answer:
Divide by len(retrieved_docs) instead of len(relevant_docs) -> Option A
Quick Check:
Precision denominator = retrieved docs count [OK]
Hint: Precision divides by retrieved docs count, not relevant docs [OK]
Common Mistakes:
Mixing precision with recall formula
Using union instead of intersection
Ignoring set conversion issues
5. You want to evaluate a RAG model combining answer F1 score and retrieval precision into a single metric. Which approach is best to fairly combine these metrics?
hard
A. Add F1 score and retrieval precision directly
B. Calculate the harmonic mean of F1 score and retrieval precision
C. Use only the higher of the two scores
D. Multiply F1 score by retrieval precision without normalization
Solution
Step 1: Understand metric combination needs
Combining metrics requires balancing both scores fairly, avoiding dominance by one.
Step 2: Evaluate combination methods
Harmonic mean balances low and high values well; addition or multiplication can skew results.
Step 3: Choose harmonic mean
Harmonic mean is common for combining precision and recall, so it suits combining F1 and retrieval precision.
Final Answer:
Calculate the harmonic mean of F1 score and retrieval precision -> Option B
Quick Check:
Harmonic mean balances combined metrics [OK]
Hint: Use harmonic mean to balance combined metrics fairly [OK]