Challenge - 5 Problems
Factual Consistency Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate2:00remaining
What is the main goal of factual consistency checking in AI-generated text?
Choose the best description of what factual consistency checking aims to achieve in AI-generated content.
Attempts:
2 left
💡 Hint
Think about why we want AI to be truthful and accurate.
✗ Incorrect
Factual consistency checking ensures that AI-generated text does not contradict known facts or the source material. This helps maintain trust and reliability.
❓ Predict Output
intermediate2:00remaining
Output of a simple factual consistency check using cosine similarity
Given the following Python code snippet that compares embeddings of a source and generated sentence, what is the printed output?
Prompt Engineering / GenAI
from sklearn.metrics.pairwise import cosine_similarity import numpy as np source_embedding = np.array([[0.1, 0.3, 0.5]]) generated_embedding = np.array([[0.1, 0.3, 0.5]]) similarity = cosine_similarity(source_embedding, generated_embedding)[0][0] print(round(similarity, 2))
Attempts:
2 left
💡 Hint
Cosine similarity of identical vectors is 1.
✗ Incorrect
The cosine similarity between two identical vectors is 1, indicating perfect similarity.
❓ Model Choice
advanced2:00remaining
Best model type for factual consistency checking in text generation
Which model type is most suitable for checking factual consistency between a source document and generated summary?
Attempts:
2 left
💡 Hint
Think about a model that can say yes or no about factual correctness.
✗ Incorrect
A binary classifier trained on pairs of source and generated text to predict factual consistency is best suited for this task.
❓ Metrics
advanced2:00remaining
Which metric directly measures factual consistency in generated text?
Among these metrics, which one is designed specifically to evaluate factual consistency between generated text and source content?
Attempts:
2 left
💡 Hint
This metric was created for factual consistency evaluation.
✗ Incorrect
FactCC is a metric and model designed specifically to detect factual inconsistencies in generated summaries.
🔧 Debug
expert3:00remaining
Why does this factual consistency check code fail to detect errors?
Consider this Python code snippet that compares source and generated sentences using exact string match for factual consistency. Why might it fail to detect factual errors?
Prompt Engineering / GenAI
source = "The Eiffel Tower is in Paris." generated = "The Eiffel Tower is in Berlin." consistent = (source == generated) print(consistent)
Attempts:
2 left
💡 Hint
Think about what comparing two sentences by '==' really means.
✗ Incorrect
Checking exact string equality does not capture factual differences if sentences differ slightly in wording but have different facts.