0
0
NLPml~5 mins

Answer span extraction in NLP - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is answer span extraction in NLP?
Answer span extraction is the task of finding the exact part (span) of a text that answers a question. It locates the start and end positions of the answer within a passage.
Click to reveal answer
beginner
Why do models predict start and end positions for answers instead of generating text?
Predicting start and end positions helps models find the exact answer inside the given text. It is simpler and more accurate than generating new text, especially when the answer is a direct excerpt.
Click to reveal answer
intermediate
What kind of model output is used for answer span extraction?
Models output two probability distributions: one for the start position and one for the end position of the answer span. The highest probabilities indicate the predicted answer boundaries.
Click to reveal answer
intermediate
How is the training loss calculated for answer span extraction models?
The loss is usually the sum of two cross-entropy losses: one comparing predicted start positions to true start, and one comparing predicted end positions to true end. This guides the model to predict correct spans.
Click to reveal answer
advanced
What is a common challenge when extracting answer spans from long passages?
Long passages can have multiple similar phrases, making it hard to pick the correct span. Also, the answer might be spread out or require understanding context beyond simple matching.
Click to reveal answer
In answer span extraction, what do models predict?
AStart and end positions of the answer in the text
BThe full generated answer text
COnly the start position of the answer
DThe question category
Which loss function is commonly used to train answer span extraction models?
AMean squared error
BCross-entropy loss
CHinge loss
DCosine similarity
Why is answer span extraction preferred over answer generation in some QA tasks?
AIt requires less computation
BIt always produces longer answers
CIt finds exact text spans, improving accuracy
DIt does not need training data
What is a typical output format of an answer span extraction model?
ATwo probability distributions over tokens for start and end
BA single probability for the whole answer
CA list of possible answers
DA confidence score only
What makes answer span extraction challenging in long texts?
AAnswers are always at the start
BModels cannot handle long texts
CAnswers are never in the text
DMultiple similar phrases can confuse the model
Explain how answer span extraction models find answers in a passage.
Think about how the model points to parts of the text instead of creating new words.
You got /4 concepts.
    Describe challenges faced when extracting answer spans from long or complex passages.
    Consider why picking the right part of a long text can be tricky.
    You got /4 concepts.