Recall & Review

beginner

What is answer span extraction in NLP?

Answer span extraction is the task of finding the exact part (span) of a text that answers a question. It locates the start and end positions of the answer within a passage.

Click to reveal answer

beginner

Why do models predict start and end positions for answers instead of generating text?

Predicting start and end positions helps models find the exact answer inside the given text. It is simpler and more accurate than generating new text, especially when the answer is a direct excerpt.

Click to reveal answer

intermediate

What kind of model output is used for answer span extraction?

Models output two probability distributions: one for the start position and one for the end position of the answer span. The highest probabilities indicate the predicted answer boundaries.

Click to reveal answer

intermediate

How is the training loss calculated for answer span extraction models?

The loss is usually the sum of two cross-entropy losses: one comparing predicted start positions to true start, and one comparing predicted end positions to true end. This guides the model to predict correct spans.

Click to reveal answer

advanced

What is a common challenge when extracting answer spans from long passages?

Long passages can have multiple similar phrases, making it hard to pick the correct span. Also, the answer might be spread out or require understanding context beyond simple matching.

Click to reveal answer

In answer span extraction, what do models predict?

AStart and end positions of the answer in the text

BThe full generated answer text

COnly the start position of the answer

DThe question category

Which loss function is commonly used to train answer span extraction models?

AMean squared error

BCross-entropy loss

CHinge loss

DCosine similarity

Why is answer span extraction preferred over answer generation in some QA tasks?

AIt requires less computation

BIt always produces longer answers

CIt finds exact text spans, improving accuracy

DIt does not need training data

What is a typical output format of an answer span extraction model?

ATwo probability distributions over tokens for start and end

BA single probability for the whole answer

CA list of possible answers

DA confidence score only

What makes answer span extraction challenging in long texts?

AAnswers are always at the start

BModels cannot handle long texts

CAnswers are never in the text

DMultiple similar phrases can confuse the model

Explain how answer span extraction models find answers in a passage.

Describe challenges faced when extracting answer spans from long or complex passages.

Practice

(1/5)

1. What is the main goal of answer span extraction in NLP?

easy

A. To generate new text based on a prompt

B. To find the exact part of text that answers a question

C. To summarize long documents into short sentences

D. To translate text from one language to another

Answer span extraction in NLP - Cheat Sheet & Quick Revision

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of answer span extraction

Step 2: Compare with other NLP tasks

Final Answer:

Quick Check:

Solution

Step 1: Identify typical data types for positions

Step 2: Evaluate options

Final Answer:

Quick Check:

Solution

Step 1: Identify tokens and their indices

Step 2: Extract tokens from start to end index

Final Answer:

Quick Check:

Solution

Step 1: Understand the problem with indices

Step 2: Choose a fix that preserves valid spans

Final Answer:

Quick Check:

Solution

Step 1: Understand logits for start and end tokens

Step 2: Combine logits to find best span

Final Answer:

Quick Check: