NLPml~15 mins

Extractive QA concept in NLP - Deep Dive

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Overview - Extractive QA concept

What is it?

Extractive Question Answering (QA) is a method where a system finds the exact answer to a question by selecting a piece of text from a given document or passage. Instead of generating new text, it extracts the answer directly from the source. This helps machines understand and respond to questions using existing information.

Why it matters

Extractive QA solves the problem of quickly finding precise answers from large amounts of text, like articles or reports. Without it, people would have to read everything themselves, which is slow and tiring. It powers search engines, virtual assistants, and customer support by giving fast, accurate answers.

Where it fits

Before learning Extractive QA, you should understand basic natural language processing concepts like tokenization and embeddings. After this, you can explore generative QA, where answers are created rather than extracted, and advanced models like transformers for better understanding.

Mental Model

Core Idea

Extractive QA works by scanning a text to find the exact part that answers a question, like highlighting a sentence in a book.

Think of it like...

Imagine you have a big book and someone asks you a question. Instead of rewriting the answer, you flip through the pages and point to the exact sentence that answers them.

┌───────────────┐
│   Question    │
└──────┬────────┘
       │
       ▼
┌─────────────────────────────┐
│       Text Passage           │
│  ┌───────────────────────┐  │
│  │  Extracted Answer     │  │
│  │  (Exact text snippet) │  │
│  └───────────────────────┘  │
└─────────────────────────────┘

Build-Up - 7 Steps

FoundationUnderstanding Questions and Text

Concept: Learn what questions and text passages are in simple terms and how they relate.

A question is something you want to know, like 'What is the capital of France?'. A text passage is a piece of writing that might contain the answer, like a paragraph about France. Extractive QA finds the answer inside this passage.

Result

You can identify a question and a text passage that might contain the answer.

Understanding the basic elements of question and text is essential before trying to find answers inside text.

FoundationWhat Does Extractive Mean?

IntermediateHow Models Find Answers in Text

IntermediateRole of Contextual Word Representations

IntermediateTraining Extractive QA Models

AdvancedHandling No-Answer Questions

ExpertLimitations and Biases in Extractive QA

Under the Hood

Extractive QA models use deep neural networks, often transformers, to encode both the question and passage into vectors. They then compute scores for each token's likelihood of being the start or end of the answer span. The highest scoring span is selected as the answer. During training, the model learns to assign high scores to correct spans by minimizing a loss function comparing predictions to true answers.

Why designed this way?

Extractive QA was designed to provide precise answers without generating new text, reducing errors and hallucinations common in generative models. Using start and end positions simplifies the problem to span prediction, which is easier to train and interpret. Transformers were chosen for their ability to capture context and relationships in text effectively.

┌───────────────┐      ┌───────────────┐
│   Question    │      │  Text Passage  │
└──────┬────────┘      └──────┬────────┘
       │                      │
       │                      │
       ▼                      ▼
  ┌─────────────────────────────────┐
  │      Transformer Encoder         │
  │  (jointly encodes question &    │
  │   passage into contextual tokens)│
  └──────────────┬──────────────────┘
                 │
                 ▼
       ┌─────────────────────┐
       │ Start & End Scorers  │
       │ (predict answer span)│
       └─────────┬───────────┘
                 │
                 ▼
       ┌─────────────────────┐
       │ Extracted Answer Span│
       └─────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does Extractive QA generate new sentences as answers? Commit to yes or no.

Common Belief:Extractive QA creates new answers by writing sentences based on the question.

Tap to reveal reality

Quick: Do you think Extractive QA always finds the correct answer if it exists in the text? Commit to yes or no.

Common Belief:If the answer is in the text, the model will always find it correctly.

Tap to reveal reality

Quick: Does Extractive QA require the entire document to answer a question? Commit to yes or no.

Common Belief:Extractive QA models need the full document to find answers.

Tap to reveal reality

Quick: Can Extractive QA handle questions with no answers in the text? Commit to yes or no.

Common Belief:Extractive QA always returns an answer, even if the text doesn't contain it.

Tap to reveal reality

Expert Zone

Extractive QA models often rely heavily on the quality and length of the input passage; too short or too long passages can reduce accuracy.

The choice of tokenizer and how text is split into tokens can affect the exact span predicted, impacting answer precision.

Fine-tuning on domain-specific data significantly improves performance, as general models may miss specialized terminology or phrasing.

When NOT to use

Extractive QA is not suitable when answers require synthesis, summarization, or reasoning beyond text spans. In such cases, generative QA or multi-hop reasoning models are better alternatives.

Production Patterns

In real systems, Extractive QA is combined with document retrieval to first find relevant passages, then extract answers. It is also used with confidence scoring to decide when to show answers or ask for human help.

Connections

Information Retrieval

Extractive QA builds on Information Retrieval by first locating relevant documents or passages before extracting answers.

Understanding retrieval helps grasp how QA systems narrow down large text collections to manageable chunks for answer extraction.

Span Prediction in NLP

Extractive QA is a specific application of span prediction, where models identify start and end positions in text.

Knowing span prediction techniques clarifies the core mechanism behind answer extraction.

Legal Document Review

Extractive QA techniques are used in legal tech to find exact clauses or facts in contracts and laws.

Seeing Extractive QA applied in law shows its power to speed up complex text analysis in real-world fields.

Common Pitfalls

#1Expecting the model to generate answers not present in the text.

Wrong approach:Question: 'Who invented the telephone?' Passage: 'Alexander Graham Bell was a scientist.' Model output: 'Thomas Edison' (made-up answer)

Correct approach:Model should return 'no answer' or the exact text if present, e.g., 'Alexander Graham Bell' if mentioned.

Root cause:Misunderstanding that Extractive QA only selects text, not generates new information.

#2Feeding very long documents directly to the model without splitting.

Wrong approach:Inputting entire book text as one passage for answer extraction.

Correct approach:Split the document into smaller passages or paragraphs before running Extractive QA.

Root cause:Ignoring model input length limits and performance degradation on long texts.

#3Ignoring no-answer detection and always trusting model output.

Wrong approach:Accepting any extracted span as correct answer even if irrelevant.

Correct approach:Use models or thresholds that can indicate no-answer when appropriate.

Root cause:Not accounting for cases where the passage lacks the answer.

Key Takeaways

Extractive QA finds answers by selecting exact text spans from a passage, not by creating new text.

Models predict start and end positions of answers using deep learning and contextual understanding.

Handling no-answer cases is essential for reliable real-world QA systems.

Training requires examples with questions, passages, and exact answer spans to learn effectively.

Extractive QA works best combined with document retrieval and careful passage selection.

Practice

(1/5)

1. What is the main goal of extractive question answering (QA)?

easy

A. To translate the question into another language

B. To generate a new answer not present in the text

C. To summarize the entire text into a short paragraph

D. To find the exact answer span within a given text

Extractive QA concept in NLP - Deep Dive

Start learning this pattern below

Practice

Solution

Step 1: Understand extractive QA purpose

Step 2: Compare options with definition

Final Answer:

Quick Check:

Solution

Step 1: Recall extractive QA output format

Step 2: Match options to output format

Final Answer:

Quick Check:

Solution

Step 1: Understand question and context

Step 2: Identify exact answer span

Final Answer:

Quick Check:

Solution

Step 1: Analyze index values

Step 2: Understand slicing behavior

Final Answer:

Quick Check:

Solution

Step 1: Understand the problem of missing answers

Step 2: Evaluate solution options

Final Answer:

Quick Check: