Jump into concepts and practice - no test required
or
Recommended
Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong
Recall & Review
beginner
What is extractive summarization?
Extractive summarization is a method that creates a summary by selecting important sentences or phrases directly from the original text without changing them.
Click to reveal answer
beginner
How does extractive summarization differ from abstractive summarization?
Extractive summarization picks exact parts from the text, while abstractive summarization rewrites the content in new words to create a summary.
Click to reveal answer
intermediate
Name a common technique used in extractive summarization.
One common technique is scoring sentences based on word frequency or importance, then selecting the top scoring sentences for the summary.
Click to reveal answer
beginner
Why is extractive summarization easier to implement than abstractive summarization?
Because it only selects existing sentences without generating new text, it requires less complex language understanding and generation.
Click to reveal answer
intermediate
What is a limitation of extractive summarization?
It may produce summaries that are less coherent or natural because it only copies parts of the original text without rewriting.
Click to reveal answer
What does extractive summarization do?
ATranslates the text into another language
BGenerates new sentences to summarize the text
CRemoves all stop words from the text
DSelects important sentences from the original text
✗ Incorrect
Extractive summarization selects important sentences directly from the original text to form a summary.
Which of these is a common way to score sentences in extractive summarization?
AUsing a random number generator
BCounting word frequency
CTranslating sentences to another language
DReplacing words with synonyms
✗ Incorrect
Counting word frequency helps identify important sentences by how often key words appear.
Why might extractive summaries be less natural?
AThey translate text incorrectly
BThey use too many new words
CThey copy sentences without rewriting
DThey remove all punctuation
✗ Incorrect
Extractive summaries copy sentences as-is, which can make the summary less smooth or coherent.
Which summarization method rewrites content in new words?
AAbstractive summarization
BExtractive summarization
CKeyword extraction
DText translation
✗ Incorrect
Abstractive summarization generates new sentences to express the main ideas.
Extractive summarization is easier to implement because:
AIt does not generate new text
BIt requires complex language models
CIt translates text automatically
DIt summarizes by rewriting sentences
✗ Incorrect
Extractive summarization only selects existing sentences, so it needs less complex processing.
Explain extractive summarization and how it works in simple terms.
Think about picking key sentences like highlighting important parts in a book.
You got /4 concepts.
List advantages and disadvantages of extractive summarization.
Consider what is easy and what might feel unnatural about copying text directly.
You got /4 concepts.
Practice
(1/5)
1. What is the main goal of extractive summarization in NLP?
easy
A. To translate the text into another language
B. To rewrite the text using simpler words
C. To select important sentences from the original text to create a summary
D. To generate new sentences that explain the text
Solution
Step 1: Understand extractive summarization
Extractive summarization picks key sentences directly from the original text without changing them.
Step 2: Compare options
Only To select important sentences from the original text to create a summary describes selecting important sentences from the original text, which matches extractive summarization.
Final Answer:
To select important sentences from the original text to create a summary -> Option C
Hint: Extractive means picking from original text directly [OK]
Common Mistakes:
Confusing extractive with abstractive summarization
Thinking it rewrites or translates text
Assuming it generates new sentences
2. Which of the following is a common technique used in extractive summarization?
easy
A. Neural machine translation
B. Text generation with GPT
C. Part-of-speech tagging
D. TF-IDF scoring of sentences
Solution
Step 1: Identify techniques for extractive summarization
Extractive summarization often uses TF-IDF to score sentences by importance based on word frequency.
Step 2: Eliminate unrelated options
Neural machine translation and text generation are for other NLP tasks, and POS tagging is not directly used for summarization scoring.
Final Answer:
TF-IDF scoring of sentences -> Option D
Quick Check:
TF-IDF = common extractive technique [OK]
Hint: TF-IDF ranks sentence importance in extractive summarization [OK]
Common Mistakes:
Confusing summarization with translation or generation
Thinking POS tagging directly creates summaries
Ignoring TF-IDF's role in scoring
3. Given the following Python code snippet using TF-IDF for extractive summarization, what will be the output?
from sklearn.feature_extraction.text import TfidfVectorizer
texts = ["Cats are great pets.", "Dogs are loyal animals.", "Cats and dogs can live together."]
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(texts)
scores = X.sum(axis=1)
print(scores)
medium
A. [[0.0], [0.0], [0.0]]
B. [[2.0], [2.0], [2.4]]
C. [[2.0], [2.0], [3.0]]
D. [[1.0], [1.0], [1.0]]
Solution
Step 1: Understand TF-IDF vectorization and summing
The code vectorizes three sentences and sums TF-IDF scores per sentence (row-wise sum).
Step 2: Calculate approximate sums
Each sentence has TF-IDF scores summing roughly to 2.0, 2.0, and 2.4 respectively due to shared and unique words.
Final Answer:
[[2.0], [2.0], [2.4]] -> Option B
Quick Check:
Sum TF-IDF per sentence ≈ [[2.0], [2.0], [2.4]] [OK]
Hint: Sum TF-IDF scores per sentence to get importance [OK]
Common Mistakes:
Assuming zero scores for all sentences
Confusing sum with average
Misunderstanding TF-IDF output shape
4. You have this extractive summarization code snippet:
sentences = ["AI is fascinating.", "It helps solve problems.", "AI can learn from data."]
scores = [0.8, 0.9, 0.85]
summary = []
for i in range(len(sentences)):
if scores[i] > 0.85:
summary.append(sentences[i])
print(summary)
What is the output and is there any bug?
medium
A. ['It helps solve problems.'] with no bug
B. ['AI is fascinating.', 'It helps solve problems.', 'AI can learn from data.'] with no bug
C. ['It helps solve problems.', 'AI can learn from data.'] but index error bug
D. [] because scores are not compared correctly
Solution
Step 1: Check score filtering condition
The code adds sentences with scores > 0.85, so sentences with 0.9 and 0.85 are checked; 0.85 is not > 0.85, so only 0.9 and 0.85 fail or pass accordingly.
Step 2: Determine which sentences are included
Scores: 0.8 (no), 0.9 (yes), 0.85 (no). So only "It helps solve problems." is included. But 0.85 is not > 0.85, so excluded.
Final Answer:
['It helps solve problems.'] -> Option A
Quick Check:
Scores > 0.85 filter sentences correctly [OK]
Hint: Check strict > vs >= in score filtering [OK]
Common Mistakes:
Including sentences with score equal to threshold
Expecting index errors where none exist
Misreading the comparison operator
5. You want to create an extractive summarizer that picks the top 2 sentences from a document based on TF-IDF scores. Given these sentences and their scores: