
Context window handling in NLP - Model Pipeline Trace

Model Pipeline - Context window handling

This pipeline shows how text data is processed in chunks called context windows to help a language model understand and predict words better.

Data Flow - 4 Stages

Stage 1: Raw Text Input
Input: 1 document with 1000 words -> Action: Receive the full text document -> Output: 1 document with 1000 words
"The quick brown fox jumps over the lazy dog ..."

Stage 2: Tokenization
Input: 1 document with 1000 words -> Action: Split text into tokens (words or subwords) -> Output: 1 document with 1200 tokens
["The", "quick", "brown", "fox", "jump", "##s", ...]

Stage 3: Context Windowing
Input: 1 document with 1200 tokens -> Action: Split tokens into overlapping windows of 100 tokens each, advancing 50 tokens at a time -> Output: 23 windows x 100 tokens
[Window 1: tokens 1-100, Window 2: tokens 51-150, ...]

Stage 4: Model Input Preparation
Input: 23 windows x 100 tokens -> Action: Convert tokens to numerical IDs and add special tokens -> Output: 23 windows x 100 token IDs
[[101, 2003, 2204, ...], [101, 2204, 2024, ...], ...]
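The four stages above can be sketched in a few lines of Python. This is a toy illustration, not a real NLP pipeline: the whitespace tokenizer, the made-up vocabulary, and the `101` special-token ID are stand-ins for what a subword tokenizer such as WordPiece would produce. The window math, however, matches the trace: 1200 tokens, windows of 100, stride 50, giving 23 windows.

```python
def tokenize(text):
    # Stage 2: split text into tokens (toy version: whitespace split;
    # real pipelines use subword tokenizers such as WordPiece or BPE).
    return text.split()

def make_windows(tokens, size=100, stride=50):
    # Stage 3: overlapping windows of `size` tokens, advancing by `stride`.
    windows = []
    for start in range(0, len(tokens), stride):
        windows.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break
    return windows

def to_ids(window, vocab, cls_id=101):
    # Stage 4: map tokens to IDs and prepend a [CLS]-style special token,
    # truncating so each window stays a fixed 100 IDs. The vocab values
    # here are invented; only the fixed-length shape matters.
    ids = [cls_id] + [vocab.setdefault(t, len(vocab) + 200) for t in window]
    return ids[:len(window)]

tokens = [f"tok{i}" for i in range(1200)]   # stand-in for 1200 real tokens
windows = make_windows(tokens)
print(len(windows))   # 23 windows, as in the trace
```

Because the stride (50) is half the window size (100), every token except those at the document's edges appears in two windows, which is what lets the model see relationships that cross a window boundary.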
Training Trace - Epoch by Epoch

Loss
2.5 |****
2.0 |***
1.5 |**
1.0 |*
0.5 |
    +-----
     1 2 3 4 5  Epochs
Epoch  Loss ↓  Accuracy ↑  Observation
1      2.3     0.30        Model starts learning basic word patterns
2      1.8     0.45        Loss decreases as model understands context windows better
3      1.4     0.60        Model improves predictions using overlapping windows
4      1.1     0.70        Context window handling helps capture longer dependencies
5      0.9     0.78        Training converges with good understanding of context
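The shape of this trace comes from a simple loop: each epoch is one pass over all 23 context windows, and the mean loss is recorded per epoch. The sketch below shows only that structure; `ToyModel` is a made-up stand-in whose loss simply shrinks each step to mimic the table's downward trend, not a real language model.

```python
class ToyModel:
    # Hypothetical stand-in: its "loss" decays multiplicatively per step,
    # imitating the improving trend in the table above.
    def __init__(self):
        self.loss = 2.5

    def step(self, batch):
        # Pretend one forward/backward/update pass improved the model.
        self.loss *= 0.95
        return self.loss

def train(model, window_batches, epochs=5):
    # One pass over all context windows per epoch; record the mean loss.
    history = []
    for _ in range(epochs):
        epoch_loss = sum(model.step(b) for b in window_batches)
        history.append(epoch_loss / len(window_batches))
    return history

history = train(ToyModel(), [["w1"], ["w2"], ["w3"]])
```

In a real setup the inner `model.step` would be a framework's forward pass, loss computation, and optimizer update over a batch of windowed token IDs.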
Prediction Trace - 4 Layers
Layer 1: Input Window Selection
Layer 2: Token Embedding Layer
Layer 3: Transformer Layers
Layer 4: Output Layer
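The four prediction layers can be wired together as plain functions to show the data flow. Everything here is a toy stand-in, assumed for illustration only: the embedding lookup, the "transformer" mixing step, and the 10-entry vocabulary are invented; only the order of the layers mirrors the trace above.

```python
def select_window(token_ids, pos, size=100, stride=50):
    # Layer 1: choose the stride-aligned window covering position `pos`.
    start = max(0, min(pos - pos % stride, len(token_ids) - size))
    return token_ids[start:start + size]

def embed(window, dim=4):
    # Layer 2: look up a (toy) embedding vector for each token ID.
    return [[(tid * (j + 1)) % 7 / 7.0 for j in range(dim)] for tid in window]

def transformer(vectors):
    # Layer 3: stand-in for self-attention: blend each position with the
    # window mean so every token "sees" the whole window.
    dim = len(vectors[0])
    mean = [sum(v[j] for v in vectors) / len(vectors) for j in range(dim)]
    return [[(v[j] + mean[j]) / 2 for j in range(dim)] for v in vectors]

def output_logits(vectors, vocab_size=10):
    # Layer 4: project the final position to one score per vocab entry.
    last = vectors[-1]
    return [sum(x * (k + 1) for x in last) for k in range(vocab_size)]

ids = list(range(300))
logits = output_logits(transformer(embed(select_window(ids, 120))))
```

The key structural point is that the model only ever sees one fixed-size window at a time; the window-selection layer decides which 100-token slice reaches the embedding and transformer layers.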
Model Quiz - 3 Questions
Test your understanding
Why do we split text into overlapping context windows?
A. To help the model understand longer text by focusing on smaller parts
B. To reduce the number of tokens in the text
C. To remove unimportant words from the text
D. To make the text shorter for faster reading
Key Insight
Handling text in overlapping context windows helps language models understand longer passages by focusing on smaller, manageable chunks. This improves prediction accuracy: the model learns relationships within each window, and because neighboring windows share tokens in the overlap, it also picks up relationships that cross window boundaries.
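The benefit of the overlap can be checked directly: with a 50-token stride and 100-token windows, any span of up to 50 tokens that crosses one window's edge still fits entirely inside the next window. The helper below is a small illustration with the trace's toy numbers, not part of any real library.

```python
def covering_window(span_start, span_end, size=100, stride=50):
    # Return the start of a window fully containing [span_start, span_end),
    # or None if no single window covers the span.
    for w_start in range(0, span_start + 1, stride):
        if w_start <= span_start and span_end <= w_start + size:
            return w_start
    return None

# A dependency at tokens 95-110 crosses the edge of the window 0-100 ...
print(covering_window(95, 110))  # ... but fits whole in the window at 50
```

A span longer than the stride margin (e.g. 120 tokens against a 100-token window) fits in no single window; that is the case where a model needs a larger context window, not just overlap.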