Recall & Review

beginner

What is beam search decoding in NLP?

Beam search decoding is a method to find the most likely sequence of words by exploring multiple options at each step, keeping only the best few sequences (called beams) instead of just one.

Click to reveal answer

beginner

Why is beam search better than greedy search?

Beam search keeps multiple candidate sequences at each step, so it can avoid early mistakes that greedy search makes by choosing only the single best option at each step.

Click to reveal answer

intermediate

What does the beam width control in beam search decoding?

Beam width controls how many candidate sequences are kept at each step. A larger beam width means more sequences are considered, which can improve results but takes more time.

Click to reveal answer

intermediate

How does beam search handle sequence probabilities during decoding?

Beam search multiplies or adds the log probabilities of words in a sequence to score each candidate. It keeps the top sequences with the highest total scores at each step.

Click to reveal answer

intermediate

What is a common drawback of beam search decoding?

Beam search can still miss the best sequence if the beam width is too small, and it can be slower than greedy search because it keeps multiple candidates.

Click to reveal answer

What does beam search keep track of during decoding?

AMultiple best candidate sequences

BOnly the single best sequence

CRandom sequences

DAll possible sequences

What happens if you increase the beam width in beam search?

AThe model trains faster

BFewer sequences are considered, speeding up decoding

CMore sequences are considered, improving accuracy but increasing computation

DThe output becomes random

Which of these is a key difference between greedy search and beam search?

ABeam search ignores probabilities

BGreedy search keeps one best sequence; beam search keeps multiple

CGreedy search is slower than beam search

DBeam search only works for images

How does beam search score candidate sequences?

ABy summing or multiplying word probabilities

BBy counting word length

CBy random selection

DBy alphabetical order

What is a potential downside of using a very small beam width?

AIt will always find the best sequence

BIt will generate random outputs

CIt will be too slow

DIt may miss better sequences and behave like greedy search

Explain how beam search decoding works and why it is used instead of greedy search.

Describe the trade-offs involved in choosing the beam width for beam search decoding.

Practice

(1/5)

1. What is the main purpose of beam search decoding in natural language processing?

easy

A. To keep track of multiple best candidate sequences during prediction

B. To randomly select words for output generation

C. To generate only one possible output sequence

D. To speed up training by skipping steps

Beam search decoding in NLP - Cheat Sheet & Quick Revision

Start learning this pattern below

Practice

Solution

Step 1: Understand beam search goal

Step 2: Compare options

Final Answer:

Quick Check:

Solution

Step 1: Define beam width

Step 2: Eliminate incorrect options

Final Answer:

Quick Check:

Solution

Step 1: Calculate scores for all expansions

Step 2: Select top 2 sequences by score

Final Answer:

Quick Check:

Solution

Step 1: Analyze symptom of identical outputs

Step 2: Identify beam width effect

Final Answer:

Quick Check:

Solution

Step 1: Understand beam width effect on quality

Step 2: Understand beam width effect on speed

Final Answer:

Quick Check: