Recall & Review

beginner

What is an N-gram in language modeling?

An N-gram is a contiguous sequence of N words (or other tokens) taken from a text. N-gram models use these sequences to predict the next word in a sentence. For example, a bigram is a sequence of two words and a trigram is a sequence of three.
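
As a rough illustration, N-grams can be extracted by sliding a window of size N over a list of tokens. The sketch below assumes simple whitespace tokenization, and the helper name `ngrams` is made up for this example:

```python
def ngrams(tokens, n):
    """Return every contiguous window of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Whitespace splitting is an assumption; real systems use a proper tokenizer.
tokens = "I love AI".split()

bigrams = ngrams(tokens, 2)   # [('I', 'love'), ('love', 'AI')]
trigrams = ngrams(tokens, 3)  # [('I', 'love', 'AI')]
```

A text shorter than N tokens simply yields an empty list.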

beginner

How does an N-gram language model predict the next word?

It looks at the previous N-1 words and estimates the probability of each candidate next word from how often that word followed those N-1 words in the training data.
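
For the bigram case (N = 2), this counting can be sketched as follows. The toy corpus and the function names `train_bigram` and `next_word_probs` are invented for illustration, and the estimate shown is the plain count ratio with no smoothing:

```python
from collections import Counter, defaultdict

def train_bigram(sentences):
    """Count how often each word follows each previous word."""
    follow = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            follow[prev][nxt] += 1
    return follow

def next_word_probs(follow, prev):
    """Estimate P(word | prev) as count(prev, word) / count(prev, any word)."""
    counts = follow.get(prev, Counter())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}

# Toy training data, made up for this example.
corpus = ["the cat sat", "the cat ran", "the dog sat"]
model = train_bigram(corpus)

probs = next_word_probs(model, "the")  # "cat" gets 2/3, "dog" gets 1/3
best = max(probs, key=probs.get)       # most likely next word after "the"
```

A context that never occurred in training returns an empty dictionary here, which is one face of the sparsity problem covered later in this review.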

beginner

What is the difference between unigram, bigram, and trigram models?

Unigram models treat each word independently, with no context. Bigram models predict each word from the one word before it. Trigram models predict each word from the two words before it.

intermediate

Why do N-gram models face the problem of data sparsity?

Because the number of possible word combinations grows rapidly with N, many valid combinations never appear in the training data, making it hard to estimate their probabilities accurately.
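
A small sketch can make the problem concrete. The six-word corpus below is invented for illustration, and `mle` is a hypothetical helper that computes the unsmoothed count ratio:

```python
from collections import Counter

# Toy training text, made up for this example.
corpus = "the cat sat on the mat".split()

bigram_counts = Counter(zip(corpus, corpus[1:]))
unigram_counts = Counter(corpus)

vocab = set(corpus)
possible = len(vocab) ** 2     # every ordered pair of vocabulary words: 25
observed = len(bigram_counts)  # distinct pairs actually seen: 5

def mle(prev, word):
    """Unsmoothed estimate; zero for any pair never seen in training."""
    return bigram_counts[(prev, word)] / unigram_counts[prev]

coverage = observed / possible  # 0.2, so 80% of possible pairs are unseen
p_seen = mle("the", "cat")      # 0.5
p_unseen = mle("cat", "on")     # 0.0, although "cat on" is a plausible pair
```

Because a sentence's probability is a product of such terms, a single unseen pair drives the whole product to zero.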

intermediate

What is smoothing in N-gram language models?

Smoothing is a technique to adjust probabilities so that unseen word sequences get a small, non-zero probability instead of zero, helping the model handle new phrases.
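
One simple way to do this is add-one (Laplace) smoothing, which is only one of several smoothing methods and is chosen here purely for illustration. The toy corpus and the function name `add_k_prob` are assumptions made for this sketch:

```python
from collections import Counter

def add_k_prob(prev, word, bigram_counts, context_counts, vocab_size, k=1.0):
    """Add-k estimate: (count(prev, word) + k) / (count(prev as context) + k * V)."""
    return (bigram_counts[(prev, word)] + k) / (context_counts[prev] + k * vocab_size)

# Toy training text, made up for this example.
tokens = "the cat sat on the mat".split()
pairs = list(zip(tokens, tokens[1:]))

bigram_counts = Counter(pairs)
# Count each word only where it is followed by another word,
# so the smoothed probabilities for a context sum to 1.
context_counts = Counter(prev for prev, _ in pairs)
vocab = sorted(set(tokens))
V = len(vocab)

p_seen = add_k_prob("the", "cat", bigram_counts, context_counts, V)   # (1 + 1) / (2 + 5)
p_unseen = add_k_prob("cat", "on", bigram_counts, context_counts, V)  # (0 + 1) / (1 + 5)

# The smoothed estimates still form a valid distribution over the vocabulary.
total = sum(add_k_prob("the", w, bigram_counts, context_counts, V) for w in vocab)
```

The unseen pair now receives a small non-zero probability, at the cost of taking some probability mass away from pairs that were actually observed.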

What does a trigram model use to predict the next word?

A. The previous two words

B. The previous three words

C. Only the last word

D. All words in the sentence

Why is smoothing important in N-gram models?

A. To remove rare words from the model

B. To assign zero probability to unseen word sequences

C. To increase the size of the training data

D. To assign small probabilities to unseen word sequences

Which problem occurs because many word sequences are rare or missing in training data?

A. Overfitting

B. Data sparsity

C. Underfitting

D. Bias

In a bigram model, what is the probability of a word based on?

A. Random chance

B. The word after it

C. The word before it

D. The entire sentence

Which of these is NOT a type of N-gram model?

A. Quadrigram

B. Unigram

C. Trigram

D. Bigram

Explain how an N-gram language model predicts the next word in a sentence.

Describe the challenges N-gram models face and how smoothing helps.

Practice

1. What does an n-gram language model primarily do?

easy

A. Predict the next word based on previous words

B. Translate text from one language to another

C. Generate images from text descriptions

D. Detect the sentiment of a sentence

Solution

Step 1: Understand the purpose of n-gram models

Step 2: Identify the main function

Final Answer:

Quick Check:

Solution

Step 1: Understand bigrams

Step 2: Extract bigrams from 'I love AI'

Final Answer:

Quick Check:

Solution

Step 1: Identify trigrams in the sentence

Step 2: Count the trigram ('the', 'cat', 'sat')

Final Answer:

Quick Check:

Solution

Step 1: Analyze the loop range

Step 2: Check index access inside loop

Final Answer:

Quick Check:

Solution

Step 1: Understand sparse data in n-gram models

Step 2: Identify smoothing techniques

Final Answer:

Quick Check: