Practice

(1/5)

1. Why do word embeddings help computers understand language better?

easy

A. Because they turn words into numbers that show their meaning

B. Because they translate words into different languages

C. Because they count how many times a word appears

D. Because they remove stop words from sentences

Solution

Step 1: Understand what embeddings do
Embeddings convert words into numbers (vectors) that represent their meanings.
Step 2: Recognize the benefit for computers
These numbers help computers see which words are similar in meaning by their closeness in vector space.
Final Answer:
Because they turn words into numbers that show their meaning -> Option A
Quick Check:
Embeddings = numeric meaning representation [OK]

Hint: Embeddings = words as meaningful numbers [OK]

Common Mistakes:

Thinking embeddings translate languages
Confusing embeddings with word frequency counts
Believing embeddings remove words

2. Which of the following is the correct way to represent a word embedding vector in code?

easy

A. embedding = 'word vector'

B. embedding = {'word': 1}

C. embedding = 12345

D. embedding = [0.1, 0.5, -0.3]

Solution

Step 1: Identify the data type for embeddings
Embeddings are numeric vectors, usually lists or arrays of floats.
Step 2: Check each option's format
embedding = [0.1, 0.5, -0.3] shows a list of numbers, which is correct. Others are strings, integers, or dictionaries, which are incorrect.
Final Answer:
embedding = [0.1, 0.5, -0.3] -> Option D
Quick Check:
Embedding vector = list of numbers [OK]

Hint: Embedding = list of numbers, not strings or ints [OK]

Common Mistakes:

Using strings instead of numeric vectors
Using single numbers instead of vectors
Using dictionaries instead of lists

3. Given the following embeddings:
embedding_cat = [0.2, 0.4, 0.6]
embedding_dog = [0.21, 0.39, 0.58]
embedding_car = [0.9, 0.1, 0.2]
Which pair is most semantically similar based on cosine similarity?

medium

A. dog and car

B. cat and car

C. cat and dog

D. All pairs are equally similar

Solution

Step 1: Understand cosine similarity
Cosine similarity measures how close two vectors point in the same direction; higher means more similar.
Step 2: Compare vectors
embedding_cat and embedding_dog are close numerically, so their cosine similarity is high. embedding_car is quite different.
Final Answer:
cat and dog -> Option C
Quick Check:
Closest vectors = most similar words [OK]

Hint: Closest vectors mean similar words [OK]

Common Mistakes:

Assuming car is similar to cat or dog
Thinking all pairs have same similarity
Ignoring vector closeness

4. You have this code snippet to compute similarity between two embeddings:

def similarity(vec1, vec2):
    return sum(a*b for a, b in zip(vec1, vec2))

embedding1 = [0.3, 0.5, 0.2]
embedding2 = [0.3, 0.5]
print(similarity(embedding1, embedding2))

What is the main problem here?

medium

A. The vectors have different lengths causing incorrect similarity

B. The function uses sum instead of product

C. The function should return a list, not a number

D. The embeddings contain strings instead of numbers

Solution

Step 1: Check vector lengths
embedding1 has 3 elements, embedding2 has 2 elements, so zip stops early, ignoring last element of embedding1.
Step 2: Understand impact on similarity
This causes incomplete calculation and inaccurate similarity score.
Final Answer:
The vectors have different lengths causing incorrect similarity -> Option A
Quick Check:
Vector length mismatch = wrong similarity [OK]

Hint: Vectors must be same length for similarity [OK]

Common Mistakes:

Ignoring vector length mismatch
Thinking sum is wrong operation here
Expecting list output instead of number

5. You want to improve a chatbot's understanding by using embeddings. Which approach best captures semantic meaning for similar questions like "How are you?" and "How do you do?"?

hard

A. Use only the first word's embedding as sentence meaning

B. Use pretrained word embeddings and average their vectors for the whole sentence

C. Use random vectors for each word without training

D. Use one-hot encoding for each word and sum them

Solution

Step 1: Understand sentence embedding from word embeddings
Averaging pretrained word embeddings creates a vector representing the whole sentence's meaning.
Step 2: Compare other options
One-hot encoding loses semantic info, random vectors have no meaning, and using only first word misses context.
Final Answer:
Use pretrained word embeddings and average their vectors for the whole sentence -> Option B
Quick Check:
Average pretrained embeddings = better sentence meaning [OK]

Hint: Average pretrained embeddings for sentence meaning [OK]

Common Mistakes:

Using one-hot encoding which lacks meaning
Using random vectors without training
Ignoring all words except the first

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Loss starts high, accuracy low as embeddings begin to learn.
2	0.9	0.60	Loss decreases, accuracy improves as embeddings capture word context.
3	0.7	0.72	Embeddings better represent semantic meaning, improving model predictions.
4	0.55	0.80	Loss continues to drop, accuracy rises, embeddings capture more subtle meanings.
5	0.45	0.85	Training converges, embeddings effectively represent word meanings.

Why embeddings capture semantic meaning in NLP - Model Pipeline Impact

Start learning this pattern below

Practice

Solution

Step 1: Understand what embeddings do

Step 2: Recognize the benefit for computers

Final Answer:

Quick Check:

Solution

Step 1: Identify the data type for embeddings

Step 2: Check each option's format

Final Answer:

Quick Check:

Solution

Step 1: Understand cosine similarity

Step 2: Compare vectors

Final Answer:

Quick Check:

Solution

Step 1: Check vector lengths

Step 2: Understand impact on similarity

Final Answer:

Quick Check:

Solution

Step 1: Understand sentence embedding from word embeddings

Step 2: Compare other options

Final Answer:

Quick Check: