Practice

(1/5)

1. What does semantic similarity with embeddings help us do in natural language processing?

easy

A. Translate text from one language to another

B. Count the number of words in a sentence

C. Measure how similar the meanings of two texts are

D. Generate random sentences

Solution

Step 1: Understand semantic similarity
Semantic similarity means checking how close the meanings of two texts are, not just the words.
Step 2: Role of embeddings
Embeddings convert text into numbers that capture meaning, allowing comparison of texts by meaning.
Final Answer:
Measure how similar the meanings of two texts are -> Option C
Quick Check:
Semantic similarity = meaning comparison [OK]

Hint: Semantic similarity compares meanings, not word counts [OK]

Common Mistakes:

Confusing similarity with word count
Thinking embeddings translate text
Assuming semantic similarity generates text

2. Which Python library is commonly used to compute cosine similarity between embeddings?

easy

A. matplotlib

B. scikit-learn

C. pandas

D. flask

Solution

Step 1: Identify cosine similarity function
Cosine similarity is often computed using scikit-learn's metrics module.
Step 2: Check other libraries
matplotlib is for plotting, pandas for data frames, flask for web apps, so they don't compute cosine similarity.
Final Answer:
scikit-learn -> Option B
Quick Check:
Cosine similarity = scikit-learn [OK]

Hint: Use scikit-learn for cosine similarity calculations [OK]

Common Mistakes:

Using matplotlib for similarity
Confusing pandas with similarity tools
Thinking flask handles embeddings

3. What is the output of this Python code snippet?

from sklearn.metrics.pairwise import cosine_similarity
import numpy as np

emb1 = np.array([[1, 0, 0]])
emb2 = np.array([[0, 1, 0]])
sim = cosine_similarity(emb1, emb2)
print(sim[0][0])

medium

A. Error

B. 1.0

C. -1.0

D. 0.0

Solution

Step 1: Understand cosine similarity formula
Cosine similarity measures the cosine of the angle between two vectors. Orthogonal vectors have similarity 0.
Step 2: Analyze given vectors
emb1 is [1,0,0], emb2 is [0,1,0]. They are perpendicular, so similarity is 0.
Final Answer:
0.0 -> Option D
Quick Check:
Orthogonal vectors similarity = 0.0 [OK]

Hint: Orthogonal vectors have cosine similarity zero [OK]

Common Mistakes:

Assuming similarity is 1 for any vectors
Confusing dot product with cosine similarity
Expecting error due to shape

4. Identify the error in this code that tries to compute semantic similarity:

from sklearn.metrics.pairwise import cosine_similarity

emb1 = [0.1, 0.2, 0.3]
emb2 = [0.1, 0.2, 0.3]
sim = cosine_similarity(emb1, emb2)
print(sim)

medium

A. emb1 and emb2 should be 2D arrays, not 1D lists

B. cosine_similarity function does not exist in sklearn

C. embeddings must be strings, not numbers

D. print statement syntax is incorrect

Solution

Step 1: Check input format for cosine_similarity
cosine_similarity expects 2D arrays (like [[...]]), but emb1 and emb2 are 1D lists.
Step 2: Confirm other options
cosine_similarity exists, embeddings are numeric vectors, and print syntax is correct in Python 3.
Final Answer:
emb1 and emb2 should be 2D arrays, not 1D lists -> Option A
Quick Check:
Input shape must be 2D arrays [OK]

Hint: cosine_similarity needs 2D arrays, not 1D lists [OK]

Common Mistakes:

Passing 1D lists instead of 2D arrays
Thinking embeddings must be text
Misunderstanding print syntax

5. You have two sentences: "I love apples" and "I adore oranges". Using a pre-trained embedding model, you get vectors for both. Which approach best helps you find if these sentences have similar meaning?

hard

A. Calculate cosine similarity between their embeddings

B. Count common words between the sentences

C. Check if sentence lengths are equal

D. Compare the first letters of each word

Solution

Step 1: Understand semantic similarity goal
We want to compare meanings, not just words or sentence length.
Step 2: Use embeddings and cosine similarity
Pre-trained embeddings capture meaning; cosine similarity measures closeness of meanings numerically.
Final Answer:
Calculate cosine similarity between their embeddings -> Option A
Quick Check:
Meaning comparison = cosine similarity on embeddings [OK]

Hint: Use cosine similarity on embeddings for meaning comparison [OK]

Common Mistakes:

Relying on word overlap only
Using sentence length as similarity
Comparing letters instead of meaning

Why Semantic similarity with embeddings in NLP? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand semantic similarity

Step 2: Role of embeddings

Final Answer:

Quick Check:

Solution

Step 1: Identify cosine similarity function

Step 2: Check other libraries

Final Answer:

Quick Check:

Solution

Step 1: Understand cosine similarity formula

Step 2: Analyze given vectors

Final Answer:

Quick Check:

Solution

Step 1: Check input format for cosine_similarity

Step 2: Confirm other options

Final Answer:

Quick Check:

Solution

Step 1: Understand semantic similarity goal

Step 2: Use embeddings and cosine similarity

Final Answer:

Quick Check: