2.

Which of the following is the correct way to compute cosine similarity between two vectors A and B in Python using numpy?

import numpy as np
A = np.array([1, 2, 3])
B = np.array([4, 5, 6])
# What code computes cosine similarity?

easy

A. np.dot(A, B) * (np.linalg.norm(A) + np.linalg.norm(B))

B. np.dot(A, B) / (np.linalg.norm(A) - np.linalg.norm(B))

C. np.sum(A * B) / (np.linalg.norm(A) - np.linalg.norm(B))

D. np.dot(A, B) / (np.linalg.norm(A) * np.linalg.norm(B))

3.

Given the following vectors, what is the cosine similarity between vec1 and vec2?

import numpy as np
vec1 = np.array([1, 0, 0])
vec2 = np.array([0, 1, 0])
cos_sim = np.dot(vec1, vec2) / (np.linalg.norm(vec1) * np.linalg.norm(vec2))
print("{:.2f}".format(cos_sim))

medium

A. 0.00

B. 0.50

C. -1.00

D. 1.00

4.

Consider this code snippet for similarity search. What is the error?

import numpy as np
vectors = [np.array([1, 2]), np.array([3, 4])]
query = np.array([1, 0])
scores = []
for v in vectors:
    score = np.dot(query, v) / np.linalg.norm(query) * np.linalg.norm(v)
    scores.append(score)
print(scores)

medium

A. Missing parentheses causing wrong order of operations

B. Using np.dot instead of np.cross

C. Vectors have different lengths

D. Query vector is not normalized

5.

You have a collection of text documents converted into vectors. You want to find the top 2 most similar documents to a new query vector using cosine similarity. Which approach is best?

Compute cosine similarity between query and each document vector.
Sort documents by similarity score descending.
Return top 2 documents.

Which code snippet correctly implements this?

import numpy as np

docs = [np.array([1, 0]), np.array([0, 1]), np.array([1, 1])]
query = np.array([1, 0])

# Choose the correct code:

hard

A. scores = [np.dot(query, d) * np.linalg.norm(query) * np.linalg.norm(d) for d in docs] top2 = sorted(scores)[:2] print(top2)

B. scores = [np.dot(query, d) / (np.linalg.norm(query) * np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:2] print(top2)

C. scores = [np.dot(query, d) / (np.linalg.norm(query) - np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i])[:2] print(top2)

D. scores = [np.cross(query, d) / (np.linalg.norm(query) * np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:2] print(top2)

Why Similarity search and retrieval in Prompt Engineering / GenAI? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of similarity search

Step 2: Compare options with the definition

Final Answer:

Quick Check:

Solution

Step 1: Recall cosine similarity formula

Step 2: Match formula to code options

Final Answer:

Quick Check:

Solution

Step 1: Calculate dot product of vec1 and vec2

Step 2: Calculate norms and cosine similarity

Final Answer:

Quick Check:

Solution

Step 1: Analyze the cosine similarity formula in code

Step 2: Identify missing parentheses

Final Answer:

Quick Check:

Solution

Step 1: Compute cosine similarity correctly

Step 2: Sort indices by similarity descending and select top 2

Final Answer:

Quick Check: