Practice

(1/5)

What is the main goal of similarity search in machine learning?

easy

A. To count the number of items in a dataset

B. To sort items alphabetically

C. To find items that are close or alike in a collection

D. To remove duplicate items from a list

You have a collection of text documents converted into vectors. You want to find the top 2 most similar documents to a new query vector using cosine similarity. Which approach is best?

Compute cosine similarity between query and each document vector.
Sort documents by similarity score descending.
Return top 2 documents.

Which code snippet correctly implements this?

import numpy as np

docs = [np.array([1, 0]), np.array([0, 1]), np.array([1, 1])]
query = np.array([1, 0])

# Choose the correct code:

hard

A. scores = [np.dot(query, d) * np.linalg.norm(query) * np.linalg.norm(d) for d in docs] top2 = sorted(scores)[:2] print(top2)

B. scores = [np.dot(query, d) / (np.linalg.norm(query) * np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:2] print(top2)

C. scores = [np.dot(query, d) / (np.linalg.norm(query) - np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i])[:2] print(top2)

D. scores = [np.cross(query, d) / (np.linalg.norm(query) * np.linalg.norm(d)) for d in docs] top2 = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:2] print(top2)

Similarity search and retrieval in Prompt Engineering / GenAI - ML Experiment: Train & Evaluate

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of similarity search

Step 2: Compare options with the definition

Final Answer:

Quick Check:

Solution

Step 1: Recall cosine similarity formula

Step 2: Match formula to code options

Final Answer:

Quick Check:

Solution

Step 1: Calculate dot product of vec1 and vec2

Step 2: Calculate norms and cosine similarity

Final Answer:

Quick Check:

Solution

Step 1: Analyze the cosine similarity formula in code

Step 2: Identify missing parentheses

Final Answer:

Quick Check:

Solution

Step 1: Compute cosine similarity correctly

Step 2: Sort indices by similarity descending and select top 2

Final Answer:

Quick Check: