What is Similarity search vs MMR retrieval in LangChain?

LangChainframework~5 mins

Similarity search vs MMR retrieval in LangChain

Choose your learning style9 modes available

Learn Why Deep Visual Try Challenge Project Recall Perf

Introduction

Similarity search helps find items close to your query. MMR retrieval balances closeness and variety to avoid repeats.

When you want to find documents or data points most like your question.

When you want diverse results that cover different aspects without repeating the same info.

When building chatbots that should give varied answers, not just the closest match.

When searching a large database and you want both relevance and variety.

When you want to avoid showing very similar items multiple times.

Syntax

LangChain

from langchain.vectorstores import FAISS

# Similarity search
results = vectorstore.similarity_search(query, k=5)

# MMR retrieval
results = vectorstore.max_marginal_relevance_search(query, k=5, fetch_k=10)

similarity_search returns the top k closest matches to the query.

max_marginal_relevance_search returns k results balancing similarity and diversity, using fetch_k candidates internally.

Examples

Finds the 3 most similar documents to 'What is AI?'.

LangChain

results = vectorstore.similarity_search('What is AI?', k=3)

Finds 3 documents that are similar but also diverse, choosing from top 6 candidates.

LangChain

results = vectorstore.max_marginal_relevance_search('What is AI?', k=3, fetch_k=6)

If no close matches exist, similarity search returns the closest available, even if not very similar.

LangChain

results = vectorstore.similarity_search('Nonexistent topic', k=3)

MMR tries to pick diverse results even if all are not very close to the query.

LangChain

results = vectorstore.max_marginal_relevance_search('Nonexistent topic', k=3, fetch_k=5)

Sample Program

This program creates a small vector store from example sentences about AI. It runs both similarity search and MMR retrieval for the query 'Tell me about AI'. It prints the top 3 results from each method so you can see the difference.

LangChain

from langchain.vectorstores import FAISS
from langchain.embeddings import OpenAIEmbeddings

# Sample documents
documents = [
    'AI is the simulation of human intelligence.',
    'Machine learning is a subset of AI.',
    'Deep learning uses neural networks.',
    'AI can be used in healthcare.',
    'Neural networks mimic the brain.'
]

# Create embeddings
embeddings = OpenAIEmbeddings()

# Build vector store
vectorstore = FAISS.from_texts(documents, embeddings)

query = 'Tell me about AI'

# Similarity search
similar_results = vectorstore.similarity_search(query, k=3)

# MMR retrieval
mmr_results = vectorstore.max_marginal_relevance_search(query, k=3, fetch_k=5)

print('Similarity Search Results:')
for doc in similar_results:
    print('-', doc.page_content)

print('\nMMR Retrieval Results:')
for doc in mmr_results:
    print('-', doc.page_content)

OutputSuccess

Important Notes

Similarity search is fast and finds the closest matches but can return very similar or repeated results.

MMR retrieval adds diversity by balancing similarity and novelty, which is useful when you want varied answers.

MMR is slightly slower because it fetches more candidates internally before picking diverse results.

Summary

Similarity search finds the closest matches to your query.

MMR retrieval balances closeness and diversity to avoid repeated or very similar results.

Use similarity search for quick, relevant results; use MMR when you want variety in answers.