TF-IDF and BM25 scoring in Elasticsearch - Step-by-Step Execution

Concept Flow - TF-IDF and BM25 scoring

Input Query

↓

Document Collection

↓

Calculate Term Frequency (TF)

↓

Calculate Inverse Document Frequency (IDF)

↓

Compute TF-IDF Score

↓

Apply BM25 Formula (TF, IDF, doc length)

↓

Rank Documents by Score

↓

Return Top Results

The flow shows how Elasticsearch scores documents: it computes the term frequency (TF) and inverse document frequency (IDF) of each query term, combines them with the TF-IDF or BM25 formula, and ranks documents by the resulting score.
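For reference, the BM25 step in the flow can be written out in its commonly cited form. Here f(t, d) is the frequency of term t in document d, |d| is the document length in terms, avgdl is the average document length in the index, N is the number of documents, and n(t) is the number of documents containing t. The parameters k_1 and b default to 1.2 and 0.75 in Elasticsearch. The IDF shown is the variant Lucene uses; treat the exact form as an assumption, since constant factors differ between versions.

```latex
\mathrm{score}(d, q) = \sum_{t \in q} \mathrm{IDF}(t) \cdot
  \frac{f(t, d)\,(k_1 + 1)}{f(t, d) + k_1 \left(1 - b + b \cdot \frac{|d|}{\mathrm{avgdl}}\right)}

\mathrm{IDF}(t) = \ln\!\left(1 + \frac{N - n(t) + 0.5}{n(t) + 0.5}\right)
```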

Execution Sample

GET /my_index/_search
{
  "query": {
    "match": { "text": "apple banana" }
  }
}

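This query matches documents whose `text` field contains 'apple' or 'banana' (the match query's default operator is OR) and scores them with BM25, Elasticsearch's default similarity. Documents that contain both terms generally score higher because each matching term adds to the total.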
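To inspect how a score was computed, or to require every term to match, the request body can be adjusted. The sketch below only builds the JSON body in Python and prints it; it does not contact a cluster. The helper name `build_search_body` and its arguments are hypothetical, while `operator` and `explain` are standard options of the match query and the search API.

```python
import json


def build_search_body(query_text, require_all_terms=False, explain=False):
    """Build a match-query body for the `text` field (hypothetical helper)."""
    body = {
        "query": {
            "match": {
                "text": {
                    "query": query_text,
                    # "or" is the match query's default operator
                    "operator": "and" if require_all_terms else "or",
                }
            }
        }
    }
    if explain:
        # Asks Elasticsearch to return a score breakdown for each hit
        body["explain"] = True
    return body


if __name__ == "__main__":
    # Body for: GET /my_index/_search
    print(json.dumps(build_search_body("apple banana", explain=True), indent=2))
```

With `explain` enabled, each hit in the response carries an `_explanation` tree that breaks the score down per term.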

Execution Table

Step	Action	Term	TF	IDF	BM25 Score	Explanation
1	Calculate TF for 'apple' in Doc1	apple	3	-	0	Count how many times 'apple' appears in Doc1
2	Calculate TF for 'banana' in Doc1	banana	1	-	0	Count how many times 'banana' appears in Doc1
3	Calculate IDF for 'apple'	apple	-	2.0	0	Inverse document frequency for 'apple'
4	Calculate IDF for 'banana'	banana	-	1.5	0	Inverse document frequency for 'banana'
5	Compute BM25 score for 'apple' in Doc1	apple	3	2.0	2.5	Apply BM25 formula with TF=3, IDF=2.0, doc length normalization
6	Compute BM25 score for 'banana' in Doc1	banana	1	1.5	1.2	Apply BM25 formula with TF=1, IDF=1.5, doc length normalization
7	Sum BM25 scores for Doc1	-	-	-	3.7	Total BM25 score for Doc1 is sum of term scores
8	Repeat steps 1-7 for Doc2	-	-	-	2.1	Calculate scores for another document
9	Rank documents by BM25 score	-	-	-	-	Doc1 (3.7) ranks higher than Doc2 (2.1)
10	Return top ranked documents	-	-	-	-	Results returned to user sorted by score

💡 All documents scored and ranked; top results returned.
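The steps in the table can be sketched in a few lines of Python. This is a minimal illustration, not Elasticsearch's implementation: the corpus is invented, tokenization is a plain whitespace split, and the formula is the commonly cited BM25 form with Elasticsearch's default `k1` and `b`. Because the corpus is hypothetical, the scores it prints will not match the illustrative numbers in the table.

```python
import math
from collections import Counter

# Hypothetical toy corpus, invented for illustration only.
DOCS = {
    "Doc1": "apple apple banana apple pie",
    "Doc2": "banana bread recipe",
    "Doc3": "cherry tart with cream",
}

K1 = 1.2   # term-frequency saturation (Elasticsearch default)
B = 0.75   # length-normalization strength (Elasticsearch default)


def tokenize(text):
    """Lowercase and split on whitespace; a real analyzer does much more."""
    return text.lower().split()


def idf(term, tokenized_docs):
    """Lucene-style BM25 IDF: ln(1 + (N - n + 0.5) / (n + 0.5))."""
    n_docs = len(tokenized_docs)
    n_with_term = sum(1 for tokens in tokenized_docs.values() if term in tokens)
    return math.log(1 + (n_docs - n_with_term + 0.5) / (n_with_term + 0.5))


def bm25_score(query, doc_id, tokenized_docs):
    """Sum the per-term BM25 contributions for one document (steps 1-7)."""
    tokens = tokenized_docs[doc_id]
    tf = Counter(tokens)
    avg_len = sum(len(t) for t in tokenized_docs.values()) / len(tokenized_docs)
    length_norm = 1 - B + B * len(tokens) / avg_len
    score = 0.0
    for term in tokenize(query):
        freq = tf[term]  # 0 if the term is absent, so the term adds nothing
        score += idf(term, tokenized_docs) * freq * (K1 + 1) / (freq + K1 * length_norm)
    return score


def rank(query, docs):
    """Score every document and sort by descending score (steps 8-10)."""
    tokenized = {doc_id: tokenize(text) for doc_id, text in docs.items()}
    scores = {doc_id: bm25_score(query, doc_id, tokenized) for doc_id in tokenized}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for doc_id, score in rank("apple banana", DOCS):
        print(f"{doc_id}: {score:.3f}")
```

Doc1 ranks first here because it matches both terms and repeats 'apple'; Doc3 matches neither term and scores zero.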

Variable Tracker

Variable	Start	After Step 1	After Step 5	After Step 7	Final
TF_apple_Doc1	0	3	3	3	3
TF_banana_Doc1	0	1	1	1	1
IDF_apple	-	-	2.0	2.0	2.0
IDF_banana	-	-	1.5	1.5	1.5
BM25_apple_Doc1	0	0	2.5	2.5	2.5
BM25_banana_Doc1	0	0	0	1.2	1.2
BM25_total_Doc1	0	0	0	3.7	3.7

Key Moments - 3 Insights

Why does BM25 score consider document length while TF-IDF does not?

Why is IDF important in scoring?

How are scores combined for multiple terms?

Visual Quiz - 3 Questions

Test your understanding

Look at the execution table. What is the BM25 score for 'apple' in Doc1 at step 5?

A. 1.5

C. 2.5

Concept Snapshot

TF-IDF and BM25 scoring in Elasticsearch:
- TF counts term appearances in a doc
- IDF measures term rarity across docs
- TF-IDF multiplies TF by IDF
- BM25 refines TF-IDF by saturating term frequency and normalizing for doc length
- Elasticsearch uses BM25 by default to rank search results
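The difference between the two formulas is easiest to see side by side. The sketch below compares a plain TF-IDF weight with the BM25 weight for a single term, assuming the commonly cited BM25 form and Elasticsearch's default `k1` and `b`; the function names and the sample values are hypothetical.

```python
K1 = 1.2   # Elasticsearch default
B = 0.75   # Elasticsearch default


def tfidf_weight(tf, idf):
    """Plain TF-IDF: grows linearly with term frequency."""
    return tf * idf


def bm25_weight(tf, idf, doc_len, avg_len):
    """BM25: TF saturates, and longer-than-average documents are penalized."""
    length_norm = 1 - B + B * doc_len / avg_len
    return idf * tf * (K1 + 1) / (tf + K1 * length_norm)


if __name__ == "__main__":
    idf = 2.0  # illustrative value, same as the IDF for 'apple' in the table
    print("tf  tf-idf  bm25 (average-length doc)")
    for tf in (1, 3, 10, 100):
        print(tf, tfidf_weight(tf, idf), round(bm25_weight(tf, idf, 100, 100), 3))
    print("same tf=3, doc twice the average length:",
          round(bm25_weight(3, idf, 200, 100), 3))
```

The TF-IDF weight keeps growing with every repetition, while the BM25 weight approaches a ceiling of `idf * (K1 + 1)` and drops when the same term count occurs in a longer document.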

Full Transcript

This visual execution shows how Elasticsearch scores documents using TF-IDF and BM25. First, it counts how often each search term appears in a document (TF). Then, it calculates how rare each term is across all documents (IDF). BM25 scoring combines these with adjustments for document length to avoid favoring longer documents. Each term's BM25 score is computed and summed to get the document's total score. Documents are then ranked by these scores to return the most relevant results. Key points include the importance of IDF to reduce common term weight, BM25's length normalization, and summing scores for multiple terms. The execution table traces these calculations step-by-step for clarity.