
Autocomplete with edge n-gram in Elasticsearch - Step-by-Step Execution

Concept Flow - Autocomplete with edge n-gram

User types prefix

↓

Elasticsearch receives query

↓

Search edge n-gram tokens

↓

Match tokens starting with prefix

↓

Return autocomplete suggestions

↓

User sees suggestions

When a user types a prefix, Elasticsearch looks it up among the edge n-gram tokens created at index time and returns the documents whose 'name' contains a word starting with that prefix.
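To make the token creation concrete, here is a minimal Python sketch that emulates what an edge n-gram tokenizer produces for a single word. The function name `edge_ngrams` is hypothetical and this is not Elasticsearch code; it only assumes the `min_gram` and `max_gram` semantics used in this walkthrough.

```python
def edge_ngrams(word, min_gram=1, max_gram=10):
    """Return the prefixes of `word` with lengths from min_gram to max_gram."""
    # Never ask for a prefix longer than the word itself
    longest = min(len(word), max_gram)
    return [word[:n] for n in range(min_gram, longest + 1)]


print(edge_ngrams("michael"))
print(edge_ngrams("michelle"))
```

Every prefix of the word becomes its own token, which is what later lets a typed prefix be found with an ordinary term lookup.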

Execution Sample

Elasticsearch

PUT /autocomplete_example
{
  "settings": {
    "analysis": {
      "analyzer": {
        "autocomplete_analyzer": {
          "type": "custom",
          "tokenizer": "autocomplete_tokenizer",
          "filter": ["lowercase"]
        }
      },
      "tokenizer": {
        "autocomplete_tokenizer": {
          "type": "edge_ngram",
          "min_gram": 1,
          "max_gram": 10,
          "token_chars": ["letter"]
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "name": {
        "type": "text",
        "analyzer": "autocomplete_analyzer",
        "search_analyzer": "standard"
      }
    }
  }
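}

POST /autocomplete_example/_doc?refresh=true
{ "name": "michael" }

POST /autocomplete_example/_doc?refresh=true
{ "name": "michelle" }

GET /autocomplete_example/_search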
{
  "query": {
    "match": {
      "name": "mic"
    }
  }
}

This code creates an index with an edge n-gram tokenizer for autocomplete on the 'name' field, then searches for prefix 'mic'.

Execution Table

Step	Action	Input Text	Tokens Created	Query Tokens	Matched Tokens	Output Suggestions
1	Index document	michael	["m", "mi", "mic", "mich", "micha", "michae", "michael"]	-	-	-
2	Index document	michelle	["m", "mi", "mic", "mich", "miche", "michel", "michell", "michelle"]	-	-	-
3	User types prefix	mic	-	["mic"]	-	-
4	Search edge n-gram tokens	-	-	["mic"]	["mic"]	-
5	Match tokens starting with prefix	-	-	["mic"]	["mic"]	-
6	Return autocomplete suggestions	-	-	-	-	["michael", "michelle"]
7	User sees suggestions	-	-	-	-	["michael", "michelle"]

💡 Suggestions are returned because the query token 'mic' equals a prefix token that was indexed for both 'michael' and 'michelle'.
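The trace in the table can be reproduced with a small in-memory model. This is a sketch under simplifying assumptions, not how Elasticsearch is implemented: each name is a single word, and `edge_ngrams`, `build_index`, and `suggest` are hypothetical helper names.

```python
from collections import defaultdict


def edge_ngrams(word, min_gram=1, max_gram=10):
    """Prefixes of `word` with lengths from min_gram to max_gram."""
    return [word[:n] for n in range(min_gram, min(len(word), max_gram) + 1)]


def build_index(names):
    """Steps 1-2: map every prefix token to the names that produced it."""
    index = defaultdict(set)
    for name in names:
        for token in edge_ngrams(name.lower()):
            index[token].add(name)
    return index


def suggest(index, typed):
    """Steps 3-6: keep the query as one token and look it up exactly."""
    return sorted(index.get(typed.lower(), set()))


index = build_index(["michael", "michelle"])
print(suggest(index, "mic"))
print(suggest(index, "micha"))
```

Note that the lookup is an exact match on the token 'mic'. The prefix behaviour comes entirely from the tokens stored at index time.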

Variable Tracker

Variable	Start	After Step 1	After Step 2	After Step 3	After Step 4	After Step 6	Final
Input Text	-	michael	michelle	mic	mic	mic	mic
Tokens Created	-	["m", "mi", "mic", "mich", "micha", "michae", "michael"]	["m", "mi", "mic", "mich", "miche", "michel", "michell", "michelle"]	-	-	-	-
Query Tokens	-	-	-	["mic"]	["mic"]	["mic"]	["mic"]
Matched Tokens	-	-	-	-	["mic"]	["mic"]	["mic"]
Output Suggestions	-	-	-	-	-	["michael", "michelle"]	["michael", "michelle"]

Key Moments - 3 Insights

Why do we create multiple tokens like "m", "mi", "mic" for a single word?

Why is the search analyzer set to standard instead of autocomplete_analyzer?
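One way to see the difference is to compare the two choices in a small Python emulation. The sketch assumes a third, hypothetical document 'mark' and relies on the fact that a match query combines its tokens with OR by default; the helper names are illustrative only.

```python
def edge_ngrams(word, min_gram=1, max_gram=10):
    """Prefixes of `word` with lengths from min_gram to max_gram."""
    return [word[:n] for n in range(min_gram, min(len(word), max_gram) + 1)]


names = ["michael", "michelle", "mark"]
indexed = {name: set(edge_ngrams(name)) for name in names}


def search(query_tokens):
    # OR semantics: a document matches if it shares any token with the query
    return [name for name in names if indexed[name] & set(query_tokens)]


# Standard search analyzer: the typed text stays one token
with_standard = search(["mic"])

# Edge n-gram at search time: the query is split into m, mi, mic
with_edge_ngram = search(edge_ngrams("mic"))

print(with_standard)
print(with_edge_ngram)
```

With the edge n-gram analyzer applied to the query, the token 'm' alone is enough to match 'mark', so the suggestions stop reflecting what the user typed.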

What happens if the user types a prefix longer than max_gram?
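The effect can be sketched in Python with a hypothetical 14-letter name, 'christopherson', and the max_gram of 10 from the index settings. Only prefixes up to 10 characters are indexed, so a longer query token has nothing to match. Truncating the query to max_gram characters, which in Elasticsearch could be done with a truncate token filter on the search analyzer, restores the match; the code below only emulates that idea.

```python
def edge_ngrams(word, min_gram=1, max_gram=10):
    """Prefixes of `word` with lengths from min_gram to max_gram."""
    return [word[:n] for n in range(min_gram, min(len(word), max_gram) + 1)]


MAX_GRAM = 10
# Hypothetical document; the longest indexed token is its first 10 letters
indexed_tokens = set(edge_ngrams("christopherson", max_gram=MAX_GRAM))

short_query = "christ"       # 6 characters, within max_gram
long_query = "christopher"   # 11 characters, longer than max_gram

short_hit = short_query in indexed_tokens
long_hit = long_query in indexed_tokens
# Emulated fix: cut the query down to max_gram characters before the lookup
truncated_hit = long_query[:MAX_GRAM] in indexed_tokens

print(short_hit, long_hit, truncated_hit)
```

The trade-off of truncating is that any query sharing the first 10 characters matches, even if the remaining characters differ.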

Visual Quiz - 3 Questions

Test your understanding

Look at the Execution Table at Step 1. Which tokens are created for the word "michael"?

A. ["michael"]

B. ["m", "mi", "mic", "mich", "micha", "michae", "michael"]

C. ["mic", "mich", "micha"]

D. ["m", "mi", "michelle"]

Concept Snapshot

Autocomplete with edge n-gram:
- Use edge_ngram tokenizer to create prefix tokens
- Index prefix tokens such as 'm', 'mi', 'mic' for each word
- Search with the standard analyzer so the typed prefix stays a single token
- Match the query token against the indexed prefix tokens
- Return documents whose words start with the typed prefix

Full Transcript

This visual trace shows how Elasticsearch uses the edge n-gram tokenizer to support autocomplete. At index time, a word like 'michael' is broken into prefix tokens: 'm', 'mi', 'mic', and so on up to the full word. When a user types a prefix like 'mic', Elasticsearch looks up the token 'mic' among those indexed prefixes and returns the matching documents, here 'michael' and 'michelle'. The search analyzer is standard so that the query stays a single token instead of being split into its own prefixes. The execution table tracks each step from indexing to returning suggestions, and the variable tracker shows how tokens and queries evolve. Key moments clarify why multiple tokens are created and why the search analyzer differs. The quiz tests understanding of token creation and matching steps.