Challenge - 5 Problems
Standard Analyzer Master
❓ Predict Output
Difficulty: intermediate
What is the output of the standard analyzer on the text?
Given the text "The Quick Brown Foxes jumped over the lazy dogs.", what tokens does the standard analyzer produce?
POST _analyze
{
  "analyzer": "standard",
  "text": "The Quick Brown Foxes jumped over the lazy dogs."
}
💡 Hint
The standard analyzer lowercases all tokens and splits text on word boundaries, dropping whitespace and most punctuation.
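✅ Explanation
The tokens are ["the", "quick", "brown", "foxes", "jumped", "over", "the", "lazy", "dogs"]. The standard analyzer splits the text on word boundaries, drops the trailing period, and lowercases every token. It does not stem words, so "foxes" and "dogs" remain plural, and it keeps both occurrences of "the" because stop words are not removed by default.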
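The behavior described above can be approximated in a few lines of Python. This is only a sketch: the regular expression and the function name `approx_standard_analyze` are assumptions made for illustration, cover plain ASCII text only, and are not how Elasticsearch implements the analyzer (which follows the Unicode Text Segmentation rules).

```python
import re

# Rough ASCII-only stand-in for the standard tokenizer (an assumption, not the
# real implementation): a token is a run of letters/digits, optionally continued
# across a single apostrophe or period when another letter/digit follows.
TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:['.][A-Za-z0-9]+)*")


def approx_standard_analyze(text):
    """Tokenize, then lowercase each token. No stemming, no stop-word removal."""
    return [match.group(0).lower() for match in TOKEN_RE.finditer(text)]


tokens = approx_standard_analyze("The Quick Brown Foxes jumped over the lazy dogs.")
print(tokens)
```

Running the sketch prints nine lowercase tokens, with "foxes" and "dogs" still plural and "the" appearing twice.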
🧠 Conceptual
Difficulty: intermediate
Which feature is NOT part of the standard analyzer?
Select the feature that the standard analyzer does NOT perform on input text.
💡 Hint
Think about whether the standard analyzer removes common words like "the" or "and".
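✅ Explanation
The standard analyzer does not remove stop words by default. It tokenizes text on word boundaries and lowercases each token, but keeps every token, including stop words such as "the" and "and". Stop-word removal happens only if the analyzer is configured with the 'stopwords' parameter.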
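To make the default concrete, here is a small Python sketch in which stop-word removal is opt-in. The function `analyze` and its `stopwords` argument are hypothetical names chosen to mirror the idea, and the tokenization is deliberately simplified to runs of ASCII letters and digits.

```python
import re


def analyze(text, stopwords=frozenset()):
    """Lowercase word tokens; drop a token only if it is in `stopwords`.

    The empty default mirrors the standard analyzer's default of removing
    nothing. Tokenization here is a simplification (runs of ASCII letters
    and digits), not the real tokenizer.
    """
    tokens = [t.lower() for t in re.findall(r"[A-Za-z0-9]+", text)]
    return [t for t in tokens if t not in stopwords]


text = "The fox and the dog"
default_tokens = analyze(text)                              # stop words kept
filtered_tokens = analyze(text, stopwords={"the", "and"})   # removed on request
print(default_tokens)
print(filtered_tokens)
```

With no stop-word list the output still contains "the" and "and"; only the second call reduces it to "fox" and "dog".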
🔧 Debug
Difficulty: advanced
Why does the standard analyzer keep "can't" as a single token?
Given the text "I can't do this." analyzed with the standard analyzer, the output tokens are ["i", "can't", "do", "this"]. Why is "can't" not split into "can" and "t"?
POST _analyze
{
  "analyzer": "standard",
  "text": "I can't do this."
}
💡 Hint
Consider how the standard analyzer treats punctuation inside words.
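✅ Explanation
The standard analyzer follows the Unicode Text Segmentation word-boundary rules, under which an apostrophe between two letters is not a word boundary. The apostrophe therefore stays inside the token and "can't" remains one token, while the trailing period is dropped.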
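The difference between "split on every punctuation mark" and "keep apostrophes inside words" can be seen by comparing two regular expressions in Python. Both patterns are illustrative assumptions for ASCII text, not the tokenizer Elasticsearch actually uses.

```python
import re

text = "I can't do this."

# Naive rule: every non-letter is a separator, so the apostrophe splits the word.
naive = re.findall(r"[a-z]+", text.lower())

# Closer to the behavior described above: an apostrophe with a letter on both
# sides stays inside the token; trailing punctuation is still dropped.
word_aware = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())

print(naive)
print(word_aware)
```

The naive pattern yields "can" and "t" as separate tokens, while the second pattern reproduces the four tokens given in the question.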
📝 Syntax
Difficulty: advanced
Is this standard analyzer request valid?
Decide whether this analyze request, which passes an array as 'text', is valid:
POST _analyze
{
  "analyzer": "standard",
  "text": ["Hello", "world"]
}
💡 Hint
Check the data types the analyze API accepts for the 'text' field.
✅ Explanation
The request is valid. The 'text' field accepts either a string or an array of strings, and an array is analyzed as a multi-value field. Here the standard analyzer returns the tokens "hello" and "world".
🚀 Application
Difficulty: expert
How many tokens does the standard analyzer produce for this text?
Analyze the text "E-mail addresses like test@example.com are tokenized." with the standard analyzer. How many tokens are produced?
POST _analyze
{
  "analyzer": "standard",
  "text": "E-mail addresses like test@example.com are tokenized."
}
💡 Hint
The standard analyzer splits hyphenated words like 'E-mail' at the hyphen and splits e-mail addresses at '@', but a '.' between letters does not start a new token.
✅ Explanation
The tokens are ["e", "mail", "addresses", "like", "test", "example.com", "are", "tokenized"], totaling 8 tokens. The hyphen splits "E-mail" into "e" and "mail". The "@" splits the address into "test" and "example.com"; the period inside "example.com" sits between letters and is kept, while the sentence-final period is dropped.
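The hyphen, '@', and '.' cases can be checked with a short Python approximation. It assumes, as a simplification of the Unicode word-boundary rules for ASCII input, that a hyphen and '@' always separate tokens while an apostrophe or period flanked by letters or digits does not. The pattern and the name `approx_tokens` are illustrative, not part of Elasticsearch.

```python
import re

# Assumed ASCII-only approximation: letters/digits form a token, and a single
# apostrophe or period is kept only when another letter/digit follows it.
TOKEN_RE = re.compile(r"[A-Za-z0-9]+(?:['.][A-Za-z0-9]+)*")


def approx_tokens(text):
    """Return lowercase tokens under the simplified rules above."""
    return [match.group(0).lower() for match in TOKEN_RE.finditer(text)]


text = "E-mail addresses like test@example.com are tokenized."
tokens = approx_tokens(text)
print(len(tokens), tokens)
```

Under these assumptions the sketch reports 8 tokens, with "e" and "mail" separated at the hyphen and "example.com" kept whole. Running the real analyze request against a cluster remains the authoritative check.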