Practice

(1/5)

1. What is the main purpose of machine learning anomaly detection in Elasticsearch?

easy

A. To automatically find unusual patterns in data

B. To store large amounts of data efficiently

C. To create visual dashboards for data

D. To backup Elasticsearch clusters

Solution

Step 1: Understand anomaly detection goal
Machine learning anomaly detection is designed to find unusual or unexpected patterns in data automatically.
Step 2: Compare options with purpose
Options B, C, and D describe other Elasticsearch features, not anomaly detection.
Final Answer:
To automatically find unusual patterns in data -> Option A
Quick Check:
Purpose of anomaly detection = find unusual patterns [OK]

Hint: Anomaly detection finds unusual data automatically [OK]

Common Mistakes:

Confusing anomaly detection with data storage
Thinking anomaly detection creates dashboards
Mixing anomaly detection with backup tasks

2. Which Elasticsearch API call starts the anomaly detection process by feeding data to the job?

easy

A. POST _ml/anomaly_detectors/<job_id>/_start_datafeed

B. GET _ml/anomaly_detectors/<job_id>/results

C. PUT _ml/anomaly_detectors/<job_id>

D. DELETE _ml/anomaly_detectors/<job_id>

Solution

Step 1: Identify datafeed start API
The API to start feeding data to an anomaly detection job is POST _ml/anomaly_detectors/<job_id>/_start_datafeed.
Step 2: Eliminate other options
GET retrieves results, PUT creates or updates jobs, DELETE removes jobs.
Final Answer:
POST _ml/anomaly_detectors/<job_id>/_start_datafeed -> Option A
Quick Check:
Start datafeed = POST _start_datafeed [OK]

Hint: Start datafeed uses POST with _start_datafeed endpoint [OK]

Common Mistakes:

Using GET instead of POST to start datafeed
Confusing job creation with starting datafeed
Deleting job instead of starting datafeed

3. Given this Elasticsearch ML job result snippet:

{"job_id":"sales_anomaly","results":[{"timestamp":1680000000000,"anomaly_score":75},{"timestamp":1680003600000,"anomaly_score":5}]}

Which timestamp shows a likely anomaly?

medium

A. Neither timestamp

B. 1680003600000

C. Both timestamps

D. 1680000000000

Solution

Step 1: Understand anomaly score meaning
Higher anomaly scores indicate more unusual data points. A score of 75 is high, 5 is low.
Step 2: Identify timestamp with high score
The timestamp 1680000000000 has anomaly_score 75, indicating a likely anomaly.
Final Answer:
1680000000000 -> Option D
Quick Check:
High anomaly score = likely anomaly [OK]

Hint: Higher anomaly_score means more likely anomaly [OK]

Common Mistakes:

Choosing low anomaly score as anomaly
Selecting both timestamps without checking scores
Ignoring anomaly_score values

4. You created an anomaly detection job but see no results after starting the datafeed. What is a likely cause?

medium

A. The job was deleted before starting

B. The Elasticsearch cluster is offline

C. The datafeed is not running or has stopped

D. The anomaly scores are all zero

Solution

Step 1: Check datafeed status
If no results appear, the datafeed may not be running or has stopped feeding data to the job.
Step 2: Evaluate other options
Job deletion would prevent starting datafeed; cluster offline causes broader failures; zero scores still produce results.
Final Answer:
The datafeed is not running or has stopped -> Option C
Quick Check:
No results usually mean datafeed stopped [OK]

Hint: No results? Check if datafeed is running [OK]

Common Mistakes:

Assuming zero scores mean no results
Ignoring datafeed status
Blaming cluster offline without checking datafeed

5. You want to detect unusual spikes in website traffic using Elasticsearch ML anomaly detection. Which steps should you follow to set this up correctly?

hard

A. Backup traffic data, create index pattern, then visualize spikes

B. Create a job with traffic data, start datafeed, then analyze anomaly results

C. Create a dashboard, upload traffic logs, then run anomaly detection manually

D. Delete old data, create job without datafeed, then check results

Solution

Step 1: Create ML job with traffic data
Define an anomaly detection job using the website traffic data to analyze patterns.
Step 2: Start the datafeed to feed data into the job
Start the datafeed so the job can process incoming traffic data continuously.
Step 3: Analyze the anomaly detection results
Review the results to identify unusual spikes or anomalies in traffic.
Final Answer:
Create a job with traffic data, start datafeed, then analyze anomaly results -> Option B
Quick Check:
Job + datafeed + analyze = correct setup [OK]

Hint: Job creation + datafeed start + result check = setup [OK]

Common Mistakes:

Skipping datafeed start step
Confusing dashboards with anomaly detection setup
Deleting data before analysis

Input Size (n)	Approx. Operations
10,000 data points	~10,000 operations (each point processed once)
100,000 data points	~100,000 operations
1,000,000 data points	~1,000,000 operations

Machine learning anomaly detection in Elasticsearch - Time & Space Complexity

Start learning this pattern below

Practice

Solution

Step 1: Understand anomaly detection goal

Step 2: Compare options with purpose

Final Answer:

Quick Check:

Solution

Step 1: Identify datafeed start API

Step 2: Eliminate other options

Final Answer:

Quick Check:

Solution

Step 1: Understand anomaly score meaning

Step 2: Identify timestamp with high score

Final Answer:

Quick Check:

Solution

Step 1: Check datafeed status

Step 2: Evaluate other options

Final Answer:

Quick Check:

Solution

Step 1: Create ML job with traffic data

Step 2: Start the datafeed to feed data into the job

Step 3: Analyze the anomaly detection results

Final Answer:

Quick Check: