Practice

(1/5)

1. What is the main purpose of platform observability in MLOps?

easy

A. To monitor and understand system performance in real time

B. To set legal contracts with users

C. To deploy machine learning models automatically

D. To store large amounts of data efficiently

Solution

Step 1: Understand observability concept
Observability means seeing how the system behaves and performs live.
Step 2: Match purpose with options
Only To monitor and understand system performance in real time talks about monitoring and understanding performance in real time.
Final Answer:
To monitor and understand system performance in real time -> Option A
Quick Check:
Observability = Real-time performance monitoring [OK]

Hint: Observability = watching system health live [OK]

Common Mistakes:

Confusing observability with deployment
Thinking observability sets contracts
Mixing observability with data storage

2. Which of the following is the correct way to define an SLA uptime of 99.9% in a YAML configuration?

easy

A. sla: uptime: '99.9%'

B. sla: uptime: 99.9

C. sla: uptime: 0.999

D. sla: uptime: '99,9%'

Solution

Step 1: Understand SLA uptime format
SLA uptime is usually expressed as a percentage string like '99.9%'.
Step 2: Check YAML syntax and value correctness
sla: uptime: '99.9%' uses correct YAML syntax and proper string format with percent sign.
Final Answer:
sla:\n uptime: '99.9%' -> Option A
Quick Check:
Correct SLA uptime format = '99.9%' string [OK]

Hint: Use string with percent sign for SLA uptime [OK]

Common Mistakes:

Using number without percent sign
Using decimal instead of percentage
Using comma instead of dot in percentage

3. Given this monitoring alert rule snippet:

if error_rate > 0.05:
  alert('High error rate')
else:
  alert('Error rate normal')

What will be the alert message if error_rate is 0.03?

medium

A. No alert

B. High error rate

C. Error rate normal

D. Syntax error

Solution

Step 1: Evaluate the condition with error_rate = 0.03
0.03 is less than 0.05, so the condition error_rate > 0.05 is false.
Step 2: Determine which alert triggers
Since condition is false, the else branch runs, triggering alert('Error rate normal').
Final Answer:
Error rate normal -> Option C
Quick Check:
0.03 < 0.05 triggers else alert [OK]

Hint: Check if error_rate exceeds threshold [OK]

Common Mistakes:

Confusing greater than with less than
Assuming no alert triggers
Thinking code has syntax error

4. You have this SLA configuration:

sla:
  uptime: '99.95%'
  response_time_ms: 200

But your monitoring shows frequent alerts for response time exceeding 200ms. What is the most likely cause?

medium

A. The uptime percentage is incorrect

B. The SLA response_time_ms is set too low for actual system performance

C. The SLA syntax is invalid YAML

D. The monitoring tool is not running

Solution

Step 1: Analyze SLA and alert mismatch
The SLA sets response_time_ms to 200ms, but alerts show it often exceeds this.
Step 2: Identify cause of frequent alerts
This means the system often responds slower than 200ms, so SLA is too strict or system needs improvement.
Final Answer:
The SLA response_time_ms is set too low for actual system performance -> Option B
Quick Check:
Strict SLA causes frequent alerts [OK]

Hint: Check if SLA limits match real system speed [OK]

Common Mistakes:

Blaming uptime for response time alerts
Assuming YAML syntax error without checking
Ignoring monitoring tool status

5. You want to combine observability metrics and SLA checks to alert only when uptime drops below 99.9% and error rate exceeds 1%. Which pseudo-code correctly implements this?

hard

A. if uptime >= 99.9 and error_rate >= 0.01: alert('SLA breach')

B. if uptime > 99.9 or error_rate < 0.01: alert('SLA breach')

C. if uptime <= 99.9 and error_rate <= 0.01: alert('SLA breach')

D. if uptime < 99.9 and error_rate > 0.01: alert('SLA breach')

Solution

Step 1: Understand SLA breach conditions
SLA breach means uptime is less than 99.9% AND error rate is greater than 1% (0.01).
Step 2: Match condition logic with options
if uptime < 99.9 and error_rate > 0.01: alert('SLA breach') uses < for uptime and > for error rate combined with AND, matching the requirement exactly.
Final Answer:
if uptime < 99.9 and error_rate > 0.01:\n alert('SLA breach') -> Option D
Quick Check:
Use AND with correct inequalities for SLA breach [OK]

Hint: Use AND with uptime < 99.9 and error_rate > 0.01 [OK]

Common Mistakes:

Using OR instead of AND
Reversing inequality signs
Alerting on normal conditions

Input Size (n components)	Approx. Operations
10	About 10 times metrics per component
100	About 100 times metrics per component
1000	About 1000 times metrics per component

Platform observability and SLAs in MLOps - Time & Space Complexity

Start learning this pattern below

Practice

Solution

Step 1: Understand observability concept

Step 2: Match purpose with options

Final Answer:

Quick Check:

Solution

Step 1: Understand SLA uptime format

Step 2: Check YAML syntax and value correctness

Final Answer:

Quick Check:

Solution

Step 1: Evaluate the condition with error_rate = 0.03

Step 2: Determine which alert triggers

Final Answer:

Quick Check:

Solution

Step 1: Analyze SLA and alert mismatch

Step 2: Identify cause of frequent alerts

Final Answer:

Quick Check:

Solution

Step 1: Understand SLA breach conditions

Step 2: Match condition logic with options

Final Answer:

Quick Check: