
Monitoring and observability in Prompt Engineering / GenAI - Practice Problems & Coding Challenges

Challenge - 5 Problems
🧠 Conceptual
intermediate
Understanding the difference between monitoring and observability

Which statement best describes the difference between monitoring and observability in a software system?

A. Monitoring collects predefined metrics and alerts on known issues, while observability enables understanding unknown issues by analyzing logs, metrics, and traces.
B. Monitoring is only about logging errors, and observability is about fixing bugs automatically.
C. Monitoring and observability are the same; both only collect logs for debugging.
D. Observability is a tool, and monitoring is a process that runs inside it.
💡 Hint

Think about how each helps with known vs unknown problems.

💻 Command Output
intermediate
Prometheus query output interpretation

Given the Prometheus query rate(http_requests_total[5m]), what does the output represent?

A. The number of HTTP requests in the last 5 seconds.
B. The total number of HTTP requests since the server started.
C. The average number of HTTP requests per second over the last 5 minutes.
D. The current number of active HTTP connections.
💡 Hint

Consider what rate() function calculates in Prometheus.
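
To make the hint concrete, here is a minimal Python sketch of the idea behind rate() on a counter: total increase divided by elapsed seconds. It is a simplification, not Prometheus's implementation; real rate() also extrapolates to the edges of the range window. The function name simple_rate and the sample data are hypothetical.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (timestamp in seconds, counter value)


def simple_rate(samples: List[Sample]) -> float:
    """Per-second average increase of a counter across the given samples.

    Simplified illustration only: no extrapolation to window boundaries.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    increase = 0.0
    for (_, prev), (_, curr) in zip(samples, samples[1:]):
        # A drop in value is treated as a counter reset: count from zero.
        increase += curr - prev if curr >= prev else curr
    elapsed = samples[-1][0] - samples[0][0]
    return increase / elapsed


# Hypothetical scrape data: one sample per minute across a 5-minute window.
samples = [(0, 100), (60, 160), (120, 220), (180, 280), (240, 340), (300, 400)]
print(simple_rate(samples))  # 300 requests over 300 seconds
```

The result is a per-second figure, not a raw count, which is why the window length only affects how much history is averaged.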

Configuration
advanced
Configuring alert rules in Prometheus

Which Prometheus alert rule configuration will correctly trigger an alert when CPU usage exceeds 80% for 5 minutes?

groups:
- name: cpu_alerts
  rules:
  - alert: HighCPUUsage
    expr: avg(rate(cpu_seconds_total[1m])) by (instance) > 0.8
    for: 5m
    labels:
      severity: critical
    annotations:
      summary: "CPU usage is above 80%"
      description: "Instance {{ $labels.instance }} CPU usage is above 80% for more than 5 minutes."
A.
expr: avg_over_time(cpu_seconds_total[5m]) > 0.8
for: 1m
B.
expr: sum(cpu_seconds_total) by (instance) > 80
for: 5m
C.
expr: rate(cpu_seconds_total[5m]) > 0.8
for: 5m
D.
expr: avg(rate(cpu_seconds_total[1m])) by (instance) > 0.8
for: 5m
💡 Hint

Look for the expression that calculates CPU usage rate averaged per instance over a short interval and checks if it exceeds 0.8 (80%).
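
The role of the for: clause can be illustrated with a small Python sketch: the condition must hold on every evaluation for the whole duration before the alert moves from pending to firing, and a single evaluation below the threshold resets the clock. This is a rough model under stated assumptions, not Prometheus's rule evaluator; alert_states, its parameters, and the sample values are hypothetical.

```python
from typing import Iterable, List, Optional, Tuple


def alert_states(
    evaluations: Iterable[Tuple[float, float]],
    threshold: float = 0.8,
    hold_seconds: float = 300.0,
) -> List[str]:
    """Return the alert state after each (timestamp, value) evaluation."""
    states: List[str] = []
    pending_since: Optional[float] = None
    for ts, value in evaluations:
        if value > threshold:
            if pending_since is None:
                pending_since = ts  # condition just became true
            held = ts - pending_since
            states.append("firing" if held >= hold_seconds else "pending")
        else:
            pending_since = None  # any dip resets the hold timer
            states.append("inactive")
    return states


# Hypothetical CPU usage, evaluated once per minute.
evals = [(0, 0.9), (60, 0.95), (120, 0.5), (180, 0.9), (240, 0.9),
         (300, 0.9), (360, 0.9), (420, 0.9), (480, 0.9)]
print(alert_states(evals))
```

In this run the dip at 120 seconds resets the timer, so the alert only fires at 480 seconds, five minutes after the condition became true again at 180 seconds.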

Troubleshoot
advanced
Diagnosing missing logs in a distributed tracing system

You notice that some traces in your distributed tracing system are missing logs from certain services. What is the most likely cause?

A. The logs are stored but the dashboard is not refreshing automatically.
B. The services are not instrumented properly to send logs to the tracing system.
C. The network is blocking all traffic except metrics data.
D. The tracing system is down and cannot receive any data.
💡 Hint

Think about what is required for logs to appear in traces from each service.
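
One way to picture the requirement is with Python's standard logging module: a log line can only be matched to a trace if the service attaches the trace ID when it writes the line. The sketch below is an assumption-laden illustration, not any tracing vendor's API; current_trace_id, TraceIdFilter, make_logger, and the service names are hypothetical.

```python
import contextvars
import io
import logging

# Hypothetical context variable holding the ID of the trace in progress.
current_trace_id = contextvars.ContextVar("current_trace_id", default="-")


class TraceIdFilter(logging.Filter):
    """Copy the current trace ID onto every record passing through."""

    def filter(self, record: logging.LogRecord) -> bool:
        record.trace_id = current_trace_id.get()
        return True


def make_logger(name: str, stream: io.StringIO, instrumented: bool) -> logging.Logger:
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.propagate = False
    handler = logging.StreamHandler(stream)
    if instrumented:
        handler.addFilter(TraceIdFilter())
        handler.setFormatter(
            logging.Formatter("trace_id=%(trace_id)s %(name)s %(message)s"))
    else:
        # No trace ID in the output: these lines cannot be tied to a trace.
        handler.setFormatter(logging.Formatter("%(name)s %(message)s"))
    logger.addHandler(handler)
    return logger


stream = io.StringIO()
checkout = make_logger("checkout", stream, instrumented=True)
payments = make_logger("payments", stream, instrumented=False)

current_trace_id.set("abc123")
checkout.info("order received")
payments.info("card charged")

lines = stream.getvalue().splitlines()
correlated = [line for line in lines if "trace_id=abc123" in line]
print(correlated)
```

Both services wrote a log line, but only the instrumented one can be found by trace ID, which matches the symptom of traces missing logs from certain services.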

🔀 Workflow
expert
Order the steps to implement observability in a new microservices project

Arrange the following steps in the correct order to implement observability effectively in a new microservices project.

A. 2, 1, 3, 4
B. 1, 2, 3, 4
C. 2, 3, 1, 4
D. 3, 2, 1, 4
💡 Hint

Start by defining goals, then instrument, then collect data, then alert.