Which of the following best describes the relationship between SLA, SLO, and SLI?
Think about which term is a measurement, which is a goal, and which is a contract.
An SLI (Service Level Indicator) is a specific metric that measures service quality. An SLO (Service Level Objective) is the target value for that metric. An SLA (Service Level Agreement) is the formal contract that includes the SLO and consequences if the SLO is not met.
Which of the following is an example of a Service Level Indicator (SLI)?
SLI is a measurable metric, not a contract or penalty.
SLI is a metric that measures service quality, such as the percentage of successful HTTP requests. The other options describe agreements or penalties, not metrics.
You need to design a system to collect SLIs for a web service's availability and latency. Which architectural component is essential to ensure accurate and scalable SLI collection?
Consider real-time, scalable, and accurate metric collection.
Distributed agents collecting metrics in real-time and pushing to a central aggregator allow accurate, scalable, and timely SLI measurement. Centralized logging with offline processing is slower and less scalable. Manual review is not scalable or timely. Client-side surveys measure user experience but are not direct SLIs.
Your service has SLAs with different customers worldwide, each with unique SLOs. What is the best approach to scale SLA enforcement and reporting?
Think about customization and scalability for multiple customers.
A multi-tenant SLA enforcement system that tracks each customer's SLOs separately allows accurate enforcement and reporting per contract. Applying the strictest SLO globally is unfair and inaccurate. Outsourcing without integration loses control. Ignoring individual SLAs risks contract violations.
You want to increase the granularity of SLIs by measuring latency per API endpoint every second. What is a key tradeoff to consider?
Think about the balance between detail and resource use.
Increasing granularity improves accuracy and insight but also increases the load on monitoring infrastructure and storage needs. It does not reduce accuracy or eliminate SLAs. It cannot guarantee zero downtime.