Overview - Quality of Service classes (Guaranteed, Burstable, BestEffort)

What is it?

Quality of Service (QoS) classes in Kubernetes define how the system treats pods when resources are limited. There are three classes: Guaranteed, Burstable, and BestEffort. These classes help Kubernetes decide which pods get priority for CPU and memory during high demand. They are based on how resource requests and limits are set for each pod.

Why it matters

Without QoS classes, Kubernetes would not know which pods are more important or how to fairly share limited resources. This could cause critical applications to slow down or crash when the cluster is busy. QoS classes ensure that important workloads get the resources they need, improving stability and performance for users.

Where it fits

Learners should first understand basic Kubernetes concepts like pods, containers, and resource requests/limits. After QoS classes, they can explore advanced scheduling, resource quotas, and cluster autoscaling to manage resources efficiently.

Mental Model

Core Idea

Kubernetes uses QoS classes to prioritize pods based on their resource guarantees, ensuring fair and stable resource sharing under pressure.

Think of it like...

Imagine a busy restaurant kitchen where orders are prioritized: VIP guests get their meals cooked first (Guaranteed), regular customers get served as soon as possible (Burstable), and walk-ins wait until there's free time (BestEffort).

┌───────────────┐
│   Pod QoS     │
├───────────────┤
│ Guaranteed    │  ← Pods with equal requests and limits
│ Burstable     │  ← Pods with requests less than limits
│ BestEffort    │  ← Pods with no requests or limits
└───────────────┘

Resource Pressure → Kubernetes evicts pods starting from BestEffort, then Burstable, keeping Guaranteed last.

Build-Up - 7 Steps

1

FoundationUnderstanding Kubernetes Pods and Resources

Concept: Pods run containers and can request CPU and memory resources.

In Kubernetes, a pod is the smallest unit that runs one or more containers. Each container can ask for a certain amount of CPU and memory (called requests) and can set a maximum limit for these resources (called limits). Requests tell Kubernetes how much resource the pod needs to run well, while limits cap the maximum it can use.

Result

Pods have defined resource needs that Kubernetes can use to schedule them on nodes.

Knowing how pods request and limit resources is essential to understanding how Kubernetes manages workload priorities.

2

FoundationResource Requests vs Limits Explained

3

IntermediateDefining Guaranteed QoS Class

4

IntermediateUnderstanding Burstable QoS Class

5

IntermediateExploring BestEffort QoS Class

6

AdvancedHow Kubernetes Uses QoS for Eviction

7

ExpertSubtle Effects of Mixed Container QoS in Pods

Under the Hood

Kubernetes calculates QoS class by inspecting each container's resource requests and limits in a pod. If all containers have equal requests and limits, the pod is Guaranteed. If requests exist but limits are higher, it's Burstable. If no requests or limits are set, it's BestEffort. The kubelet uses this classification to prioritize pods during scheduling and eviction, ensuring resource fairness and stability.

Why designed this way?

This design balances strict resource guarantees with flexibility. Guaranteed pods get strong protection for critical workloads. Burstable pods allow efficient resource use by permitting bursts. BestEffort pods maximize cluster utilization but accept eviction risk. Alternatives like static priority would be less flexible and harder to manage dynamically.

┌─────────────────────────────┐
│ Pod QoS Classification Flow │
├─────────────────────────────┤
│ For each container in pod:  │
│  Check requests and limits  │
│                             │
│ ┌─────────────────────────┐ │
│ │ All equal requests=limits│─┐
│ └─────────────────────────┘ │
│           Yes              │
│           ↓                │
│       Pod = Guaranteed     │
│           No               │
│           ↓                │
│ Requests set?              │
│   Yes → Pod = Burstable    │
│   No  → Pod = BestEffort   │
└─────────────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Does setting only resource limits without requests make a pod Guaranteed? Commit yes or no.

Common Belief:If I set resource limits, my pod is Guaranteed because it has resource caps.

Tap to reveal reality

Quick: Are BestEffort pods guaranteed any CPU or memory? Commit yes or no.

Common Belief:BestEffort pods still get some guaranteed CPU or memory from the node.

Tap to reveal reality

Quick: Can a pod with multiple containers have mixed QoS classes? Commit yes or no.

Common Belief:Each container in a pod can have its own QoS class independently.

Tap to reveal reality

Quick: Does Kubernetes always evict pods based on QoS class only? Commit yes or no.

Common Belief:Kubernetes evicts pods strictly by QoS class order every time.

Tap to reveal reality

Expert Zone

1

Guaranteed QoS requires exact equality of requests and limits for every resource (CPU and memory) in all containers, including init containers.

2

Burstable pods can still be evicted before Guaranteed pods even if they use less resources, because QoS is about guarantees, not actual usage.

3

Pod priority and QoS interact but are separate; a low-priority Guaranteed pod can be evicted before a high-priority Burstable pod under some conditions.

When NOT to use

Avoid relying solely on QoS classes for critical workload protection; use Pod Priority and Preemption for finer control. For strict resource isolation, consider using Kubernetes namespaces with ResourceQuotas or dedicated nodes.

Production Patterns

In production, Guaranteed QoS is used for critical system components like databases. Burstable is common for user-facing apps that can tolerate some resource variability. BestEffort is reserved for batch jobs or non-critical workloads that can be interrupted without impact.

Connections

Linux cgroups

QoS classes build on Linux cgroups resource limits and priorities.

Understanding cgroups helps grasp how Kubernetes enforces resource limits and prioritizes pods at the OS level.

Operating System Process Scheduling

Kubernetes QoS classes conceptually mirror OS process priority levels.

Knowing OS scheduling clarifies why Guaranteed pods get CPU preference similar to high-priority processes.

Traffic Prioritization in Networking

QoS in Kubernetes is similar to network QoS that prioritizes important data packets.

Recognizing this analogy helps understand how resource prioritization improves overall system performance and fairness.

Common Pitfalls

#1Not setting resource requests and limits causes pods to be BestEffort unintentionally.

Wrong approach:apiVersion: v1 kind: Pod metadata: name: mypod spec: containers: - name: app image: nginx

Correct approach:apiVersion: v1 kind: Pod metadata: name: mypod spec: containers: - name: app image: nginx resources: requests: cpu: "100m" memory: "200Mi" limits: cpu: "100m" memory: "200Mi"

Root cause:Beginners often skip resource settings, not realizing this defaults pods to BestEffort, risking eviction.

#2Setting requests but no limits expecting Guaranteed QoS.

Wrong approach:resources: requests: cpu: "100m" memory: "200Mi"

Correct approach:resources: requests: cpu: "100m" memory: "200Mi" limits: cpu: "100m" memory: "200Mi"

Root cause:Misunderstanding that both requests and limits must be equal for Guaranteed class.

#3Assuming pod QoS is determined per container, not per pod.

Wrong approach:Container A: requests=limits=100m CPU Container B: requests=50m CPU, limits=100m CPU Assuming pod is Guaranteed

Correct approach:Set both containers with equal requests and limits to achieve Guaranteed QoS for the pod.

Root cause:Confusing container-level resource settings with pod-level QoS classification.

Key Takeaways

Kubernetes QoS classes prioritize pods based on resource requests and limits to manage cluster resources fairly.

Guaranteed pods have equal requests and limits for all containers, ensuring the highest resource protection.

Burstable pods have requests less than limits, allowing flexible resource use but lower eviction priority than Guaranteed.

BestEffort pods have no resource requests or limits and are the first to be evicted under resource pressure.

Understanding QoS classes helps design stable, efficient workloads and avoid unexpected pod evictions.