Overview - Resource quotas per namespace

What is it?

Resource quotas per namespace in Kubernetes are limits set on the amount of resources like CPU, memory, and storage that can be used by all the containers and pods within a specific namespace. A namespace is like a separate space or room inside a Kubernetes cluster where resources and objects are grouped. Setting quotas helps control how much resource each namespace can consume to avoid one group using too much and affecting others.

Why it matters

Without resource quotas, one team or application could use all the cluster's resources, causing other teams' applications to slow down or fail. Quotas ensure fair sharing and prevent accidental or malicious overuse. This keeps the cluster stable and predictable, which is important for running many applications together smoothly.

Where it fits

Before learning resource quotas, you should understand Kubernetes namespaces and basic resource concepts like CPU and memory requests and limits. After mastering quotas, you can explore advanced resource management like LimitRanges, Quality of Service classes, and cluster autoscaling.

Mental Model

Core Idea

Resource quotas act like a budget that limits how much computing power and storage each namespace can spend inside a Kubernetes cluster.

Think of it like...

Imagine a shared apartment where each roommate has a monthly budget for utilities and groceries. Resource quotas are like setting those budgets so no one spends more than their share, keeping the household running smoothly.

┌───────────────────────────────┐
│ Kubernetes Cluster             │
│ ┌───────────────┐             │
│ │ Namespace A   │             │
│ │ ┌───────────┐ │             │
│ │ │ Quota:    │ │             │
│ │ │ CPU: 2    │ │             │
│ │ │ Memory: 4G│ │             │
│ │ └───────────┘ │             │
│ └───────────────┘             │
│ ┌───────────────┐             │
│ │ Namespace B   │             │
│ │ ┌───────────┐ │             │
│ │ │ Quota:    │ │             │
│ │ │ CPU: 1    │ │             │
│ │ │ Memory: 2G│ │             │
│ │ └───────────┘ │             │
│ └───────────────┘             │
└───────────────────────────────┘

Build-Up - 7 Steps

1

FoundationUnderstanding Kubernetes namespaces

Concept: Namespaces divide a Kubernetes cluster into virtual sections to organize and isolate resources.

A Kubernetes cluster can run many applications for different teams or projects. Namespaces help keep these separate by grouping resources like pods, services, and deployments into named spaces. This prevents name conflicts and allows different teams to work independently.

Result

You can create and manage resources in separate namespaces, keeping them organized and isolated.

Knowing namespaces is essential because resource quotas apply at this level, controlling resource use per namespace.

2

FoundationBasics of Kubernetes resource requests and limits

3

IntermediateWhat resource quotas control in namespaces

4

IntermediateHow to define a resource quota YAML

5

IntermediateEnforcing quotas and observing effects

6

AdvancedCombining resource quotas with LimitRanges

7

ExpertSurprising quota behavior with burstable pods

Under the Hood

Resource quotas work by Kubernetes API server tracking the sum of resource requests and object counts in each namespace. When a new resource creation or update request arrives, the API server checks if adding it would exceed any quota limit. If yes, it rejects the request with a quota exceeded error. This check happens before the resource is persisted. The quota controller continuously monitors usage and updates status to reflect current consumption.

Why designed this way?

This design centralizes quota enforcement at the API server to ensure consistency and prevent race conditions. It avoids complex runtime enforcement inside nodes, which would be harder to coordinate. Counting requests rather than actual usage simplifies enforcement and scheduling decisions, though it allows burstable usage beyond quotas.

┌───────────────────────────────┐
│ Kubernetes API Server          │
│ ┌───────────────────────────┐ │
│ │ Receives resource requests │ │
│ └─────────────┬─────────────┘ │
│               │ Checks quota limits
│               ▼               │
│ ┌───────────────────────────┐ │
│ │ Quota Controller tracks   │ │
│ │ resource usage per ns     │ │
│ └─────────────┬─────────────┘ │
│               │ Accept or reject
│               ▼               │
│ ┌───────────────────────────┐ │
│ │ Resource stored if allowed │ │
│ └───────────────────────────┘ │
└───────────────────────────────┘

Myth Busters - 4 Common Misconceptions

Quick: Do resource quotas limit actual CPU and memory usage or only resource requests? Commit to your answer.

Common Belief:Resource quotas limit the actual CPU and memory usage of pods in a namespace.

Tap to reveal reality

Quick: Can resource quotas be applied cluster-wide or only per namespace? Commit to your answer.

Common Belief:Resource quotas can be set globally for the entire Kubernetes cluster.

Tap to reveal reality

Quick: Does creating a pod that exceeds quota limits run then get killed, or is it blocked upfront? Commit to your answer.

Common Belief:Pods that exceed resource quotas start running but get killed later if they use too much.

Tap to reveal reality

Quick: Do resource quotas count limits or requests for resource usage? Commit to your answer.

Common Belief:Resource quotas count the resource limits set on pods and containers.

Tap to reveal reality

Expert Zone

1

Resource quotas count resource requests and object counts but do not track actual usage, so burstable pods can exceed quotas in practice.

2

Quota enforcement happens at the API server during resource creation, preventing race conditions but not controlling runtime spikes.

3

Combining resource quotas with LimitRanges allows fine-grained control over both total namespace usage and individual pod resource sizes.

When NOT to use

Resource quotas are not suitable for enforcing strict runtime resource caps or cluster-wide limits. For runtime enforcement, use cgroups or node-level controls. For cluster-wide limits, consider ClusterResourceQuota (in OpenShift) or admission controllers. Also, quotas do not prevent resource starvation caused by burstable pods exceeding requests.

Production Patterns

In production, teams use resource quotas per namespace to allocate fair resource budgets to different projects or teams. They combine quotas with LimitRanges to enforce minimum and maximum pod sizes. Monitoring tools track actual usage to detect burstable pod spikes. Quotas are integrated into CI/CD pipelines to prevent resource overuse before deployment.

Connections

Linux cgroups

Resource quotas limit resource requests at the Kubernetes API level, while cgroups enforce actual resource usage limits at the OS level.

Understanding cgroups helps grasp why Kubernetes quotas count requests but cannot strictly cap runtime usage.

Project budgeting in finance

Resource quotas are like financial budgets that limit spending per project (namespace) to avoid overspending cluster resources.

Knowing budgeting principles clarifies why quotas prevent resource overuse and ensure fair sharing.

Traffic shaping in networking

Both resource quotas and traffic shaping control how much resource (CPU/memory or bandwidth) a group can consume to prevent congestion.

Recognizing this pattern across domains highlights the importance of fair resource allocation for system stability.

Common Pitfalls

#1Trying to limit actual pod CPU usage with resource quotas.

Wrong approach:apiVersion: v1 kind: ResourceQuota metadata: name: cpu-limit-quota namespace: team-a spec: hard: limits.cpu: "4"

Correct approach:apiVersion: v1 kind: ResourceQuota metadata: name: cpu-request-quota namespace: team-a spec: hard: requests.cpu: "4"

Root cause:Resource quotas count resource requests, not limits; using limits.cpu in quotas has no effect.

#2Applying resource quotas without namespaces or to the wrong namespace.

Wrong approach:kubectl apply -f quota.yaml # quota.yaml missing or wrong namespace metadata

Correct approach:apiVersion: v1 kind: ResourceQuota metadata: name: example-quota namespace: correct-namespace spec: hard: pods: "10"

Root cause:Resource quotas must specify the target namespace; otherwise, they do not apply or apply to default namespace.

#3Expecting pods to be killed if they exceed quota after creation.

Wrong approach:Create pod exceeding quota and expect it to run then be terminated by quota enforcement.

Correct approach:Pods exceeding quota are blocked at creation time with an error; they never run.

Root cause:Misunderstanding that quota enforcement happens at resource creation, not runtime.

Key Takeaways

Resource quotas limit the total resource requests and object counts per namespace to ensure fair resource sharing in Kubernetes clusters.

Quotas enforce limits at resource creation time by the API server, preventing overuse before pods run.

They count resource requests, not actual usage or limits, so burstable pods can exceed quotas in runtime usage.

Combining resource quotas with LimitRanges provides control over both total namespace usage and individual pod resource sizes.

Understanding quotas helps maintain cluster stability and fairness among multiple teams or projects sharing the same cluster.