Kubernetes / DevOps / ~15 mins

Control plane components (API server, scheduler, controller manager, etcd) in Kubernetes - Deep Dive

Overview - Control plane components (API server, scheduler, controller manager, etcd)
What is it?
The control plane in Kubernetes is the brain of the cluster. It manages the cluster's state and makes decisions about what runs and where. The main parts are the API server, scheduler, controller manager, and etcd. Together, they keep the cluster healthy and running smoothly.
Why it matters
Without the control plane, Kubernetes wouldn't know what to do or how to keep applications running. It solves the problem of managing many computers and containers automatically. Without it, you'd have to manually start, stop, and fix apps on each machine, which is slow and error-prone.
Where it fits
Before learning control plane components, you should understand basic Kubernetes concepts like pods and nodes. After this, you can learn about worker nodes, networking, and how applications run inside the cluster.
Mental Model
Core Idea
The control plane components work together like a command center that watches, decides, and controls the entire Kubernetes cluster.
Think of it like...
Imagine a busy airport control tower: the API server is the radio operator taking requests, the scheduler is the air traffic controller deciding which plane lands where, the controller manager is the ground crew fixing problems and keeping things running, and etcd is the airport's logbook storing all important information.
┌──────────────────────────────────┐
│        Kubernetes Cluster        │
│                                  │
│  ┌────────────────────┐          │
│  │   Control Plane    │          │
│  │                    │          │
│  │  ┌───────────┐     │          │
│  │  │ API Server│◀────┼──────────┼── Client Requests
│  │  └───────────┘     │          │
│  │        │           │          │
│  │        ▼           │          │
│  │  ┌───────────┐     │          │
│  │  │ Scheduler │     │          │
│  │  └───────────┘     │          │
│  │        │           │          │
│  │        ▼           │          │
│  │  ┌──────────────┐  │          │
│  │  │Controller Mgr│  │          │
│  │  └──────────────┘  │          │
│  │        │           │          │
│  │        ▼           │          │
│  │  ┌───────────┐     │          │
│  │  │   etcd    │     │          │
│  │  └───────────┘     │          │
│  └────────────────────┘          │
└──────────────────────────────────┘
Build-Up - 7 Steps
1
Foundation - What is the Kubernetes Control Plane
Concept: Introduce the control plane as the central management part of Kubernetes.
The control plane is the set of components that manage the Kubernetes cluster. It decides what runs on the cluster and keeps track of the cluster's state. It runs on dedicated control plane nodes (historically called master nodes) and communicates with the worker nodes where your apps run.
Result
You understand that the control plane is the brain controlling the cluster.
Understanding the control plane as the cluster's brain helps you see why it is critical for cluster health and management.
2
Foundation - Role of the API Server
Concept: Explain the API server as the main entry point for all commands and communication.
The API server is like the front desk of the control plane. It receives all requests from users, tools, and other components. It validates and processes these requests, then updates the cluster state accordingly.
Result
You know the API server is the gatekeeper and communicator for the cluster.
Knowing the API server handles all communication clarifies how Kubernetes stays consistent and secure.
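The gatekeeper role described above can be sketched as a small pipeline: authenticate the caller, authorize the action, validate the object, and only then touch the state store. The Python below is an illustrative sketch, not the real API server code; the function name, status strings, and data shapes are invented for the example.

```python
# Hypothetical sketch of the API server's request pipeline.
# Every request passes authentication, authorization, and validation
# before the cluster state is updated.

def handle_request(request, store, allowed_users):
    # 1. Authentication: is this a known identity?
    if request["user"] not in allowed_users:
        return "401 Unauthorized"
    # 2. Authorization: may this identity perform this verb?
    if request["verb"] not in allowed_users[request["user"]]:
        return "403 Forbidden"
    # 3. Validation: is the submitted object well-formed?
    if "name" not in request["object"]:
        return "422 Invalid"
    # 4. Persistence: only now is the cluster state changed.
    store[request["object"]["name"]] = request["object"]
    return "201 Created"

store = {}
users = {"alice": {"create", "get"}}
print(handle_request(
    {"user": "alice", "verb": "create", "object": {"name": "my-pod"}},
    store, users))  # 201 Created
```

Because every step can reject the request, nothing reaches storage without passing all the gates, which is why "all roads go through the API server" keeps the cluster consistent.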
3
Intermediate - How the Scheduler Assigns Work
🤔 Before reading on: do you think the scheduler runs your apps or just decides where they run? Commit to your answer.
Concept: Introduce the scheduler's job to assign pods to nodes based on resource availability and rules.
The scheduler watches for new pods that need a home. It looks at the available worker nodes and decides the best place for each pod to run, considering resources like CPU and memory. It then tells the API server where to place the pod.
Result
You understand the scheduler decides where pods run but does not run them itself.
Understanding the scheduler's decision role helps you grasp how Kubernetes balances workloads efficiently.
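The decision the scheduler makes can be pictured as two phases: filter out nodes that cannot fit the pod, then score the survivors and pick the best. The real kube-scheduler is organized around the same filter/score idea, but the Python below is a simplified sketch with made-up node and pod shapes.

```python
# Minimal filter-and-score scheduling sketch (illustrative, not kube-scheduler).

def schedule(pod, nodes):
    # Filter: keep only nodes with enough free CPU and memory.
    feasible = [n for n in nodes
                if n["free_cpu"] >= pod["cpu"] and n["free_mem"] >= pod["mem"]]
    if not feasible:
        return None  # no fit: the pod stays Pending
    # Score: prefer the node with the most free CPU (a simple spreading rule).
    return max(feasible, key=lambda n: n["free_cpu"])["name"]

nodes = [
    {"name": "node-a", "free_cpu": 2.0, "free_mem": 4096},
    {"name": "node-b", "free_cpu": 0.5, "free_mem": 8192},
]
print(schedule({"cpu": 1.0, "mem": 2048}, nodes))  # node-a
```

Note the function only returns a node name: in the real cluster the scheduler likewise writes its decision back through the API server, and the kubelet on that node does the actual running.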
4
Intermediate - Controller Manager's Continuous Checks
🤔 Before reading on: do you think controllers fix problems automatically or just report them? Commit to your answer.
Concept: Explain how the controller manager watches the cluster and fixes issues to keep desired state.
The controller manager runs many controllers that watch the cluster state. If something is wrong, like a pod crashing or a node failing, controllers take action to fix it, such as restarting pods or rescheduling them.
Result
You see the controller manager as the cluster's repair and maintenance team.
Knowing controllers automatically fix problems shows how Kubernetes self-heals and stays reliable.
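Each controller follows the same pattern: compare desired state with observed state and emit corrective actions. Here is a toy reconcile function in that spirit, roughly what a replica-count controller does; the action tuples are invented for the sketch.

```python
# Toy reconciliation loop: desired vs. observed replicas (illustrative).

def reconcile(desired_replicas, running_pods):
    # Compare desired state with observed state; return corrective actions.
    diff = desired_replicas - len(running_pods)
    if diff > 0:
        # Too few pods: create the missing ones.
        return [("create_pod", None)] * diff
    if diff < 0:
        # Too many pods: delete the surplus.
        return [("delete_pod", p) for p in running_pods[:-diff]]
    return []  # desired state already matches reality

print(reconcile(3, ["web-1"]))  # [('create_pod', None), ('create_pod', None)]
```

Running this comparison in an endless loop, for every kind of resource, is what makes the cluster self-healing: the fix happens as soon as the observed state drifts.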
5
Intermediate - etcd as the Cluster's Database
Concept: Describe etcd as the key-value store that saves all cluster data persistently.
etcd stores the entire state of the cluster, like what pods exist, their status, and configurations. It is a distributed database that ensures data is safe and consistent even if some parts fail.
Result
You understand etcd is the single source of truth for the cluster state.
Recognizing etcd as the cluster's database explains how Kubernetes recovers and maintains consistency.
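Conceptually, etcd is a flat key-value store in which Kubernetes keeps each object under a path-like key (the real keys live under a `/registry/...` prefix). A plain dictionary is enough to sketch the idea; the values below are simplified placeholders, not real object encodings.

```python
# Sketch of how cluster state sits in etcd as path-like keys (illustrative).
cluster_state = {
    "/registry/pods/default/web-1": {"phase": "Running", "node": "node-a"},
    "/registry/pods/default/web-2": {"phase": "Pending", "node": None},
    "/registry/nodes/node-a": {"ready": True},
}

def list_prefix(store, prefix):
    # etcd answers range reads by key prefix; this is how
    # "list all pods in a namespace" is served.
    return {k: v for k, v in store.items() if k.startswith(prefix)}

print(list_prefix(cluster_state, "/registry/pods/default/"))
```

Because everything lives in this one store, backing up etcd backs up the cluster's entire desired and observed state.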
6
Advanced - How Components Communicate Securely
🤔 Before reading on: do you think control plane components communicate openly or use secured channels? Commit to your answer.
Concept: Explain the secure communication between control plane components using certificates and encryption.
All control plane components communicate over TLS-secured HTTPS connections, identifying each other with certificates (mutual TLS). This prevents unauthorized access and ensures data integrity. On top of the encrypted channel, the API server authenticates every request and authorizes each action to protect the cluster.
Result
You know control plane communication is secure and trusted.
Understanding secure communication prevents common security mistakes in cluster setup.
7
Expert - etcd Consistency and High Availability
🤔 Before reading on: do you think etcd can lose data if a node fails? Commit to your answer.
Concept: Dive into how etcd uses consensus algorithms to keep data consistent and available across multiple nodes.
etcd uses the Raft consensus algorithm to replicate data across multiple nodes. This means even if some nodes fail, etcd keeps the cluster state consistent and available. Losing quorum can cause the cluster to become read-only until fixed.
Result
You understand etcd's internal mechanism for data safety and cluster reliability.
Knowing etcd's consensus mechanism explains why etcd clusters must be carefully managed for production stability.
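The quorum rule behind this is simple arithmetic: a write commits only when a strict majority of members agree, so an n-member cluster tolerates the loss of n minus majority members. This little calculation also shows why etcd clusters use odd sizes: going from 3 to 4 members raises the quorum without tolerating any extra failures.

```python
# Raft quorum arithmetic: majority needed to commit, failures tolerated.

def quorum(members):
    # A write commits once a strict majority of members acknowledge it.
    return members // 2 + 1

def tolerated_failures(members):
    return members - quorum(members)

for n in (1, 3, 4, 5):
    print(f"{n} members: quorum {quorum(n)}, tolerates {tolerated_failures(n)} failure(s)")
# 3 members tolerate 1 failure; 4 members still tolerate only 1;
# 5 members tolerate 2.
```

Lose more members than the tolerated count and the cluster has no quorum: reads of stale data may still work, but no new writes commit until members return.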
Under the Hood
The API server exposes a RESTful interface that all components and users interact with. It stores cluster state in etcd, a distributed key-value store using Raft consensus for consistency. The scheduler watches the API server for unscheduled pods and assigns them to nodes based on resource and policy constraints. The controller manager runs multiple controllers that continuously compare desired state (from etcd) with actual state and take corrective actions by updating the API server. All communication is secured with TLS certificates to ensure trust and prevent tampering.
Why designed this way?
Kubernetes was designed to manage large, dynamic clusters reliably. Separating concerns into distinct components allows scalability and fault tolerance. Using etcd with consensus ensures a single source of truth that survives failures. The API server centralizes communication for consistency and security. This modular design allows independent development and scaling of components.
┌───────────────┐      ┌───────────────┐      ┌───────────────┐
│    Clients    │─────▶│   API Server  │◀────▶│     etcd      │
└───────────────┘      └───────┬───────┘      └───────────────┘
                               ▲
        ┌──────────────────────┼──────────────────────┐
        │                      │                      │
┌───────┴───────┐      ┌───────┴───────┐      ┌───────┴───────┐
│   Scheduler   │      │  Controller   │      │ Worker Nodes  │
│               │      │    Manager    │      │(kubelet, pods)│
└───────────────┘      └───────────────┘      └───────────────┘
Myth Busters - 4 Common Misconceptions
Quick: Does the scheduler run your application containers directly? Commit yes or no.
Common Belief:The scheduler runs the application containers on the nodes.
Reality:The scheduler only decides which node a pod should run on; the kubelet on the node actually runs the containers.
Why it matters:Thinking the scheduler runs containers can confuse troubleshooting and cluster design, leading to wrong assumptions about workload placement.
Quick: Is etcd just a backup storage for Kubernetes? Commit yes or no.
Common Belief:etcd is just a backup database that stores logs or snapshots occasionally.
Reality:etcd is the primary, real-time database holding the entire cluster state and configuration.
Why it matters:Underestimating etcd's role risks data loss and cluster inconsistency if etcd is not properly maintained.
Quick: Do controller managers only report problems without fixing them? Commit yes or no.
Common Belief:Controller managers only monitor and report cluster issues; humans fix them.
Reality:Controller managers automatically detect and fix many issues to maintain the desired cluster state.
Why it matters:Misunderstanding this leads to unnecessary manual intervention and missed benefits of Kubernetes self-healing.
Quick: Can the API server be bypassed to change cluster state? Commit yes or no.
Common Belief:You can directly update etcd or nodes to change cluster state without the API server.
Reality:All changes must go through the API server to ensure validation, security, and consistency.
Why it matters:Bypassing the API server can corrupt cluster state and cause unpredictable behavior.
Expert Zone
1
The API server uses watch APIs to stream real-time updates to components, reducing load and latency compared to polling.
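The watch pattern can be contrasted with polling in a few lines: instead of re-listing every object on a timer, a component blocks on a stream and reacts only when something changes. In the sketch below a simple in-process queue stands in for the long-lived HTTP watch connection a real client would open; the event tuples are invented for the example.

```python
# Sketch of the watch pattern: block on an event stream instead of polling.
from queue import Queue

def watch(events, handler, limit):
    # The component sleeps until an event arrives, then reacts immediately.
    # (A real Kubernetes client holds an HTTP watch open to the API server.)
    for _ in range(limit):
        handler(events.get())

events = Queue()
events.put(("ADDED", "pod/web-1"))
events.put(("DELETED", "pod/web-1"))

seen = []
watch(events, seen.append, limit=2)
print(seen)  # [('ADDED', 'pod/web-1'), ('DELETED', 'pod/web-1')]
```

The payoff is exactly the one the point above names: no wasted list requests when nothing changed, and near-zero latency when something did.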
2
The scheduler supports custom scheduling policies and plugins, allowing fine-tuned workload placement beyond default resource checks.
3
etcd performance and latency directly impact cluster responsiveness; tuning etcd and its storage backend is critical for large clusters.
When NOT to use
The default control plane components are not suitable for extremely lightweight or edge environments where minimal resource use is critical; alternatives like K3s use simplified control planes. For very large clusters, managed Kubernetes services or custom scheduler/controller implementations may be preferred.
Production Patterns
In production, control plane components run on dedicated master nodes with high availability setups. etcd clusters are deployed with odd numbers of nodes for quorum. Custom controllers extend functionality for specific business needs. Monitoring and alerting on control plane health is standard practice.
Connections
Distributed Consensus Algorithms
etcd uses Raft consensus, a distributed consensus algorithm.
Understanding Raft helps grasp how etcd keeps data consistent and available despite failures.
Operating System Process Scheduling
Kubernetes scheduler assigns pods to nodes similar to how OS schedules processes to CPUs.
Knowing OS scheduling concepts clarifies how Kubernetes balances workloads and optimizes resource use.
Air Traffic Control Systems
The control plane components coordinate cluster operations like air traffic controllers manage flights.
Studying air traffic control systems reveals principles of coordination, safety, and fault tolerance applicable to Kubernetes control planes.
Common Pitfalls
#1Trying to update cluster state by modifying etcd data directly.
Wrong approach:etcdctl put /registry/pods/default/my-pod '{"spec":...}'
Correct approach:kubectl apply -f my-pod.yaml
Root cause:Misunderstanding that etcd is a backend store and all changes must go through the API server for validation and consistency.
#2Running control plane components on the same nodes as worker nodes in production.
Wrong approach:Deploying API server, scheduler, controller manager, and pods all on the same physical or virtual machines.
Correct approach:Deploy control plane components on dedicated master nodes separate from worker nodes.
Root cause:Not recognizing the need for isolation to ensure control plane stability and security.
#3Ignoring etcd backup and recovery planning.
Wrong approach:No regular etcd snapshots or disaster recovery procedures configured.
Correct approach:Set up automated etcd backups and test recovery regularly.
Root cause:Underestimating etcd's critical role and the risk of data loss.
Key Takeaways
The Kubernetes control plane is the central brain managing cluster state and decisions.
The API server is the gatekeeper that validates and processes all requests to the cluster.
The scheduler decides where pods run but does not run them itself.
The controller manager continuously monitors and fixes cluster state to keep it healthy.
etcd is the distributed database that stores the entire cluster state reliably and consistently.