Apache Airflow · DevOps · ~10 mins

Kubernetes executor for dynamic scaling in Apache Airflow - Step-by-Step Execution

Process Flow - Kubernetes executor for dynamic scaling
Airflow Scheduler receives task
Scheduler sends task to Kubernetes Executor
Kubernetes Executor creates a Pod for the task
Pod runs the task
Task completes, Pod terminates
Kubernetes Executor reports task status back to Scheduler
Scheduler updates task state
If more tasks, repeat Pod creation
Kubernetes cluster scales Pods dynamically based on demand
The Kubernetes executor creates a new Pod for each task dynamically, runs the task inside it, and scales Pods up or down based on workload.
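The flow above can be sketched as a minimal simulation. This is illustrative plain Python, not Airflow's actual executor code; the class and method names (Pod, KubernetesExecutorSim, submit) are hypothetical:

```python
# Minimal simulation of the flow: one Pod per task, created on
# submission and deleted on completion. Names are illustrative only.

class Pod:
    def __init__(self, task_id):
        self.task_id = task_id
        self.state = "Pending"   # Pod starts Pending after creation

    def run(self):
        self.state = "Running"   # container executes the task command
        self.state = "Succeeded" # command exited successfully


class KubernetesExecutorSim:
    def __init__(self):
        self.pods = {}           # live Pods, keyed by task id
        self.task_states = {}    # what the scheduler would record

    def submit(self, task_id):
        pod = Pod(task_id)       # a new Pod for each task, never reused
        self.pods[task_id] = pod
        self.task_states[task_id] = "running"
        pod.run()
        self.task_states[task_id] = "success"
        del self.pods[task_id]   # Pod is cleaned up, freeing resources


executor = KubernetesExecutorSim()
executor.submit("print_date")
print(executor.task_states)  # {'print_date': 'success'}
print(executor.pods)         # {} — scaled back down to zero Pods
```

With no tasks left, the simulated executor holds no Pods, mirroring the scale-to-zero behavior in the last flow step.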
Execution Sample
Apache Airflow
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG('example', start_date=datetime(2024, 1, 1)) as dag:
    # Runs the 'date' shell command; the executor launches it in its own Pod
    t1 = BashOperator(task_id='print_date', bash_command='date')
This Airflow DAG defines a simple task that prints the date, which Kubernetes executor will run in a separate Pod.
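Conceptually, for this task the executor builds a Kubernetes Pod spec whose container runs the Airflow task command (`airflow tasks run <dag_id> <task_id> <run_id>`). A rough sketch of such a spec as a plain Python dict; the Pod name, image tag, and run id are hypothetical examples, not values Airflow guarantees:

```python
# Illustrative Pod spec the executor might generate for 'print_date'.
# Field names follow the Kubernetes Pod schema; concrete values are examples.
pod_spec = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "print-date-pod"},      # hypothetical Pod name
    "spec": {
        "restartPolicy": "Never",                # one-shot task Pod
        "containers": [{
            "name": "base",
            "image": "apache/airflow:2.9.0",     # example image tag
            "args": ["airflow", "tasks", "run",  # the task command
                     "example", "print_date", "manual__2024-01-01"],
        }],
    },
}
print(pod_spec["spec"]["containers"][0]["args"][:3])
```

`restartPolicy: Never` matches the one-shot lifecycle in the table below: the Pod runs the task once and is never restarted, only cleaned up.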
Process Table
| Step | Action | Kubernetes Executor Behavior | Pod State | Scheduler State |
|------|--------|------------------------------|-----------|-----------------|
| 1 | Scheduler receives task 'print_date' | No Pod yet | No Pod | Task queued |
| 2 | Scheduler sends task to Kubernetes Executor | Creates Pod 'print_date-pod' | Pod created, Pending | Task running |
| 3 | Pod starts running task | Pod status changes to Running | Running | Task running |
| 4 | Task executes 'date' command inside Pod | Pod runs command | Running | Task running |
| 5 | Task completes successfully | Pod status changes to Succeeded | Succeeded | Task success |
| 6 | Pod terminates and is cleaned up | Pod deleted | No Pod | Task success |
| 7 | Scheduler ready for next task | Waits for next task | No Pod | Idle |
💡 With no more tasks to run, the Kubernetes executor scales down to zero Pods.
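The Pod's passage through the table can be expressed as a short state sequence (a sketch; the states are the standard Kubernetes Pod phases, the function name is illustrative):

```python
def pod_lifecycle():
    """Yield the states a one-shot task Pod passes through."""
    yield "Pending"    # step 2: Pod created by the executor
    yield "Running"    # steps 3-4: container executing the task command
    yield "Succeeded"  # step 5: task command exited successfully
    # Step 6: the Pod is deleted, so no further state is reported.

print(list(pod_lifecycle()))  # ['Pending', 'Running', 'Succeeded']
```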
Status Tracker
| Variable | Start | After Step 2 | After Step 3 | After Step 5 | Final |
|----------|-------|--------------|--------------|--------------|-------|
| Pod State | None | Pending | Running | Succeeded | Deleted |
| Task State | Queued | Running | Running | Success | Success |
Key Moments - 3 Insights
Why does the Kubernetes executor create a new Pod for each task instead of reusing one?
Each task runs in its own Pod to isolate execution and allow dynamic scaling. See execution_table rows 2 and 3 where a new Pod is created and started for the task.
What happens to the Pod after the task completes?
The Pod status changes to 'Succeeded' and then the Pod is deleted to free resources, as shown in execution_table rows 5 and 6.
How does the Kubernetes executor handle multiple tasks arriving at once?
It creates multiple Pods dynamically, scaling up the cluster as needed. This is implied in the flow where each task triggers Pod creation (concept_flow).
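The third insight above, one Pod per task even when tasks arrive together, can be sketched in a few lines (hypothetical function and naming scheme):

```python
def create_pods(task_ids):
    """One new Pod per task; Pods are never shared or reused."""
    return {t: f"{t}-pod" for t in task_ids}

# Three tasks arriving at once produce three concurrent Pods.
pods = create_pods(["print_date", "extract", "load"])
print(len(pods))  # 3 — the cluster scales up to three Pods
```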
Visual Quiz - 3 Questions
Test your understanding
Looking at the execution_table, what is the Pod state at Step 3?
A. Pending
B. Running
C. Succeeded
D. Deleted
💡 Hint
Check the 'Pod State' column at Step 3 in the execution_table.
At which step does the task state change to 'Success'?
A. Step 4
B. Step 6
C. Step 5
D. Step 7
💡 Hint
Look at the 'Task State' column in variable_tracker after Step 5.
If a new task arrives while a Pod is running, what does the Kubernetes executor do?
A. Creates a new Pod for the new task
B. Reuses the existing Pod
C. Queues the task until the Pod finishes
D. Fails the new task
💡 Hint
Refer to concept_flow where each task triggers Pod creation.
Concept Snapshot
The Kubernetes executor runs each Airflow task in its own Pod.
Pods are created dynamically when tasks start and deleted after completion.
This allows Airflow to scale tasks up or down based on workload.
Scheduler sends tasks to executor, executor manages Pods.
Pods isolate tasks and free resources when done.
Full Transcript
The Kubernetes executor in Airflow works by creating a new Pod for each task it receives from the scheduler. When the scheduler sends a task, the executor creates a Pod in the Kubernetes cluster to run that task. The Pod starts in a Pending state, then moves to Running while executing the task command. Once the task finishes successfully, the Pod status changes to Succeeded and the Pod is deleted to free resources. The scheduler updates the task state accordingly. This dynamic creation and deletion of Pods allows Airflow to scale the number of running tasks up or down based on demand, efficiently using cluster resources.