0
0
Apache Airflowdevops~5 mins

Airflow architecture (scheduler, webserver, executor, metadata DB) - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is the role of the Airflow Scheduler?
The Scheduler monitors DAG definitions and triggers tasks to run at the right time. It decides when and what tasks should be executed.
Click to reveal answer
beginner
What does the Airflow Webserver do?
The Webserver provides a user interface to view DAGs, task status, logs, and manage workflows easily through a browser.
Click to reveal answer
beginner
Explain the purpose of the Airflow Executor.
The Executor runs the tasks assigned by the Scheduler. It manages how and where tasks run, such as locally or on a cluster.
Click to reveal answer
beginner
What is stored in the Airflow Metadata Database?
The Metadata Database stores information about DAGs, task status, schedules, and logs. It keeps track of workflow history and state.
Click to reveal answer
intermediate
How do the Scheduler, Webserver, Executor, and Metadata DB work together in Airflow?
The Scheduler decides when tasks run and tells the Executor to run them. The Executor runs tasks and updates the Metadata DB with status. The Webserver shows this info to users.
Click to reveal answer
Which Airflow component is responsible for triggering tasks at scheduled times?
AScheduler
BWebserver
CExecutor
DMetadata Database
Where can you view the status and logs of Airflow tasks?
AExecutor
BWebserver
CScheduler
DMetadata Database
What does the Airflow Executor do?
ADisplays the UI
BStores task logs
CSchedules DAG runs
DRuns tasks assigned by the Scheduler
What kind of data is stored in the Airflow Metadata Database?
ATask execution status and DAG info
BTask execution logs only
CUser interface settings
DTask code files
Which component updates the Metadata Database with task status?
AScheduler
BWebserver
CExecutor
DNone of the above
Describe how the Airflow Scheduler, Executor, Webserver, and Metadata Database interact to run and monitor workflows.
Think of the Scheduler as the planner, Executor as the worker, Metadata DB as the notebook, and Webserver as the dashboard.
You got /4 concepts.
    Explain the purpose of the Airflow Metadata Database and why it is important.
    It is like a diary that remembers everything about your workflows.
    You got /4 concepts.