Why production Airflow needs careful setup - Challenge Your Understanding

🧠 Conceptual
intermediate
Why is a metadata database critical in production Airflow?

In a production Airflow setup, what is the main role of the metadata database?

A. It runs the actual tasks defined in workflows.
B. It stores the state and history of all workflows and tasks.
C. It provides the user interface for monitoring workflows.
D. It schedules the execution of tasks directly.
💡 Hint

Think about where Airflow keeps track of what tasks have run and their results.
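
To make the idea of tracking task runs concrete, here is a minimal sketch of a state store built on Python's standard `sqlite3` module. It is not Airflow's actual schema: the `task_state` table, its columns, and the helper names `make_store`, `set_state`, and `get_state` are all hypothetical and exist only for illustration.

```python
import sqlite3


def make_store():
    """Create an in-memory database with one simplified, hypothetical table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE task_state ("
        "dag_id TEXT, task_id TEXT, run_id TEXT, state TEXT, "
        "PRIMARY KEY (dag_id, task_id, run_id))"
    )
    return conn


def set_state(conn, dag_id, task_id, run_id, state):
    """Record the latest state for one task in one run."""
    conn.execute(
        "INSERT OR REPLACE INTO task_state VALUES (?, ?, ?, ?)",
        (dag_id, task_id, run_id, state),
    )
    conn.commit()


def get_state(conn, dag_id, task_id, run_id):
    """Return the recorded state, or None if the task never reported one."""
    row = conn.execute(
        "SELECT state FROM task_state "
        "WHERE dag_id = ? AND task_id = ? AND run_id = ?",
        (dag_id, task_id, run_id),
    ).fetchone()
    return row[0] if row else None
```

Every component that needs to know whether a task ran would read from a shared store like this one, which is why losing it or letting it become slow affects the whole system.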

Best Practice
intermediate
What is a recommended practice for the Airflow scheduler in production?

Which practice helps ensure the Airflow scheduler runs reliably in production?

A. Run multiple schedulers in parallel with a shared metadata database.
B. Run a single scheduler on a local machine without backups.
C. Use the scheduler only during business hours.
D. Disable the scheduler and trigger tasks manually.
💡 Hint

Consider how to avoid a single point of failure in scheduling.

Troubleshoot
advanced
What happens if Airflow workers lose their connection to the metadata database?

In production, if Airflow workers cannot connect to the metadata database, what is the most likely outcome?

A. The webserver crashes immediately.
B. Airflow automatically switches to a backup database without interruption.
C. The scheduler stops scheduling new tasks but workers continue running.
D. Tasks fail to update their status, causing stuck or orphaned tasks.
💡 Hint

Think about what happens when task state updates cannot be saved.
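
As a rough illustration of the failure mode the hint describes, the sketch below retries a state write a few times with exponential backoff and reports whether it was ever saved. This is a generic pattern, not Airflow's own implementation: `write_with_retry` and its parameters are hypothetical, and `write` stands in for any callable that raises `ConnectionError` when the database is unreachable.

```python
import time


def write_with_retry(write, attempts=3, base_delay=1.0):
    """Call write() until it succeeds or attempts run out.

    Returns True if the write was saved, False if every attempt failed.
    """
    for attempt in range(attempts):
        try:
            write()
            return True
        except ConnectionError:
            # Wait longer after each failure: base_delay, 2x, 4x, ...
            time.sleep(base_delay * (2 ** attempt))
    return False
```

When the function returns `False`, the work itself may have finished, but nothing durable records that fact, so the rest of the system still sees the old state.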

🔀 Workflow
advanced
How can you ensure Airflow DAGs are deployed safely in production?

Which approach best ensures safe deployment of DAGs in a production Airflow environment?

A. Edit DAG files directly on the production server via SSH.
B. Upload DAGs manually through the Airflow web UI.
C. Use a version-controlled repository and CI/CD pipeline to deploy DAGs.
D. Deploy DAGs by copying files randomly to the scheduler machine.
💡 Hint

Consider how to track changes and avoid errors during deployment.
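
One way to catch errors before DAG files reach production is an automated check that runs on every change. The sketch below uses only the Python standard library to flag files that do not even parse. The function name `find_broken_dag_files` and the directory layout are assumptions for illustration; a fuller check would typically also load the files with Airflow's `DagBag` and inspect its `import_errors`.

```python
import ast
from pathlib import Path


def find_broken_dag_files(dag_dir):
    """Return (file name, line number) pairs for files with syntax errors."""
    broken = []
    for path in sorted(Path(dag_dir).rglob("*.py")):
        try:
            # Parse only; this never executes the DAG code.
            ast.parse(path.read_text(encoding="utf-8"), filename=str(path))
        except SyntaxError as exc:
            broken.append((path.name, exc.lineno))
    return broken
```

A deployment step could call this on the DAG folder and stop if the returned list is not empty, so a broken file never reaches the scheduler.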

🧠 Conceptual
expert
Why is resource isolation important for Airflow workers in production?

In a production Airflow setup, why should workers be isolated in separate containers or virtual machines?

A. To prevent one task's failure or resource use from affecting others.
B. To allow all tasks to share the same memory and CPU freely.
C. To reduce the number of machines needed by running all tasks together.
D. To make the Airflow webserver run faster.
💡 Hint

Think about how one task might impact others if not isolated.