Challenge - 5 Problems
Airflow ML Orchestration Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
💻 Command Output
intermediate2:00remaining
Output of Airflow DAG run command
You run the command
airflow dags trigger example_ml_pipeline on your terminal. What is the expected immediate output?Attempts:
2 left
💡 Hint
Think about what happens when you trigger a DAG manually.
✗ Incorrect
Triggering a DAG manually creates a DagRun instance with a timestamp and marks it as externally triggered. If the DAG exists and is valid, this message appears.
🧠 Conceptual
intermediate2:00remaining
Purpose of Airflow Sensors in ML pipelines
In an ML pipeline orchestrated by Airflow, what is the main purpose of a Sensor task?
Attempts:
2 left
💡 Hint
Sensors are special tasks that pause execution until a condition is met.
✗ Incorrect
Sensors wait for external triggers like file availability or database entries before allowing downstream tasks to run, ensuring dependencies are met.
🔀 Workflow
advanced3:00remaining
Correct order of Airflow tasks for ML model deployment
Arrange the following Airflow tasks in the correct order for a typical ML model deployment pipeline:
Attempts:
2 left
💡 Hint
Think about the logical flow from raw data to deployment.
✗ Incorrect
First, data must be preprocessed, then the model is trained on that data, followed by validation, and finally deployment if validation passes.
❓ Troubleshoot
advanced2:30remaining
Diagnosing Airflow task failure due to missing dependencies
An Airflow task in your ML pipeline fails with the error:
TaskInstance failed because upstream task did not complete. What is the most likely cause?Attempts:
2 left
💡 Hint
Check the status of tasks that run before the failing one.
✗ Incorrect
Airflow enforces task dependencies. If an upstream task fails or is skipped, downstream tasks will not run and report this error.
✅ Best Practice
expert3:00remaining
Best practice for managing ML model versioning in Airflow
Which approach is best for managing multiple versions of ML models in an Airflow orchestration pipeline?
Attempts:
2 left
💡 Hint
Think about reproducibility and traceability in ML workflows.
✗ Incorrect
Using version control and Airflow variables allows clear tracking and selection of model versions, improving reproducibility and rollback capability.