Challenge - 5 Problems
Data Drift Detection Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate1:30remaining
Understanding Data Drift
What does data drift mean in the context of machine learning models?
Attempts:
2 left
💡 Hint
Think about how the data the model sees might change after deployment.
✗ Incorrect
Data drift refers to changes in the input data's statistical properties over time. This can cause the model to perform worse because it was trained on different data.
💻 Command Output
intermediate2:00remaining
Detecting Data Drift with a Python Library
Given the following Python code snippet using the
alibi-detect library to detect data drift, what will be the output if the new data distribution is significantly different from the reference data?MLOps
from alibi_detect.cd import KSDrift import numpy as np # Reference data: normal distribution ref_data = np.random.normal(0, 1, 1000) # New data: shifted mean new_data = np.random.normal(2, 1, 1000) cd = KSDrift(ref_data) drift_result = cd.predict(new_data) print(drift_result['data']['is_drift'])
Attempts:
2 left
💡 Hint
KSDrift uses the Kolmogorov-Smirnov test to detect if distributions differ.
✗ Incorrect
The new data has a different mean, so the KS test will detect drift and return True.
❓ Configuration
advanced2:30remaining
Configuring a Data Drift Monitoring Pipeline
Which of the following YAML configurations correctly sets up a data drift monitoring job that runs daily and alerts if drift is detected?
Attempts:
2 left
💡 Hint
Check for correct keys and valid values for alert and threshold.
✗ Incorrect
Option B uses correct keys and values: 'daily' schedule, 'data_drift' type, alert enabled as boolean, and a reasonable threshold.
❓ Troubleshoot
advanced2:00remaining
Troubleshooting False Negatives in Data Drift Detection
A data drift detection system is not alerting even though the input data has clearly changed. Which of the following is the most likely cause?
Attempts:
2 left
💡 Hint
Consider how sensitivity settings affect detection.
✗ Incorrect
If the threshold is too high, small or moderate changes won't trigger alerts, causing false negatives.
🔀 Workflow
expert2:30remaining
Order of Steps in Data Drift Detection Workflow
Arrange the following steps in the correct order for a typical data drift detection workflow.
Attempts:
2 left
💡 Hint
Think about the logical flow from data collection to alerting.
✗ Incorrect
The workflow starts with collecting reference data, then new data, followed by testing, and finally alerting if drift is found.