What is the main cause of training-serving skew in machine learning systems?
Think about what happens if the features used during training are not the same as those used when the model makes predictions.
Training-serving skew happens when the features or their transformations differ between training and serving, causing the model to see different data than it was trained on.
How do feature stores help prevent training-serving skew?
Consider how having one place for features can help keep training and serving consistent.
Feature stores centralize feature definitions and data, ensuring the same features and transformations are used during both training and serving, thus preventing skew.
Which step in a feature store workflow is crucial to ensure no training-serving skew occurs?
Think about how the feature transformations are applied in both training and serving.
Using the same transformation code in both batch and online pipelines ensures that features are consistent, preventing skew.
You notice your model performs well during training but poorly in production. Which of the following is a likely symptom of training-serving skew?
Think about what changes in the data the model sees when deployed.
If the feature distributions differ between training and serving, the model may not generalize well, indicating training-serving skew.
Which practice best ensures that feature values remain consistent and up-to-date in both training and serving environments when using a feature store?
Consider how to keep features synchronized and fresh automatically.
Feature versioning combined with automated pipelines ensures features are consistent, fresh, and reproducible across training and serving.