Experiment - scikit-learn Pipeline
Problem:You have a dataset with numeric features that need scaling before training a model. Currently, you manually scale the data and then train a logistic regression model separately.
Current Metrics:Training accuracy: 95%, Validation accuracy: 80%
Issue:The manual scaling and model training are done separately, which can cause errors and inconsistent preprocessing during prediction. Also, the validation accuracy is much lower than training, indicating possible overfitting.