Experiment - Bag of Words (CountVectorizer)
Problem:We want to classify movie reviews as positive or negative using a simple Bag of Words model with CountVectorizer and a logistic regression classifier.
Current Metrics:Training accuracy: 98%, Validation accuracy: 70%
Issue:The model is overfitting: it performs very well on training data but poorly on validation data.