Experiment - TF-IDF (TfidfVectorizer)
Problem:You want to convert a collection of text documents into numbers that show how important each word is in each document. You are using TF-IDF to do this.
Current Metrics:The current TF-IDF vectorizer uses default settings and creates very large feature vectors with many unimportant words included.
Issue:The model is slow and the vectors are too large because many common words that do not help distinguish documents are included.