Experiment - Why preprocessing cleans raw text
Problem: You have raw text data containing noise such as punctuation, inconsistent capitalization, and extra whitespace. This noise makes it harder for a model to learn useful patterns.
Current Metrics: Text classification accuracy is 65% on training data and 60% on validation data.
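Issue: The model struggles because noise in the raw text can make strings with the same meaning look different. For example, "Hello!" and "hello" may be treated as distinct tokens. This confuses the model and leads to lower accuracy.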
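To make the kinds of noise named above concrete, here is a minimal Python sketch of a cleaning step that lowercases text, strips punctuation, and collapses extra whitespace. The function name `clean_text` and the choice to delete punctuation outright (rather than replace it with spaces) are assumptions for illustration, not part of the original experiment.

```python
import re
import string

# Translation table that deletes every ASCII punctuation character.
_PUNCT_TABLE = str.maketrans("", "", string.punctuation)

def clean_text(text: str) -> str:
    """Hypothetical helper: normalize case, punctuation, and whitespace."""
    text = text.lower()                       # "WORLD" and "world" become the same token
    text = text.translate(_PUNCT_TABLE)       # drop punctuation such as "," and "!"
    text = re.sub(r"\s+", " ", text).strip()  # collapse runs of whitespace, trim the ends
    return text

print(clean_text("  Hello,   WORLD!!  "))  # hello world
```

Deleting punctuation also merges words such as "don't" into "dont". Whether that is acceptable depends on the task, so treat this as a starting point rather than a fixed recipe.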