Recall & Review
beginner
What is the main purpose of preprocessing raw text in NLP?
The main purpose is to clean and prepare the text so that the machine learning model can understand it better and make accurate predictions.
Click to reveal answer
beginner
Name two common preprocessing steps used to clean raw text.
Removing punctuation and converting all text to lowercase are two common preprocessing steps.
Click to reveal answer
beginner
Why do we remove stop words during text preprocessing?
Stop words are common words like 'the', 'is', and 'and' that do not add much meaning. Removing them helps the model focus on important words.
Click to reveal answer
intermediate
How does preprocessing help improve model accuracy?
By cleaning text, removing noise, and standardizing words, preprocessing reduces confusion for the model and helps it learn patterns more clearly.
Click to reveal answer
intermediate
What problems can raw text cause if not preprocessed?
Raw text can have typos, inconsistent capitalization, extra spaces, and irrelevant symbols that confuse the model and lower prediction quality.
Click to reveal answer
Why do we convert text to lowercase during preprocessing?
✗ Incorrect
Converting to lowercase helps the model treat words with different cases as the same word.
What is a stop word in text preprocessing?
✗ Incorrect
Stop words are common words like 'the' or 'and' that usually do not add important meaning.
Which of these is NOT a typical preprocessing step?
✗ Incorrect
Adding random words is not a preprocessing step; it would confuse the model.
How does preprocessing affect machine learning models?
✗ Incorrect
Preprocessing cleans and standardizes text so models can learn patterns more effectively.
What problem can raw text with typos cause?
✗ Incorrect
Typos add noise and confuse the model, reducing prediction quality.
Explain why preprocessing is important for cleaning raw text in NLP.
Think about how messy text can confuse a model.
You got /4 concepts.
List common preprocessing steps used to clean raw text and why each is useful.
Consider how each step simplifies or clarifies the text.
You got /4 concepts.