Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to create a CountVectorizer instance.

ML Python

from sklearn.feature_extraction.text import [1]
vectorizer = [1]()

Drag options to blanks, or click blank then click option'

ALabelEncoder

BTfidfVectorizer

CCountVectorizer

DDictVectorizer

Attempts:

3 left

2fill in blank

medium

Complete the code to transform text data into a count matrix.

ML Python

texts = ['hello world', 'hello machine learning']
count_matrix = vectorizer.[1](texts)

Drag options to blanks, or click blank then click option'

Afit

Bpredict

Ctransform

Dfit_transform

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to create a TF-IDF vectorizer.

ML Python

from sklearn.feature_extraction.text import [1]
tfidf_vectorizer = [1]()

Drag options to blanks, or click blank then click option'

ACountVectorizer

BTfidfVectorizer

CTfidfTransformer

DHashingVectorizer

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a dictionary of word counts for words longer than 3 letters.

ML Python

words = ['data', 'science', 'is', 'fun']
word_counts = {word: [1] for word in words if len(word) [2] 3}

Drag options to blanks, or click blank then click option'

C>=

Dlen(word)

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a TF-IDF matrix from text data.

ML Python

from sklearn.feature_extraction.text import [1]
texts = ['machine learning', 'deep learning', 'machine intelligence']
tfidf = [2]()
matrix = tfidf.[3](texts)

Drag options to blanks, or click blank then click option'

ATfidfVectorizer

BTfidfTransformer

Cfit_transform

DCountVectorizer

Attempts:

3 left

Practice

(1/5)

1. What does CountVectorizer do in text processing?

easy

A. Calculates the importance of words based on frequency and rarity

B. Counts how many times each word appears in the text

C. Removes stop words from the text

D. Converts text into lowercase only

Text feature basics (CountVectorizer, TF-IDF) in ML Python - Interactive Code Practice

Start learning this pattern below

Practice

Solution

Step 1: Understand CountVectorizer's role

Step 2: Differentiate from TF-IDF

Final Answer:

Quick Check:

Solution

Step 1: Recall correct sklearn import path

Step 2: Check syntax correctness

Final Answer:

Quick Check:

Solution

Step 1: Count unique words in sentences

Step 2: Understand shape of output matrix

Final Answer:

Quick Check:

Solution

Step 1: Check method usage for feature names

Step 2: Use updated method

Final Answer:

Quick Check:

Solution

Step 1: Understand the goal of reducing common word impact

Step 2: Identify method that weighs words by importance

Final Answer:

Quick Check: