Complete the code to tokenize the sentence into words using Python.
from nltk.tokenize import word_tokenize

sentence = "Hello world!"
tokens = [1](sentence)
The word_tokenize function from NLTK splits a sentence into word tokens, treating punctuation marks (like the exclamation point) as separate tokens.
Complete the code to convert all tokens to lowercase.
tokens = ['Hello', 'World']
lower_tokens = [[1] for token in tokens]
Using token.lower() converts each word to lowercase, which is common in NLP preprocessing.
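A completed version of this exercise, using only the standard string method:

```python
tokens = ['Hello', 'World']
# Blank [1] is token.lower(); str.lower() returns a lowercased copy
lower_tokens = [token.lower() for token in tokens]
print(lower_tokens)  # ['hello', 'world']
```

Lowercasing before counting or comparing tokens keeps 'Hello' and 'hello' from being treated as different words.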
Fix the error in the code to remove stopwords from the token list.
from nltk.corpus import stopwords

stop_words = set(stopwords.words('english'))
tokens = ['this', 'is', 'a', 'test']
filtered = [word for word in tokens if [1] not in stop_words]
The variable word is the element being checked in the list comprehension to filter out stopwords.
Fill both blanks to create a dictionary of word counts from a list of tokens.
tokens = ['apple', 'banana', 'apple', 'orange', 'banana', 'apple']
word_counts = { [1]: tokens.count([2]) for [1] in set(tokens) }
We use word as the variable name in the dictionary comprehension and count how many times each word appears in tokens.
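A completed version, using only built-in operations:

```python
tokens = ['apple', 'banana', 'apple', 'orange', 'banana', 'apple']
# Blanks [1] and [2] are both word; iterating over set(tokens)
# counts each distinct word exactly once
word_counts = {word: tokens.count(word) for word in set(tokens)}
print(word_counts)  # {'apple': 3, 'banana': 2, 'orange': 1} (key order may vary)
```

Note that tokens.count() rescans the list for every distinct word; for large corpora, collections.Counter(tokens) does the same job in a single pass.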
Fill all three blanks to create a list of stemmed tokens using NLTK's PorterStemmer.
from nltk.stem import PorterStemmer

ps = PorterStemmer()
tokens = ['running', 'jumps', 'easily']
stemmed = [ps.[1](token) for [2] in tokens if len([3]) > 2]
The method to stem words is stem. The loop variable is token, which is also used in the length check.
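A completed version might look like this (PorterStemmer ships with NLTK and needs no extra data download):

```python
from nltk.stem import PorterStemmer

ps = PorterStemmer()
tokens = ['running', 'jumps', 'easily']
# Blank [1] is stem; blanks [2] and [3] are token.
# All three words are longer than 2 characters, so none are filtered out.
stemmed = [ps.stem(token) for token in tokens if len(token) > 2]
print(stemmed)  # ['run', 'jump', 'easili']
```

The Porter algorithm strips suffixes heuristically, so stems like 'easili' are not always dictionary words; use a lemmatizer if you need valid word forms.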