Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to increase the number of partitions of the DataFrame.

Apache Spark

df_repartitioned = df.[1](10)

Drag options to blanks, or click blank then click option'

Arepartition

Bcoalesce

CpartitionBy

DgroupBy

Attempts:

3 left

2fill in blank

medium

Complete the code to reduce the number of partitions without a full shuffle.

Apache Spark

df_coalesced = df.[1](5)

Drag options to blanks, or click blank then click option'

Arepartition

Bcoalesce

CgroupBy

DpartitionBy

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to repartition the DataFrame by a column.

Apache Spark

df_partitioned = df.[1]('category')

Drag options to blanks, or click blank then click option'

Arepartition

Bcoalesce

CpartitionBy

DgroupBy

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a DataFrame with fewer partitions using the efficient method.

Apache Spark

df_small = df.[1]([2])

Drag options to blanks, or click blank then click option'

Acoalesce

Brepartition

C10

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a dictionary mapping words to their lengths for words longer than 3 letters.

Apache Spark

word_counts = {word: [1] for word in words if len(word) [2] 3 and word [3] stop_words}

Drag options to blanks, or click blank then click option'

Alen(word)

Cnot in

Din

Attempts:

3 left