Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to get the number of partitions in the DataFrame.

Apache Spark

num_partitions = df.rdd.[1]

Drag options to blanks, or click blank then click option'

Acount

BgetNumPartitions

Cpartitions

DnumPartitions

Attempts:

3 left

2fill in blank

medium

Complete the code to repartition the DataFrame into 4 partitions.

Apache Spark

df_repart = df.[1](4)

Drag options to blanks, or click blank then click option'

ApartitionBy

Bcoalesce

Crepartition

Dsplit

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to reduce partitions without full shuffle.

Apache Spark

df_less = df.[1](2)

Drag options to blanks, or click blank then click option'

Acoalesce

BpartitionBy

Crepartition

Dsplit

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a dictionary with word lengths for words longer than 3 characters.

Apache Spark

lengths = {word: [1] for word in words if len(word) [2] 3}

Drag options to blanks, or click blank then click option'

Alen(word)

Dword

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a dictionary with uppercase words as keys and their lengths as values for words longer than 4.

Apache Spark

result = { [1]: [2] for w in words if len(w) [3] 4 }

Drag options to blanks, or click blank then click option'

Aw.upper()

Blen(w)

Dw.lower()

Attempts:

3 left