Practice - 5 Tasks

Answer the questions below

1. Fill in the blank

easy

Complete the code to read a Parquet file into a Spark DataFrame.

Apache Spark

df = spark.read.[1]("data.parquet")

Options:

A. parquet

B. csv

C. json

D. text

2. Fill in the blank

medium

Complete the code to write a DataFrame in an efficient columnar format.

Apache Spark

df.write.[1]("output_path")

Options:

A. parquet

B. csv

C. json

D. text

3. Fill in the blank

hard

Complete the code to read a JSON file into a Spark DataFrame.

Apache Spark

df = spark.read.[1]("data.json")

Options:

A. parquet

B. csv

C. text

D. json

4. Fill in the blank

hard

Fill both blanks to build a dictionary that maps each word to its length, keeping only words longer than 3 characters.

Apache Spark

lengths = {word: [1] for word in words if len(word) [2] 3}

Options:

A. len(word)

B. <=

D. word
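
For reference, a dictionary comprehension has the general shape {key: value for item in iterable if condition}. The sketch below illustrates that shape on a different problem (squares of even numbers), so it does not answer the task above; the names numbers and squares_of_evens are hypothetical and chosen only for illustration.

```python
# Hypothetical input, chosen only for illustration.
numbers = [1, 2, 3, 4, 5, 6]

# General shape: {key_expr: value_expr for item in iterable if condition}
# Key is the number, value is its square, and the filter keeps even numbers.
squares_of_evens = {n: n * n for n in numbers if n % 2 == 0}

print(squares_of_evens)  # {2: 4, 4: 16, 6: 36}
```

The expression before the colon becomes the key, the expression after it becomes the value, and the trailing if clause decides which items are kept.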

5. Fill in the blank

hard

Fill all three blanks to build a dictionary with uppercase keys, keeping only entries whose value is greater than zero.

Apache Spark

result = { [1]: [2] for k, v in data.items() if v [3] 0 }

Options:

A. k.upper()
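
A comprehension over dict.items() can transform the key, carry the value through, and filter on the value in one expression. The sketch below shows that pattern on a different problem (trimming whitespace from keys and dropping empty values), so it does not answer the task above; the names raw and cleaned and the sample data are hypothetical.

```python
# Hypothetical input, chosen only for illustration.
raw = {" alice ": "admin", "bob": "", " carol": "viewer"}

# Transform each key with str.strip(), keep the value unchanged,
# and drop entries whose value is an empty string.
cleaned = {name.strip(): role for name, role in raw.items() if role != ""}

print(cleaned)  # {'alice': 'admin', 'carol': 'viewer'}
```

Unpacking each item into two names in the for clause makes both the key and the value available to the key expression, the value expression, and the filter.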
