Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to read a JSON file into a DataFrame.

Apache Spark

df = spark.read.[1]("data.json")

Drag options to blanks, or click blank then click option'

Ajson

Bparquet

Ctext

Dcsv

Attempts:

3 left

2fill in blank

medium

Complete the code to select the nested field 'address.city' from the DataFrame.

Apache Spark

df.select("[1]")

Drag options to blanks, or click blank then click option'

Aaddress

Bcity

Caddress.city

Daddress[city]

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to explode the nested array field 'phones'.

Apache Spark

from pyspark.sql.functions import [1]
df.select(explode(df.phones)).show()

Drag options to blanks, or click blank then click option'

Aexplode

Bflatten

Ccollect_list

Darray

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a dictionary of word lengths for words longer than 3 characters.

Apache Spark

lengths = {word: [1] for word in words if [2]

Drag options to blanks, or click blank then click option'

Alen(word)

Blen(word) > 3

Cword.startswith('a')

Dword.isalpha()

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a dictionary with uppercase keys and values greater than 0.

Apache Spark

result = [1]: [2] for k, v in data.items() if v [3] 0}

Drag options to blanks, or click blank then click option'

Ak.upper()

Dk.lower()

Attempts:

3 left