Practice - 5 Tasks

Answer the questions below

1fill in blank

easy

Complete the code to create a Spark DataFrame from a list.

Apache Spark

data = [(1, 'Alice'), (2, 'Bob')]
spark_df = spark.createDataFrame([1], ['id', 'name'])

Drag options to blanks, or click blank then click option'

Adata

Bsc

Clist

Drdd

Attempts:

3 left

2fill in blank

medium

Complete the code to apply a filter transformation on the DataFrame.

Apache Spark

filtered_df = spark_df.filter(spark_df['id'] [1] 1)

Drag options to blanks, or click blank then click option'

A<=

B==

C!=

Attempts:

3 left

3fill in blank

hard

Fix the error in the code to trigger the lazy evaluation and show the results.

Apache Spark

result = filtered_df.[1]()

Drag options to blanks, or click blank then click option'

Ashow

Bfilter

Cselect

Dmap

Attempts:

3 left

4fill in blank

hard

Fill both blanks to create a new DataFrame with selected columns and trigger computation.

Apache Spark

selected_df = spark_df.[1]('name')
selected_df.[2]()

Drag options to blanks, or click blank then click option'

Aselect

Bshow

Cfilter

Dcount

Attempts:

3 left

5fill in blank

hard

Fill all three blanks to create a filtered DataFrame, select a column, and count the rows.

Apache Spark

filtered = spark_df.filter(spark_df['id'] [1] 1)
selected = filtered.[2]('name')
row_count = selected.[3]()

Drag options to blanks, or click blank then click option'

Bselect

Ccount

Dfilter

Attempts:

3 left