Practice - 5 Tasks

Answer the questions below

1. Fill in the blank

easy

Complete the code to read a batch file from HDFS.

Hadoop

hadoop fs -cat /data/batch/[1]

A. input.txt

B. streaming

C. realtime

D. live

2. Fill in the blank

medium

Complete the code to start a real-time data stream using Flume.

Hadoop

flume-ng agent --conf conf --conf-file [1] --name agent1 -Dflume.root.logger=INFO,console

A. data.csv

B. batch.conf

C. input.txt

D. stream.conf

3. Fill in the blank

hard

Fix the error in the command to run a batch MapReduce job.

Hadoop

hadoop jar wordcount.jar WordCount [1] /output

A. /output

B. /input/batch

C. /input/streaming

D. /tmp

4. Fill in the blank

hard

Fill both blanks to create a dictionary comprehension that keeps only the batch files larger than 100 MB.

Hadoop

large_files = {file: size for file, size in files.items() if size [1] [2]}

B. 100

D. 50
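
As a reminder of the general pattern, a filtering dictionary comprehension has the shape {key: value for key, value in mapping.items() if condition}. The sketch below uses hypothetical paths, sizes, and a hypothetical threshold that are not part of this task; it only illustrates the syntax.

```python
# Hypothetical mapping of HDFS paths to sizes in megabytes (illustrative values only).
file_sizes_mb = {
    "/data/batch/a.txt": 12,
    "/data/batch/b.txt": 250,
    "/data/batch/c.txt": 64,
}

# Hypothetical threshold, chosen only for this sketch.
THRESHOLD_MB = 64

# Keep only the entries whose size is at or below the threshold.
small_files = {
    path: size
    for path, size in file_sizes_mb.items()
    if size <= THRESHOLD_MB
}

print(small_files)
```

The condition after `if` is evaluated once per key/value pair, and only the pairs for which it is true end up in the new dictionary.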

5. Fill in the blank

hard

Fill all three blanks to build a dictionary of the active real-time streams and their statuses.

Hadoop

active_streams = {stream[1]: status for stream, status in streams.items() if status [2] 'active' and stream.startswith([3])}

A. .upper()

B. ==

C. 'rt_'

D. !=
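
A comprehension can transform the key and combine several conditions with `and` in a single expression. The sketch below shows that shape on a hypothetical job table; the names, states, and suffix are assumptions made up for illustration and are not the answer to this task.

```python
# Hypothetical job table: job name -> current state (illustrative values only).
jobs = {
    "ingest_job": "running",
    "clean_job": "failed",
    "report_task": "running",
    "export_job": "queued",
}

# Transform each key (underscores to hyphens) and keep an entry only when
# both conditions hold: the state is not "failed" and the name ends in "_job".
healthy_jobs = {
    name.replace("_", "-"): state
    for name, state in jobs.items()
    if state != "failed" and name.endswith("_job")
}

print(healthy_jobs)
```

The string test runs on the original key, because the transformation in the key position is applied only to entries that pass the filter.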
