
Input splits and data locality in Hadoop - Interactive Code Practice

Practice - 5 Tasks
Answer the questions below
Task 1: fill in the blank (easy)

Complete the code to create input splits for a file in Hadoop.

Java (Hadoop)
FileInputFormat.addInputPath(job, new Path([1]));
Options:
A. job
B. "inputfile.txt"
C. "/user/data/input"
D. new Path()
Common Mistakes
Passing the job object instead of a path string.
Not using quotes around the path string.
Task 2: fill in the blank (medium)

Complete the code to get the number of input splits from the job context.

Java (Hadoop)
List<InputSplit> splits = inputFormat.getSplits([1]);
Options:
A. job
B. context
C. configuration
D. inputFormat
Common Mistakes
Passing the inputFormat object itself.
Passing an unrelated variable like context.
Task 3: fill in the blank (hard)

Fix the error in the code to ensure data locality is considered when processing splits.

Java (Hadoop)
for (InputSplit split : splits) {
    String[] locations = split.getLocations();
    if (locations.length > [1]) {
        System.out.println("Data is local to node: " + locations[0]);
    }
}
Options:
A. 1
B. 0
C. -1
D. locations.length
Common Mistakes
Checking if length > 1 which skips cases with exactly one location.
Using negative numbers in the condition.
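The locality check above can be sketched with plain Java, using hard-coded String arrays in place of the values InputSplit.getLocations() would return (the node names are invented for illustration):

```java
// Illustrative sketch only: String[][] stands in for per-split results
// of InputSplit.getLocations(); node names are invented.
public class LocalityDemo {
    public static void main(String[] args) {
        String[][] splitLocations = {
            { "node1", "node2" },  // split with two known replica locations
            {}                     // split with no known locations
        };
        for (String[] locations : splitLocations) {
            // Compare against 0, not 1: a split with exactly one
            // known location is still local to that node.
            if (locations.length > 0) {
                System.out.println("Data is local to node: " + locations[0]);
            } else {
                System.out.println("No locality information for this split");
            }
        }
    }
}
```

The strict `> 0` comparison is the point of the task: `> 1` would silently skip splits that have exactly one replica location.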
Task 4: fill in the blank (hard)

Fill both blanks to create a map of split locations and their corresponding split sizes.

Java (Hadoop)
Map<String, Long> splitSizes = new HashMap<>();
for (InputSplit split : splits) {
    String[] locations = split.getLocations();
    long size = split.[1]();
    for (String loc : locations) {
        splitSizes.put(loc, splitSizes.getOrDefault(loc, [2]) + size);
    }
}
Options:
A. getLength
B. 0L
C. getSize
D. null
Common Mistakes
Using getSize(), which is not an InputSplit method; the correct method is getLength().
Using null as the default value, which throws a NullPointerException when the sum is unboxed on the first accumulation.
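The aggregation pattern above can be sketched in plain Java, with hard-coded arrays standing in for InputSplit.getLocations() and InputSplit.getLength() (node names and sizes are invented):

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative sketch only: the arrays stand in for per-split results
// of InputSplit.getLocations() and getLength(); values are invented.
public class SplitSizeAggregation {
    public static void main(String[] args) {
        String[][] locations = { { "node1", "node2" }, { "node1" } };
        long[] sizes = { 100L, 50L };

        Map<String, Long> splitSizes = new HashMap<>();
        for (int i = 0; i < locations.length; i++) {
            for (String loc : locations[i]) {
                // 0L as the default avoids a NullPointerException when
                // the missing value would otherwise be unboxed as null.
                splitSizes.put(loc, splitSizes.getOrDefault(loc, 0L) + sizes[i]);
            }
        }
        System.out.println("node1=" + splitSizes.get("node1")); // node1=150
        System.out.println("node2=" + splitSizes.get("node2")); // node2=100
    }
}
```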
Task 5: fill in the blank (hard)

Fill all three blanks to filter input splits larger than 128MB and print their first location.

Java (Hadoop)
for (InputSplit split : splits) {
    if (split.[1]() > [2]) {
        String[] locs = split.[3]();
        if (locs.length > 0) {
            System.out.println("Large split at: " + locs[0]);
        }
    }
}
Options:
A. getLength
B. 134217728
C. getLocations
D. getSize
Common Mistakes
Using getSize(), which is not a valid InputSplit method; getLength() returns the split size in bytes.
Using the wrong byte value for 128 MB (128 × 1024 × 1024 = 134217728).
Confusing getLocations() with other methods.
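The 128 MB threshold works out to 128 × 1024 × 1024 = 134217728 bytes. A minimal sketch of the filter, with invented split lengths standing in for InputSplit.getLength():

```java
// Illustrative sketch only: hard-coded lengths stand in for
// per-split results of InputSplit.getLength(); values are invented.
public class LargeSplitFilter {
    public static void main(String[] args) {
        long threshold = 128L * 1024 * 1024; // 128 MB = 134217728 bytes
        long[] splitLengths = { 200_000_000L, 64_000_000L, 134_217_728L };
        for (long length : splitLengths) {
            // Strictly greater than the threshold, matching the quiz
            // condition, so a split of exactly 128 MB is not reported.
            if (length > threshold) {
                System.out.println("Large split of " + length + " bytes");
            }
        }
    }
}
```

Only the 200,000,000-byte split passes the filter here: 64,000,000 is below the threshold, and 134,217,728 equals it exactly.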