Challenge - 5 Problems
MapReduce Word Count Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Predict Output
intermediate2:00remaining
Output of the Mapper function in Word Count
Given the following Mapper code snippet in Hadoop MapReduce for word count, what is the output for the input line:
"hello world hello"?Hadoop
public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(Object key, Text value, Context context) throws IOException, InterruptedException { StringTokenizer itr = new StringTokenizer(value.toString()); while (itr.hasMoreTokens()) { word.set(itr.nextToken()); context.write(word, one); } } }
Attempts:
2 left
💡 Hint
The Mapper outputs each word with a count of 1 for every occurrence.
✗ Incorrect
The Mapper emits each word separately with a count of 1. It does not aggregate counts; that happens in the Reducer.
❓ Predict Output
intermediate2:00remaining
Output of the Reducer function in Word Count
Given the following Reducer code snippet in Hadoop MapReduce for word count, what is the output for the input key
"hello" and values [1, 1, 1]?Hadoop
public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> { private IntWritable result = new IntWritable(); public void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException { int sum = 0; for (IntWritable val : values) { sum += val.get(); } result.set(sum); context.write(key, result); } }
Attempts:
2 left
💡 Hint
The Reducer sums all counts for the same word.
✗ Incorrect
The Reducer adds all values for the key 'hello': 1+1+1 = 3.
❓ data_output
advanced2:00remaining
Final output of Word Count MapReduce job
Suppose the input text file contains the lines:
What is the final output of the Word Count MapReduce job?
hello worldhello hadoopWhat is the final output of the Word Count MapReduce job?
Attempts:
2 left
💡 Hint
Count how many times each word appears in all lines combined.
✗ Incorrect
The word 'hello' appears twice, 'world' once, and 'hadoop' once.
🔧 Debug
advanced2:00remaining
Identify the error in this Mapper code
What error will this Mapper code produce when run?
public static class TokenizerMapper extends Mapper
Attempts:
2 left
💡 Hint
Check punctuation carefully in Java code.
✗ Incorrect
The line
word.set(itr.nextToken()) is missing a semicolon at the end, causing a syntax error.🚀 Application
expert3:00remaining
Choosing the correct Combiner function for Word Count
Which of the following Combiner implementations correctly optimizes the Word Count MapReduce job by reducing data transfer between Mapper and Reducer?
Attempts:
2 left
💡 Hint
The Combiner must have the same input and output types as the Reducer.
✗ Incorrect
Option A correctly sums IntWritable values and outputs Text and IntWritable, matching Mapper and Reducer types. Other options have type mismatches or incorrect class inheritance.