Bird
0
0

Given the following configuration snippet:

medium📝 Predict Output Q4 of 15
Hadoop - Performance Tuning
Given the following configuration snippet:
mapreduce.task.io.sort.mb=100
mapreduce.task.io.sort.factor=10

What effect does increasing mapreduce.task.io.sort.mb have on the MapReduce job?
ALimits the number of map tasks running simultaneously
BDecreases the number of reduce tasks
CIncreases the memory allocated to reduce tasks
DIncreases the buffer size for sorting map outputs, reducing spills
Step-by-Step Solution
Solution:
  1. Step 1: Understand map output sorting buffer

    mapreduce.task.io.sort.mb sets the size of the buffer used to sort map outputs before spilling to disk.
  2. Step 2: Effect of increasing buffer size

    Larger buffer reduces the number of spills to disk, improving performance.
  3. Final Answer:

    Increases the buffer size for sorting map outputs, reducing spills -> Option D
  4. Quick Check:

    mapreduce.task.io.sort.mb controls map output buffer size [OK]
Quick Trick: Bigger sort buffer means fewer spills and better map performance [OK]
Common Mistakes:
  • Confusing with reduce task memory
  • Thinking it controls task concurrency
  • Assuming it changes reduce task count

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More Hadoop Quizzes