beginner

What is the first step in the MapReduce job execution flow?

The first step is Job Submission, where the client submits the job to the Hadoop cluster, including the code and input data location.

Click to reveal answer

intermediate

What role does the JobTracker play in MapReduce execution?

The JobTracker manages the job by splitting it into tasks, assigning them to TaskTrackers, and monitoring their progress.

Click to reveal answer

beginner

Explain the function of the Mapper in MapReduce.

The Mapper processes input data splits and produces intermediate key-value pairs for further processing.

Click to reveal answer

intermediate

What happens during the Shuffle and Sort phase?

Intermediate data from Mappers is shuffled and sorted by keys to prepare for the Reducer, ensuring all values for a key are grouped together.

Click to reveal answer

beginner

Describe the final step in the MapReduce job execution flow.

The final step is Job Completion, where the Reducers write output to HDFS, and the JobTracker reports job status back to the client.

Click to reveal answer

What component assigns tasks to nodes in a Hadoop cluster?

ANameNode

BJobTracker

CDataNode

DResourceManager

During which phase are intermediate key-value pairs grouped by key?

AJob submission

BMap phase

CReduce phase

DShuffle and Sort phase

What does the Mapper output in MapReduce?

AIntermediate key-value pairs

BInput splits

CJob status

DFinal results

Where is the final output of a MapReduce job stored?

AHDFS

BLocal disk

CJobTracker memory

DMapper output buffer

Which phase immediately follows the Map phase?

AJob submission

BReduce phase

CShuffle and Sort phase

DJob completion

Describe the complete flow of a MapReduce job from submission to completion.

Explain the purpose of the Shuffle and Sort phase in MapReduce.