Challenge - 5 Problems
Block Storage and Replication Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate2:00remaining
Understanding HDFS Block Replication Factor
In Hadoop Distributed File System (HDFS), what happens if the replication factor of a file is set to 1?
Attempts:
2 left
💡 Hint
Think about what replication factor means for data copies.
✗ Incorrect
A replication factor of 1 means there is only one copy of each block, stored on a single DataNode. No additional copies exist for fault tolerance.
❓ data_output
intermediate2:00remaining
Calculating Number of Blocks for a File
Given a file size of 450 MB and HDFS block size of 128 MB, how many blocks will HDFS create to store this file?
Attempts:
2 left
💡 Hint
Divide file size by block size and round up.
✗ Incorrect
450 MB divided by 128 MB equals about 3.52, so HDFS creates 4 blocks to store the entire file.
❓ Predict Output
advanced2:00remaining
HDFS Block Replication Command Output
What is the output of the following HDFS command?
hdfs fsck /user/data/file.txt -files -blocks -racks
Hadoop
Sample output snippet: Status: HEALTHY Total size: 2560 B Total blocks: 2 (avg. block size 1280 B) Minimally replicated blocks: 2 Over-replicated blocks: 0 Under-replicated blocks: 0 Block replica on datanode1:50010 Block replica on datanode2:50010
Attempts:
2 left
💡 Hint
Look at the replication status lines carefully.
✗ Incorrect
The output shows 2 blocks with minimal replication and no under or over replication, indicating healthy block storage.
❓ visualization
advanced2:00remaining
Visualizing Block Distribution Across DataNodes
You have a file split into 3 blocks with replication factor 3. Which visualization best represents the block distribution across 3 DataNodes?
Attempts:
2 left
💡 Hint
Replication factor 3 means 3 copies of each block on different nodes.
✗ Incorrect
With replication factor 3, each block has 3 copies on different DataNodes, ensuring fault tolerance and balanced storage.
🔧 Debug
expert2:00remaining
Diagnosing Under-Replication in HDFS
You notice that the HDFS report shows some blocks as under-replicated. Which of the following is the most likely cause?
Attempts:
2 left
💡 Hint
Think about what causes missing replicas in a distributed system.
✗ Incorrect
Under-replication usually happens when some DataNodes holding replicas are offline or unreachable, reducing the number of available copies.