
hard📝 Application Q8 of 15
Hadoop - Cluster Administration
You need to plan a Hadoop cluster to process 100 TB of data with a replication factor of 3. If each node has 10 TB of usable disk space, how many nodes are required at minimum?
A. 30 nodes
B. 300 nodes
C. 10 nodes
D. 33 nodes
Step-by-Step Solution
Solution:
  1. Step 1: Calculate total storage needed with replication

    Data size 100 TB * replication factor 3 = 300 TB total storage needed.
  2. Step 2: Calculate nodes needed

    Each node has 10 TB usable space, so nodes = 300 TB / 10 TB = 30 nodes minimum.
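  3. Step 3: Add headroom

    300 / 10 = 30 exactly, so no rounding up is needed. A cluster filled to 100% leaves no room for intermediate job output or growth, so this solution adds a 10% buffer (an assumption the question does not state): 30 * 1.1 = 33 nodes.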
  4. Final Answer:

    33 nodes -> Option D
  5. Quick Check:

    Nodes = (Data * Replication) / Disk = (100 TB * 3) / 10 TB = 30; with a 10% buffer, 30 * 1.1 = 33 [OK]
Quick Trick: Multiply data by replication, divide by disk, add buffer [OK]
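The arithmetic above can be sketched as a small Python helper. This is only an illustration: the function name `min_nodes` and the `buffer_percent` parameter are hypothetical, and the 10% buffer is the solution's own assumption rather than a Hadoop default. Integer math is used so that the 10% buffer case does not round up by accident through floating-point error.

```python
def min_nodes(data_tb: int, replication: int, disk_per_node_tb: int,
              buffer_percent: int = 0) -> int:
    """Smallest whole number of nodes that can hold the replicated data.

    buffer_percent is extra headroom on top of the replicated data
    (hypothetical parameter; the quiz solution assumes 10).
    """
    # Total storage once every block is stored `replication` times.
    raw_tb = data_tb * replication
    # Scale by (100 + buffer_percent) / 100 using integers only.
    numerator = raw_tb * (100 + buffer_percent)
    denominator = disk_per_node_tb * 100
    # Ceiling division: a partially filled node still counts as a node.
    return -(-numerator // denominator)


print(min_nodes(100, 3, 10))                     # storage-only minimum
print(min_nodes(100, 3, 10, buffer_percent=10))  # with the assumed 10% buffer
```

With the question's numbers, the first call gives the storage-only figure (30) and the second gives the buffered figure (33) that the solution selects.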
Common Mistakes:
  • Ignoring replication factor
  • Not adding buffer nodes
  • Dividing data size by disk without replication
