0
0
Hadoopdata~20 mins

HDFS high availability in Hadoop - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
HDFS High Availability Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
2:00remaining
Understanding the Role of NameNodes in HDFS High Availability

In an HDFS high availability setup, what is the primary role of the standby NameNode?

AIt keeps a synchronized copy of the namespace and takes over if the active NameNode fails.
BIt remains passive and does not synchronize with the active NameNode.
CIt actively manages the file system namespace and serves client requests.
DIt stores data blocks and handles data replication.
Attempts:
2 left
💡 Hint

Think about what happens when the active NameNode stops working.

🧠 Conceptual
intermediate
2:00remaining
Quorum Journal Manager in HDFS HA

What is the purpose of the Quorum Journal Manager (QJM) in HDFS high availability?

AIt balances the load between multiple NameNodes.
BIt stores data blocks across DataNodes for fault tolerance.
CIt manages the edit logs by replicating them across multiple JournalNodes to ensure consistency between active and standby NameNodes.
DIt handles client authentication and authorization.
Attempts:
2 left
💡 Hint

Consider how edit logs are shared between NameNodes.

data_output
advanced
2:30remaining
Analyzing Failover Logs in HDFS HA

Given the following simplified log snippet from an HDFS cluster during failover, what is the final state of the NameNodes?

2024-04-01 10:00:00 Active NameNode started
2024-04-01 10:05:00 Standby NameNode synchronized
2024-04-01 10:10:00 Active NameNode failed
2024-04-01 10:10:05 Standby NameNode transitioned to active
2024-04-01 10:15:00 New standby NameNode started
AThe original active NameNode is still active; standby is inactive.
BThe standby NameNode became active after failover; a new standby started.
CBoth NameNodes are active simultaneously, causing split-brain.
DNo NameNode is active after the failover.
Attempts:
2 left
💡 Hint

Look for the transition events after the active NameNode failure.

🔧 Debug
advanced
2:30remaining
Identifying the Cause of Split-Brain in HDFS HA

In an HDFS high availability setup, a split-brain scenario occurred where both NameNodes became active simultaneously. Which misconfiguration below most likely caused this?

AThe standby NameNode is not running.
BThe client is using an outdated Hadoop version.
CDataNodes are not reporting block status to NameNodes.
DJournalNodes are not configured or unreachable, so edit logs are not shared.
Attempts:
2 left
💡 Hint

Think about what ensures only one NameNode is active at a time.

🚀 Application
expert
3:00remaining
Designing a Resilient HDFS HA Architecture

You are tasked with designing an HDFS high availability cluster for a critical application. Which combination of components and configurations below ensures maximum resilience and zero downtime during failover?

ATwo NameNodes (active and standby), three JournalNodes for quorum, and automatic failover enabled with ZooKeeper.
BOne NameNode with frequent checkpoints and manual failover procedures.
CTwo NameNodes both active simultaneously with no JournalNodes, relying on DataNode heartbeats.
DMultiple DataNodes with replication factor 1 and a single NameNode.
Attempts:
2 left
💡 Hint

Consider components that coordinate failover automatically and maintain consistent state.