Recall & Review
beginner
What is the main goal of backup in Hadoop?
The main goal of backup in Hadoop is to create copies of important data to protect it from loss or corruption, so it can be restored if needed.
Click to reveal answer
beginner
Define disaster recovery in the context of Hadoop.
Disaster recovery in Hadoop means having a plan and tools to restore data and services quickly after a failure or disaster, ensuring minimal downtime.
Click to reveal answer
intermediate
Name two common backup methods used in Hadoop.
Two common backup methods in Hadoop are: 1) DistCp (distributed copy) to copy data between clusters, and 2) Snapshots to capture the state of data at a point in time.
Click to reveal answer
intermediate
How do Hadoop snapshots help in disaster recovery?
Hadoop snapshots save the exact state of the file system at a moment, allowing quick rollback to that state if data is lost or corrupted.
Click to reveal answer
beginner
Why is it important to test your disaster recovery plan regularly?
Testing ensures the plan works correctly, helps find problems early, and trains the team to respond quickly during a real disaster.
Click to reveal answer
What Hadoop tool is commonly used to copy large amounts of data between clusters for backup?
✗ Incorrect
DistCp is designed for distributed copying of large data sets between Hadoop clusters.
What does a Hadoop snapshot capture?
✗ Incorrect
Snapshots capture the exact state of the file system at a specific time.
Which of the following is NOT a goal of disaster recovery?
✗ Incorrect
Increasing data duplication is not a direct goal; the focus is on recovery and minimizing impact.
Why should backup data be stored in a different location?
✗ Incorrect
Storing backups offsite protects data if the main site is damaged.
What is a key benefit of regularly testing your disaster recovery plan?
✗ Incorrect
Testing helps find and fix problems before a real disaster occurs.
Explain how Hadoop snapshots and DistCp work together to support backup and disaster recovery.
Think about capturing data state and copying it safely.
You got /4 concepts.
Describe why having a disaster recovery plan is critical for Hadoop data environments.
Consider what happens if data or services suddenly stop working.
You got /4 concepts.