Bird
0
0

A data scientist reports that their experimental datasets stored in the sandbox zone of a Hadoop data lake are missing after a scheduled maintenance. What is the most probable cause?

medium📝 Debug Q7 of 15
Hadoop - Modern Data Architecture with Hadoop
A data scientist reports that their experimental datasets stored in the sandbox zone of a Hadoop data lake are missing after a scheduled maintenance. What is the most probable cause?
ASandbox data was deleted as part of routine cleanup since it is temporary
BCurated zone policies automatically moved sandbox data to processed zone
CRaw data ingestion failed causing sandbox data loss
DData was overwritten by automated backup restoration
Step-by-Step Solution
Solution:
  1. Step 1: Understand sandbox purpose

    Sandbox is for temporary, experimental data and often cleaned up regularly.
  2. Step 2: Analyze maintenance impact

    Routine maintenance may include deleting sandbox data to free space.
  3. Step 3: Exclude other options

    Curated policies do not move sandbox data; ingestion failure affects raw zone; backup restoration does not overwrite sandbox.
  4. Final Answer:

    Sandbox data was deleted as part of routine cleanup since it is temporary -> Option A
  5. Quick Check:

    Sandbox = temporary, often cleaned [OK]
Quick Trick: Sandbox data is temporary and often deleted [OK]
Common Mistakes:
  • Assuming sandbox data is permanent
  • Confusing sandbox with curated or raw zones
  • Blaming ingestion or backup for sandbox loss

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More Hadoop Quizzes