Bird
0
0

You want to design a data lake architecture using Hadoop that centralizes data from multiple sources including CSV files, images, and logs. Which approach best supports this goal?

hard📝 Application Q15 of 15
Hadoop - Modern Data Architecture with Hadoop
You want to design a data lake architecture using Hadoop that centralizes data from multiple sources including CSV files, images, and logs. Which approach best supports this goal?
AStore all data types in Hadoop HDFS without converting formats, enabling flexible access
BConvert all data to a single format before storing in separate folders in HDFS
CStore only structured data in HDFS and keep unstructured data outside the lake
DUse multiple storage systems for each data type and link them with Hadoop
Step-by-Step Solution
Solution:
  1. Step 1: Understand data lake centralization goal

    Data lakes centralize all data types in one place to allow flexible storage and easy access.
  2. Step 2: Evaluate storage approaches

    Store all data types in Hadoop HDFS without converting formats, enabling flexible access stores all data types in Hadoop HDFS without forcing format conversion, supporting centralization and flexibility. Other options separate data or limit types, which breaks centralization.
  3. Final Answer:

    Store all data types in Hadoop HDFS without converting formats, enabling flexible access -> Option A
  4. Quick Check:

    Centralize all data types in one system = B [OK]
Quick Trick: Keep all data types together in HDFS for true centralization [OK]
Common Mistakes:
  • Converting all data to one format losing flexibility
  • Separating unstructured data outside the lake
  • Using multiple storage systems breaking centralization

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More Hadoop Quizzes