Challenge - 5 Problems
Data Storage Mastery
Get all challenges correct to earn this badge!
Test your skills under time pressure!
π§ Conceptual
intermediate2:00remaining
Key difference in data storage format
Which option correctly describes the primary difference in data storage format between a data warehouse and a data lake?
Attempts:
2 left
π‘ Hint
Think about how data is organized and processed in each system.
β Incorrect
Data warehouses organize data in structured tables optimized for queries. Data lakes keep raw data as-is, including unstructured formats.
β Architecture
intermediate2:00remaining
Choosing between data warehouse and data lake for analytics
A company wants to perform fast business reporting on cleaned sales data and also explore raw customer interaction logs for insights. Which architecture best supports both needs?
Attempts:
2 left
π‘ Hint
Consider the strengths of each system for different data types and use cases.
β Incorrect
Data lakes handle raw, unstructured data well for exploration. Data warehouses excel at fast queries on cleaned, structured data.
β scaling
advanced2:00remaining
Scaling storage for growing data volumes
A companyβs data volume is growing rapidly with diverse data types. Which scaling approach best suits a data lake compared to a data warehouse?
Attempts:
2 left
π‘ Hint
Think about cost and flexibility of storage expansion in each system.
β Incorrect
Data lakes use low-cost storage that can grow easily. Data warehouses often need costly compute and storage upgrades to scale.
β tradeoff
advanced2:00remaining
Tradeoff between data governance and flexibility
Which statement best describes the tradeoff between data governance and flexibility when choosing between a data warehouse and a data lake?
Attempts:
2 left
π‘ Hint
Consider how each system manages data quality and access controls.
β Incorrect
Data warehouses have structured schemas and controls for governance. Data lakes allow raw data but need extra effort to govern properly.
β estimation
expert2:00remaining
Estimating query performance impact
A data warehouse and a data lake both store 10 TB of data. Which factor most impacts query performance difference between them?
Attempts:
2 left
π‘ Hint
Think about how data organization affects how fast queries run.
β Incorrect
Data warehouses optimize queries with indexes and schemas. Data lakes often require scanning large raw datasets, slowing queries.