Recall & Review
beginner
What is a data warehouse?
A data warehouse is a system used to store structured data from multiple sources. It organizes data for easy analysis and reporting, often using tables and schemas.
Click to reveal answer
beginner
What is a data lake?
A data lake is a storage system that holds large amounts of raw data in its original format, including structured, semi-structured, and unstructured data.
Click to reveal answer
intermediate
Which type of data storage is best for fast, complex queries on cleaned data?
Data warehouses are best for fast, complex queries on cleaned and organized data because they use schemas and indexing.
Click to reveal answer
intermediate
Why might a company choose a data lake over a data warehouse?
A company might choose a data lake to store all types of data in raw form, allowing flexibility for future analysis and machine learning without upfront structuring.
Click to reveal answer
intermediate
What is a key difference in data processing between data warehouses and data lakes?
Data warehouses process data before storing it (ETL), while data lakes store raw data first and process it later (ELT).
Click to reveal answer
Which system stores data in its raw, original format?
✗ Incorrect
Data lakes store raw data in its original format, unlike data warehouses which store processed data.
What type of data is primarily stored in a data warehouse?
✗ Incorrect
Data warehouses store structured data that is cleaned and organized for analysis.
Which process is commonly used before loading data into a data warehouse?
✗ Incorrect
ETL means data is extracted, transformed (cleaned and structured), then loaded into the warehouse.
Which system is more flexible for storing different data types?
✗ Incorrect
Data lakes can store structured, semi-structured, and unstructured data, making them more flexible.
Which system is optimized for fast analytical queries on cleaned data?
✗ Incorrect
Data warehouses are designed for fast, complex queries on cleaned and organized data.
Explain the main differences between a data warehouse and a data lake.
Think about data format, processing, and query speed.
You got /6 concepts.
Describe scenarios where a company might prefer a data lake over a data warehouse.
Consider flexibility and data variety.
You got /5 concepts.