Recall & Review
beginner
What is a data lake in simple terms?
A data lake is a big storage place where all kinds of data are kept together in their original form, like a big digital lake holding water from many streams.
Click to reveal answer
beginner
Why does a data lake centralize data?
Because it collects all data from different sources into one place, making it easier to find, use, and analyze without moving data around.
Click to reveal answer
beginner
How does centralizing data in a data lake help businesses?
It helps businesses by giving them one place to access all their data quickly, which saves time and helps make better decisions.
Click to reveal answer
intermediate
What role does Hadoop play in a data lake architecture?
Hadoop provides the technology to store and manage huge amounts of data in a data lake, allowing easy access and processing.
Click to reveal answer
beginner
What types of data can be stored in a data lake?
All types: structured (like tables), semi-structured (like logs), and unstructured (like videos or emails) data can be stored together.
Click to reveal answer
What is the main reason data lakes centralize data?
✗ Incorrect
Data lakes centralize data to keep it all in one place, making it easier to access and analyze.
Which technology is commonly used to build data lakes?
✗ Incorrect
Hadoop is a popular technology for storing and managing large data lakes.
What types of data can a data lake store?
✗ Incorrect
Data lakes can store all types of data, including structured, semi-structured, and unstructured.
How does centralizing data in a data lake benefit data analysis?
✗ Incorrect
Centralizing data makes it easier and faster to analyze because all data is in one place.
Which of these is NOT a benefit of data lake centralization?
✗ Incorrect
Data duplication is reduced, not increased, by centralizing data in a data lake.
Explain in your own words why data lake architecture centralizes data.
Think about how having all your files in one folder helps you find them faster.
You got /4 concepts.
Describe how Hadoop supports the centralization of data in a data lake.
Imagine Hadoop as the big warehouse that keeps all your data safe and ready to use.
You got /4 concepts.