Recall & Review
beginner
What is Hadoop?
Hadoop is a free software framework that helps store and process very large data sets across many computers in a simple and efficient way.
Click to reveal answer
beginner
What are the main components of Hadoop?
The main parts are HDFS (Hadoop Distributed File System) for storing data, and MapReduce for processing data in parallel.
Click to reveal answer
beginner
Why do we use Hadoop?
We use Hadoop to handle huge amounts of data that normal computers cannot process easily, by spreading the work across many machines.
Click to reveal answer
beginner
What is HDFS in Hadoop?
HDFS is a system that splits big files into smaller pieces and saves them on many computers to keep data safe and easy to access.
Click to reveal answer
beginner
What does MapReduce do in Hadoop?
MapReduce breaks down data tasks into small parts, processes them on many computers at once, and then combines the results.
Click to reveal answer
What is the main purpose of Hadoop?
✗ Incorrect
Hadoop is designed to handle big data by distributing storage and processing across many machines.
Which component of Hadoop stores data?
✗ Incorrect
HDFS (Hadoop Distributed File System) is responsible for storing data in Hadoop.
What does MapReduce do?
✗ Incorrect
MapReduce processes data by splitting tasks and running them on many computers at the same time.
Why is Hadoop useful for big data?
✗ Incorrect
Hadoop handles big data by distributing storage and processing across multiple machines.
Which of these is NOT a Hadoop component?
✗ Incorrect
SQL Server is a database system, not part of Hadoop.
Explain what Hadoop is and why it is important for handling big data.
Think about how big data is too large for one computer.
You got /4 concepts.
Describe the roles of HDFS and MapReduce in Hadoop.
One part saves data, the other works on data.
You got /4 concepts.