Concept Flow - What is Hadoop
Start
Input Data
Split Data into Blocks
Store Blocks on Cluster Nodes
Process Data Blocks in Parallel
Combine Results
Output Final Result
End
Hadoop takes big data, splits it, stores it across many computers, processes pieces at the same time, then combines results.