Concept Flow - Batch vs real-time ingestion
Data arrives
Collect data
Store in HDFS
Run jobs later
Generate reports
Data available for analysis
Data can be collected in large chunks (batch) or processed immediately as it arrives (real-time). Both feed data for analysis but differ in timing and tools.