Concept Flow - What is Apache Spark
Start: Data Input
Spark Core: Distribute Data
Transformations: Map, Filter, etc.
Actions: Collect, Count, Save
Output: Results or Files
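Apache Spark reads input data, splits it into partitions spread across the machines in a cluster, applies transformations (such as map and filter) to change or analyze it, and then runs actions (such as collect, count, or save) that return results or write output files.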
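The flow above can be sketched in plain Python. This is a toy model and not Spark's API: `ToyDataset`, `from_items`, and `_run_partition` are hypothetical names invented for illustration, and the "computers" are just lists in one process. It assumes a round-robin split, which is not necessarily how Spark partitions data. The method names `map`, `filter`, `collect`, and `count` mirror the ones Spark exposes, and the sketch imitates one real Spark behavior: transformations are only recorded, and nothing runs until an action is called.

```python
class ToyDataset:
    """Hypothetical stand-in for a distributed dataset (not a Spark class)."""

    def __init__(self, partitions, steps=()):
        self.partitions = partitions  # one list per pretend worker
        self.steps = tuple(steps)     # recorded transformations, not yet run

    @classmethod
    def from_items(cls, items, num_partitions):
        # Data Input + Distribute Data: split items round-robin (an assumption).
        items = list(items)
        return cls([items[i::num_partitions] for i in range(num_partitions)])

    # Transformations: record the step and return a new dataset. No work yet.
    def map(self, fn):
        return ToyDataset(self.partitions, self.steps + (("map", fn),))

    def filter(self, predicate):
        return ToyDataset(self.partitions, self.steps + (("filter", predicate),))

    def _run_partition(self, part):
        # Apply every recorded step to a single partition, in order.
        for kind, fn in self.steps:
            if kind == "map":
                part = [fn(x) for x in part]
            else:
                part = [x for x in part if fn(x)]
        return part

    # Actions: trigger the recorded steps and bring results back.
    def collect(self):
        results = []
        for part in self.partitions:
            results.extend(self._run_partition(part))
        return results

    def count(self):
        return sum(len(self._run_partition(part)) for part in self.partitions)


data = ToyDataset.from_items(range(1, 11), num_partitions=3)
evens_of_squares = data.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

collected = evens_of_squares.collect()  # action: runs map, then filter
total = evens_of_squares.count()        # action: runs the steps again

print(sorted(collected), total)
```

Because each partition is processed on its own, the collected results come back grouped by partition and not in the original input order, which is why the example sorts them before printing.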