Concept Flow - What is an RDD (Resilient Distributed Dataset)
Start: Data in Cluster
Create RDD from Data
RDD is Distributed
RDD is Immutable
Apply Transformations
Apply Actions
Results Computed and Returned
This flow shows how data is loaded into an RDD, which is distributed and immutable, then transformations and actions are applied to get results.