Process Flow - Data pipelines with DVC
Define stages in dvc.yaml
Run dvc repro to execute the pipeline
DVC compares each stage's dependencies and outputs against dvc.lock to find stages that changed
Execute the command of each changed stage; skip unchanged stages
Cache outputs and record their hashes in dvc.lock
Commit dvc.yaml and dvc.lock with Git; DVC tracks the data files themselves
Repeat: modify data/code -> dvc repro -> track changes
This flow shows how a DVC data pipeline works: you define stages, DVC executes the ones that are out of date, tracks their outputs, and records the resulting pipeline state in dvc.lock.
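
The check-then-run loop in this flow can be illustrated with a small Python sketch. It is not DVC's implementation, only a toy model of the idea: each stage has a command, dependencies, and outputs; a lock file stores content hashes; a stage runs only when its recorded state differs from what is on disk. The names file_md5, stage_state, repro, and demo are hypothetical, a JSON file stands in for dvc.lock, and the stages are assumed to be listed in dependency order.

```python
import hashlib
import json
import subprocess
import sys
import tempfile
from pathlib import Path


def file_md5(path):
    """Return the MD5 hex digest of a file, or None if it does not exist."""
    if not path.exists():
        return None
    return hashlib.md5(path.read_bytes()).hexdigest()


def stage_state(stage, root):
    """Snapshot a stage: its command plus hashes of all deps and outs."""
    return {
        "cmd": stage["cmd"],
        "deps": {d: file_md5(root / d) for d in stage["deps"]},
        "outs": {o: file_md5(root / o) for o in stage["outs"]},
    }


def repro(stages, root, lock_path):
    """Run stale stages in order; return the names of the stages that ran."""
    lock = json.loads(lock_path.read_text()) if lock_path.exists() else {}
    ran = []
    for name, stage in stages.items():  # assumption: dependency order
        if lock.get(name) == stage_state(stage, root):
            continue  # recorded state matches the files on disk: skip
        subprocess.run(stage["cmd"], cwd=root, check=True)
        lock[name] = stage_state(stage, root)  # record the new hashes
        ran.append(name)
    lock_path.write_text(json.dumps(lock, indent=2))
    return ran


# Toy two-stage pipeline (hypothetical): sort a file, then count its lines.
SORT_CMD = (
    "from pathlib import Path; "
    "lines = Path('raw.txt').read_text().splitlines(True); "
    "Path('sorted.txt').write_text(''.join(sorted(lines)))"
)
COUNT_CMD = (
    "from pathlib import Path; "
    "n = len(Path('sorted.txt').read_text().splitlines()); "
    "Path('count.txt').write_text(str(n))"
)


def demo():
    """Run the pipeline three times and report which stages ran each time."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "raw.txt").write_text("3\n1\n2\n")
        stages = {
            "sort": {"cmd": [sys.executable, "-c", SORT_CMD],
                     "deps": ["raw.txt"], "outs": ["sorted.txt"]},
            "count": {"cmd": [sys.executable, "-c", COUNT_CMD],
                      "deps": ["sorted.txt"], "outs": ["count.txt"]},
        }
        lock_path = root / "lock.json"
        first = repro(stages, root, lock_path)   # nothing recorded yet
        second = repro(stages, root, lock_path)  # nothing changed
        # Reorder the input: raw.txt changes, but sorted.txt will not.
        (root / "raw.txt").write_text("1\n2\n3\n")
        third = repro(stages, root, lock_path)
        return first, second, third


first, second, third = demo()
print(first, second, third)
```

In this sketch the first run executes both stages, the second executes none, and the third executes only sort: reordering raw.txt changes its hash, but sorted.txt comes out byte-identical, so count is still up to date. Comparing content hashes instead of timestamps is what makes that last skip possible.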