Data versioning is about tracking changes in datasets over time. The key metric here is data consistency and reproducibility. This means ensuring that the exact data used to train a model can be retrieved later to reproduce results. Unlike model accuracy, this metric is about traceability and integrity of data versions, not prediction quality.
Why? Because if data changes without tracking, model results can't be trusted or repeated. DVC helps keep data versions organized and linked to model versions, so you always know which data produced which results.