0
0
MLOpsdevops~20 mins

DVC (Data Version Control) basics in MLOps - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
DVC Mastery Badge
Get all challenges correct to earn this badge!
Test your skills under time pressure!
💻 Command Output
intermediate
2:00remaining
What is the output of dvc init command?
You run dvc init in a new project folder. What output message do you expect to see?
MLOps
dvc init
A
Initialized DVC repository.

You can now track data files with 'dvc add'.
BError: dvc command not found
Cfatal: not a git repository (or any of the parent directories): .git
DWarning: DVC is already initialized in this directory
Attempts:
2 left
💡 Hint
Think about what happens when you start DVC in a fresh folder.
🧠 Conceptual
intermediate
2:00remaining
What does dvc add do in a project?
Choose the best description of what dvc add data.csv does.
AIt deletes <code>data.csv</code> from the project and replaces it with a link.
BIt uploads <code>data.csv</code> to a remote cloud storage automatically.
CIt tracks the file <code>data.csv</code> by creating a .dvc file and adding the file to DVC cache.
DIt converts <code>data.csv</code> into a Git commit.
Attempts:
2 left
💡 Hint
Think about how DVC tracks large files without storing them directly in Git.
🔀 Workflow
advanced
3:00remaining
Order the steps to track a new data file with DVC and push it to remote storage
Put these commands in the correct order to track a file dataset.csv and push it to remote storage.
A2,1,3,4
B1,2,3,4
C1,3,2,4
D3,2,1,4
Attempts:
2 left
💡 Hint
Remember to add the .dvc file to Git before committing, then push data with DVC.
Troubleshoot
advanced
2:00remaining
What error occurs if you run dvc push without configuring remote storage?
You run dvc push but forgot to set up remote storage with dvc remote add. What error message do you get?
MLOps
dvc push
AWarning: No data files to push
Bfatal: not a git repository (or any of the parent directories): .git
CSyntaxError: invalid syntax
DERROR: failed to push data to remote storage: no remote storage configured
Attempts:
2 left
💡 Hint
Think about what DVC needs before pushing data.
Best Practice
expert
3:00remaining
Which practice is best for managing large datasets with DVC in a team?
Choose the best practice to ensure smooth collaboration when using DVC to manage large datasets.
AUse <code>dvc add</code> to track data files, commit .dvc files to Git, and push data to a shared remote storage.
BStore data files only locally and do not track them with DVC.
CAvoid using remote storage and share data files manually via email or USB drives.
DCommit large data files directly to Git to keep everything in one place.
Attempts:
2 left
💡 Hint
Think about how DVC separates data tracking and storage for collaboration.