What is MLOps - Complexity Analysis
We want to understand how the time needed to run MLOps tasks changes as the amount of data or models grows.
How does the work increase when we add more machine learning models or data?
Analyze the time complexity of the following code snippet.
for model in models:
preprocess(data)
train(model, data)
evaluate(model, data)
This code runs preprocessing, training, and evaluation for each machine learning model on the same data.
Identify the loops, recursion, array traversals that repeat.
- Primary operation: Loop over each model to train and evaluate.
- How many times: Once per model in the list.
As the number of models grows, the total work grows roughly the same amount.
| Input Size (n) | Approx. Operations |
|---|---|
| 10 models | 10 times the work |
| 100 models | 100 times the work |
| 1000 models | 1000 times the work |
Pattern observation: Doubling the number of models doubles the work needed.
Time Complexity: O(n)
This means the time grows directly in proportion to the number of models you process.
[X] Wrong: "Adding more models won't affect the time much because data stays the same."
[OK] Correct: Each model requires its own training and evaluation, so more models mean more total work.
Understanding how work grows with more models helps you explain and plan machine learning pipelines clearly and confidently.
"What if we preprocess the data only once before the loop? How would the time complexity change?"