When optimizing costs in machine learning, the key metric is cost per prediction or total operational cost. This includes compute time, memory use, and energy consumption. We want to reduce these costs while keeping model accuracy acceptable. Metrics like inference latency and model size also matter because smaller, faster models cost less to run.
Cost optimization strategies in Agentic Ai - Model Metrics & Evaluation
Cost optimization is not about classification accuracy but about balancing cost and performance. However, a confusion matrix can help understand if cost savings hurt accuracy.
Confusion Matrix Example:
-------------------------
| | Pred Pos | Pred Neg |
|---------|----------|----------|
| True Pos| 80 | 20 |
| True Neg| 10 | 90 |
-------------------------
Total samples = 200
Accuracy = (80 + 90) / 200 = 85%
If cost optimization reduces model size but accuracy drops from 85% to 70%, the tradeoff may be too high.
Cost optimization often means using simpler models that may reduce precision or recall. For example:
- High precision but low recall: The model is careful and only predicts positive when very sure, reducing false alarms but missing some real positives.
- High recall but low precision: The model catches most positives but also has many false alarms, increasing cost in manual checks.
Choosing a balance depends on cost impact. For fraud detection, missing fraud (low recall) is costly, so prioritize recall even if cost rises.
Good: A model that reduces cost per prediction by 30% while keeping accuracy above 80% and inference time low.
Bad: A model that cuts cost by 50% but accuracy falls below 60%, causing many wrong decisions and extra manual work.
- Accuracy paradox: Lower cost models may seem good if only accuracy is checked, ignoring increased errors.
- Data leakage: Optimizing cost on leaked data can give false confidence.
- Overfitting indicators: Very low cost but perfect training accuracy may mean the model won't generalize.
- Ignoring latency: A cheap model that is slow can increase overall cost.
Your model has 98% accuracy but only 12% recall on fraud cases. Is it good for production? Why or why not?
Answer: No, it is not good. Even though accuracy is high, the model misses 88% of fraud cases (low recall), which is very costly. For fraud detection, high recall is critical to catch most frauds.
