What is AUC Score in Machine Learning in Python
AUC stands for Area Under the Curve and measures how well a classification model distinguishes between classes. In Python, you can calculate it with roc_auc_score from sklearn.metrics, which returns a value between 0 and 1 indicating model performance.
How It Works
The AUC score measures the ability of a model to separate positive and negative classes. Imagine you are sorting apples and oranges. A perfect sorter would always pick apples correctly and never mistake an orange for an apple. The AUC score tells you how good your model is at this sorting task.
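The sorting analogy has a precise statistical reading: AUC is the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one. A minimal sketch of that pairwise-ranking view (the labels and scores below are made up for illustration, not from any dataset):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Toy labels (1 = positive) and model scores, invented for illustration
y_true = np.array([0, 0, 1, 1, 0, 1])
y_scores = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.7])

# AUC = probability that a random positive outscores a random negative;
# tied scores count as half a win
pos = y_scores[y_true == 1]
neg = y_scores[y_true == 0]
pairs = [(p > n) + 0.5 * (p == n) for p in pos for n in neg]
manual_auc = np.mean(pairs)

print(manual_auc)                       # pairwise ranking estimate
print(roc_auc_score(y_true, y_scores))  # sklearn computes the same value
```

Here 8 of the 9 positive-negative pairs are ranked correctly, so both lines print about 0.889.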
It is based on the ROC curve, which plots the true positive rate against the false positive rate at different thresholds. The area under this curve (AUC) shows the overall skill of the model. A score of 1 means perfect sorting, 0.5 means random guessing, and below 0.5 means worse than guessing.
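To see the curve behind the number, the sketch below uses sklearn's roc_curve to get the true and false positive rates at each threshold, then integrates TPR over FPR with the trapezoidal rule; the area it computes matches roc_auc_score. The labels and scores are again invented for illustration:

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

# Toy labels and scores, invented for illustration
y_true = np.array([0, 0, 1, 1, 0, 1, 0, 1])
y_scores = np.array([0.2, 0.6, 0.55, 0.9, 0.1, 0.7, 0.4, 0.65])

# roc_curve sweeps the thresholds and returns the FPR/TPR at each one
fpr, tpr, thresholds = roc_curve(y_true, y_scores)

# Integrating TPR over FPR (trapezoidal rule) gives the area under the curve
area = np.trapz(tpr, fpr)

print(area)
print(roc_auc_score(y_true, y_scores))  # same value
```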
Example
This example shows how to calculate the AUC score for a simple binary classification using sklearn in Python.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Create a simple dataset
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

# Split into train and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Train a logistic regression model
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# Predict probabilities for the positive class
y_probs = model.predict_proba(X_test)[:, 1]

# Calculate AUC score
auc = roc_auc_score(y_test, y_probs)
print(f"AUC score: {auc:.3f}")
When to Use
Use the AUC score when you want to evaluate how well your classification model ranks positive cases higher than negative ones, especially when classes are imbalanced. It is useful in medical diagnosis, fraud detection, and any case where distinguishing between two groups is important.
Unlike accuracy, AUC focuses on the model's ability to rank predictions rather than just counting correct labels, making it more reliable when the cost of false positives and false negatives differs.
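A quick sketch of that difference, assuming a hypothetical 95/5 class split: a degenerate model that always predicts the majority class still looks strong by accuracy, but its constant scores carry no ranking information, so AUC drops to chance level.

```python
import numpy as np
from sklearn.metrics import accuracy_score, roc_auc_score

# Hypothetical imbalanced labels: 95 negatives, 5 positives
y_true = np.array([0] * 95 + [1] * 5)

# A model that predicts "negative" for everything still gets 95% accuracy
y_pred = np.zeros(100)
print(accuracy_score(y_true, y_pred))  # 0.95

# But constant scores cannot rank positives above negatives: AUC = 0.5
y_scores = np.zeros(100)
print(roc_auc_score(y_true, y_scores))  # 0.5
```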
Key Points
- AUC score ranges from 0 to 1, with 1 being perfect and 0.5 meaning random guessing.
- It measures the model's ability to rank positive samples higher than negative ones.
- Calculated from the ROC curve, which plots true positive rate vs false positive rate.
- Useful for imbalanced datasets and when ranking quality matters more than exact labels.