What is Early stopping implementation in PyTorch?

PyTorchml~5 mins

Early stopping implementation in PyTorch

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Introduction

Early stopping helps stop training a model when it stops improving, saving time and avoiding overfitting.

When training a model and you want to avoid wasting time after it stops getting better.

When you want to prevent the model from learning noise in the training data.

When you have limited computing resources and want to save energy.

When you want to get the best model based on validation performance.

When training deep learning models that can easily overfit.

Syntax

PyTorch

class EarlyStopping:
    def __init__(self, patience=5, min_delta=0):
        self.patience = patience
        self.min_delta = min_delta
        self.counter = 0
        self.best_loss = None
        self.early_stop = False

    def __call__(self, val_loss):
        if self.best_loss is None:
            self.best_loss = val_loss
        elif val_loss > self.best_loss - self.min_delta:
            self.counter += 1
            if self.counter >= self.patience:
                self.early_stop = True
        else:
            self.best_loss = val_loss
            self.counter = 0

patience: number of times validation loss can fail to improve before stopping.

min_delta: minimum change to qualify as an improvement.

Examples

Create early stopping that waits 3 bad epochs and requires at least 0.01 improvement.

PyTorch

early_stopping = EarlyStopping(patience=3, min_delta=0.01)

Call early stopping with current validation loss to check if training should stop.

PyTorch

early_stopping(val_loss)
if early_stopping.early_stop:
    print('Stop training')

Sample Model

This code trains a simple linear model to fit y=2x with noise. It uses early stopping to stop training if validation loss does not improve for 5 epochs by at least 0.001.

PyTorch

import torch
import torch.nn as nn
import torch.optim as optim

# Simple model
class SimpleNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(1, 1)
    def forward(self, x):
        return self.linear(x)

# EarlyStopping class
class EarlyStopping:
    def __init__(self, patience=3, min_delta=0):
        self.patience = patience
        self.min_delta = min_delta
        self.counter = 0
        self.best_loss = None
        self.early_stop = False

    def __call__(self, val_loss):
        if self.best_loss is None:
            self.best_loss = val_loss
        elif val_loss > self.best_loss - self.min_delta:
            self.counter += 1
            if self.counter >= self.patience:
                self.early_stop = True
        else:
            self.best_loss = val_loss
            self.counter = 0

# Data: y = 2x + noise
x_train = torch.unsqueeze(torch.linspace(-1, 1, 100), dim=1)
y_train = 2 * x_train + 0.1 * torch.randn(x_train.size())

x_val = torch.unsqueeze(torch.linspace(-1, 1, 20), dim=1)
y_val = 2 * x_val + 0.1 * torch.randn(x_val.size())

model = SimpleNet()
criterion = nn.MSELoss()
optimizer = optim.SGD(model.parameters(), lr=0.1)
early_stopping = EarlyStopping(patience=5, min_delta=0.001)

for epoch in range(100):
    model.train()
    optimizer.zero_grad()
    outputs = model(x_train)
    loss = criterion(outputs, y_train)
    loss.backward()
    optimizer.step()

    model.eval()
    with torch.no_grad():
        val_outputs = model(x_val)
        val_loss = criterion(val_outputs, y_val)

    print(f'Epoch {epoch+1}, Training Loss: {loss.item():.4f}, Validation Loss: {val_loss.item():.4f}')

    early_stopping(val_loss.item())
    if early_stopping.early_stop:
        print(f'Early stopping at epoch {epoch+1}')
        break

OutputSuccess

Important Notes

Early stopping monitors validation loss to decide when to stop training.

Choosing patience too low may stop training too early; too high may waste time.

min_delta helps ignore tiny changes that are not meaningful improvements.

Summary

Early stopping stops training when validation loss stops improving.

It helps save time and avoid overfitting.

You set patience and min_delta to control how sensitive it is.

Practice

(1/5)

1. What is the main purpose of early stopping in PyTorch training?

easy

A. To increase the training batch size automatically

B. To stop training when validation loss stops improving

C. To save the model weights after every epoch

D. To shuffle the training data before each epoch

Early stopping implementation in PyTorch

Start learning this pattern below

Practice

Solution

Step 1: Understand early stopping concept

Step 2: Identify the correct purpose

Final Answer:

Quick Check:

Solution

Step 1: Check parameter names and values

Step 2: Match correct argument order and names

Final Answer:

Quick Check:

Solution

Step 1: Track validation loss improvements

Step 2: Apply patience logic

Step 3: Check code behavior

Final Answer:

Quick Check:

Solution

Step 1: Analyze loop order

Step 2: Correct order for early stopping check

Final Answer:

Quick Check:

Solution

Step 1: Understand patience and min_delta roles

Step 2: Match requirement to parameters

Final Answer:

Quick Check: