What is Feature extraction strategy in PyTorch?

PyTorchml~5 mins

Feature extraction strategy in PyTorch

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Introduction

Feature extraction helps us use important parts of data to teach a model faster and better. It saves time and improves results by focusing on useful information.

When you want to use a pre-trained model to get useful data features without training from scratch.

When you have limited data but want to use knowledge from a bigger dataset.

When you want to speed up training by freezing some parts of the model.

When you want to improve model accuracy by using strong features from another model.

When you want to reduce the size of input data by extracting key features.

Syntax

PyTorch

import torch
import torchvision.models as models

# Load a pre-trained model
model = models.resnet18(pretrained=True)

# Freeze all layers to prevent training
for param in model.parameters():
    param.requires_grad = False

# Replace the final layer to match your task
model.fc = torch.nn.Linear(model.fc.in_features, num_classes)

# Now only the final layer will be trained

Freezing layers means their weights won't change during training.

Replacing the final layer adapts the model to your specific problem.

Examples

Freeze all layers and change the last layer for 10 classes.

PyTorch

model = models.resnet18(pretrained=True)
for param in model.parameters():
    param.requires_grad = False
model.fc = torch.nn.Linear(model.fc.in_features, 10)

Freeze only feature layers in VGG16 and change classifier for 5 classes.

PyTorch

model = models.vgg16(pretrained=True)
for param in model.features.parameters():
    param.requires_grad = False
model.classifier[6] = torch.nn.Linear(4096, 5)

Sample Model

This code loads a pre-trained ResNet18, freezes all layers, replaces the last layer for 3 classes, and runs dummy data through it. It prints the output shape and how many parameters will be trained (should be 1 layer).

PyTorch

import torch
import torchvision.models as models
import torch.nn as nn

# Number of classes for new task
num_classes = 3

# Load pre-trained ResNet18
model = models.resnet18(pretrained=True)

# Freeze all layers
for param in model.parameters():
    param.requires_grad = False

# Replace final fully connected layer
model.fc = nn.Linear(model.fc.in_features, num_classes)

# Create dummy input (batch size 2, 3 color channels, 224x224 image)
dummy_input = torch.randn(2, 3, 224, 224)

# Get output predictions
output = model(dummy_input)

# Print output shape and requires_grad status of parameters
print(f"Output shape: {output.shape}")
trainable_params = [p for p in model.parameters() if p.requires_grad]
print(f"Number of trainable parameters: {len(trainable_params)}")

OutputSuccess

Important Notes

Freezing layers helps keep learned features and reduces training time.

Only the replaced final layer's parameters require gradients and will update during training.

Use dummy inputs with correct shape to test model output before training.

Summary

Feature extraction uses pre-trained models to get useful data features.

Freeze layers to keep their knowledge and train only new parts.

Replace the final layer to fit your specific task.

Practice

(1/5)

1. What is the main purpose of using a pre-trained model for feature extraction in PyTorch?

easy

A. To replace the optimizer with a new one

B. To use learned features from a large dataset and avoid training from scratch

C. To train all layers from random weights

D. To increase the size of the dataset automatically

Feature extraction strategy in PyTorch

Start learning this pattern below

Practice

Solution

Step 1: Understand feature extraction concept

Step 2: Identify the main benefit

Final Answer:

Quick Check:

Solution

Step 1: Freeze all layers by setting requires_grad to false

Step 2: Replace the final layer with a new one to train

Final Answer:

Quick Check:

Solution

Step 1: Understand model modification

Step 2: Know ResNet18 feature size

Final Answer:

Quick Check:

Solution

Step 1: Check freezing timing

Step 2: Verify optimizer behavior

Final Answer:

Quick Check:

Solution

Step 1: Understand freezing impact

Step 2: Fine-tune some deeper layers

Final Answer:

Quick Check: