Learning rate selection in Computer Vision - ML Experiment: Train & Evaluate

Experiment - Learning rate selection

Problem: Train a simple convolutional neural network (CNN) on the CIFAR-10 dataset to classify images into 10 categories.

Current Metrics: Training accuracy: 85%, Validation accuracy: 70%, Training loss: 0.45, Validation loss: 1.2

Issue: The model is not learning well because of a poorly chosen learning rate. Validation accuracy (70%) lags training accuracy (85%), and validation loss (1.2) is well above training loss (0.45), which points to overfitting or unstable training.

Your Task

Find a better learning rate that improves validation accuracy to at least 80% and reduces validation loss below 0.8.

Do not change the model architecture.

Do not change the dataset or preprocessing.

Adjust only the learning rate and, if needed, the number of training epochs.
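One common way to approach this task is to try a handful of learning rates spaced evenly on a log scale and keep the one with the lowest validation loss. The sketch below shows only that selection logic. The helper names `log_spaced`, `sweep`, and `toy_val_loss` are hypothetical, and `toy_val_loss` is a made-up stand-in for real training, not a CIFAR-10 result.

```python
import math


def log_spaced(low, high, num):
    """Return `num` learning rates evenly spaced on a log10 scale, low to high."""
    lo, hi = math.log10(low), math.log10(high)
    step = (hi - lo) / (num - 1)
    return [10 ** (lo + i * step) for i in range(num)]


def sweep(candidates, train_and_validate):
    """Evaluate each candidate and return (best_lr, {lr: val_loss})."""
    results = {lr: train_and_validate(lr) for lr in candidates}
    best_lr = min(results, key=results.get)
    return best_lr, results


def toy_val_loss(lr):
    # Stand-in for real training, NOT CIFAR-10 results: a bowl in log10(lr)
    # whose minimum was placed at 1e-3 purely for illustration.
    return (math.log10(lr) + 3) ** 2


candidates = log_spaced(1e-5, 1e-1, num=5)
best_lr, results = sweep(candidates, toy_val_loss)
for lr, loss in results.items():
    print(f"lr={lr:.0e}  toy validation loss={loss:.2f}")
print(f"best candidate: {best_lr:.0e}")
```

In a real run, `train_and_validate` would presumably rebuild and compile the same CNN with `Adam(learning_rate=lr)`, call `model.fit`, and return the lowest value in `history.history['val_loss']`. Rebuilding the model for each candidate keeps the comparisons independent.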

Solution

import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.datasets import cifar10
from tensorflow.keras.utils import to_categorical

# Load and preprocess data
(X_train, y_train), (X_test, y_test) = cifar10.load_data()
X_train, X_test = X_train / 255.0, X_test / 255.0

y_train_cat = to_categorical(y_train, 10)
y_test_cat = to_categorical(y_test, 10)

# Define CNN model
model = models.Sequential([
    layers.Conv2D(32, (3,3), activation='relu', input_shape=(32,32,3)),
    layers.MaxPooling2D((2,2)),
    layers.Conv2D(64, (3,3), activation='relu'),
    layers.MaxPooling2D((2,2)),
    layers.Conv2D(64, (3,3), activation='relu'),
    layers.Flatten(),
    layers.Dense(64, activation='relu'),
    layers.Dense(10, activation='softmax')
])

# Compile with improved learning rate
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
              loss='categorical_crossentropy',
              metrics=['accuracy'])

# Train model
history = model.fit(X_train, y_train_cat, epochs=20, batch_size=64, validation_split=0.2, verbose=2)

# Evaluate on test data
test_loss, test_acc = model.evaluate(X_test, y_test_cat, verbose=0)

print(f'Test accuracy: {test_acc:.2f}, Test loss: {test_loss:.2f}')

Set the learning rate to 0.001 (which is also the Keras default for Adam) in place of a value that was too high or too low (e.g., 0.01 or 0.0001), for better convergence.

Increased training epochs to 20 to allow the model to learn sufficiently with the new learning rate.

Results Interpretation

Before: Training accuracy 85%, Validation accuracy 70%, Validation loss 1.2

After: Training accuracy 90%, Validation accuracy 82%, Validation loss 0.75

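Choosing the right learning rate is crucial. Too high a learning rate can make training unstable, so the loss oscillates or diverges and accuracy suffers. Too low a learning rate slows convergence, so the model may still be underfitting when training stops. A balanced learning rate lets the model converge steadily and generalize better.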
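The three regimes can be seen on a toy problem that is much simpler than a CNN. The sketch below runs plain gradient descent (not Adam) on f(w) = w², whose gradient is 2w. The function name `final_loss` and the three learning rates are illustrative choices, not values from the experiment above.

```python
def final_loss(lr, steps=20, w=1.0):
    """Run plain gradient descent on f(w) = w**2 and return the final loss."""
    for _ in range(steps):
        grad = 2.0 * w        # derivative of w**2
        w = w - lr * grad     # step size is scaled by the learning rate
    return w ** 2


# Starting loss is 1.0 (w = 1.0) in every case.
for label, lr in [("too high", 1.1), ("too low", 0.001), ("balanced", 0.1)]:
    print(f"{label:>8} (lr={lr}): loss after 20 steps = {final_loss(lr):.6f}")
```

Each update multiplies w by (1 − 2·lr). With lr = 1.1 that factor has magnitude greater than 1, so the loss grows. With lr = 0.001 the factor is just below 1, so the loss barely moves. With lr = 0.1 it shrinks quickly.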

Bonus Experiment

Try using a learning rate scheduler that reduces the learning rate during training to improve accuracy further.

💡 Hint

Use the Keras ReduceLROnPlateau callback (tf.keras.callbacks.ReduceLROnPlateau) to lower the learning rate when validation loss stops improving.
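To make the plateau rule concrete, here is a simplified plain-Python sketch of the idea behind the callback: if validation loss has not improved for `patience` epochs, multiply the learning rate by `factor`, never going below `min_lr`. It leaves out the real callback's `min_delta` and `cooldown` options, and the validation losses are invented for illustration.

```python
def reduce_on_plateau(val_losses, initial_lr=1e-3, factor=0.5,
                      patience=2, min_lr=1e-5):
    """Return the learning rate used in each epoch under a simplified plateau rule."""
    lr = initial_lr
    best = float("inf")
    wait = 0                      # epochs since the last improvement
    schedule = []
    for loss in val_losses:
        schedule.append(lr)       # learning rate in effect for this epoch
        if loss < best:
            best = loss
            wait = 0
        else:
            wait += 1
            if wait >= patience:  # plateau detected: shrink the learning rate
                lr = max(lr * factor, min_lr)
                wait = 0
    return schedule


# Invented validation losses, only to exercise the rule.
val_losses = [1.20, 0.95, 0.90, 0.91, 0.92, 0.85, 0.86, 0.87]
schedule = reduce_on_plateau(val_losses)
for epoch, (loss, lr) in enumerate(zip(val_losses, schedule), start=1):
    print(f"epoch {epoch}: val_loss={loss:.2f}  lr={lr:g}")
```

With Keras, the equivalent would be along the lines of passing `callbacks=[tf.keras.callbacks.ReduceLROnPlateau(monitor='val_loss', factor=0.5, patience=2, min_lr=1e-5)]` to `model.fit`. The `factor`, `patience`, and `min_lr` values here are assumptions to tune, not recommendations from the original solution.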

Practice

(1/5)

What does the learning rate control in training a computer vision model?

easy

A. The number of layers in the model

B. The size of the input images

C. How fast the model updates its knowledge

D. The type of activation function used

Which of the following is the correct way to set a learning rate of 0.01 using PyTorch's SGD optimizer?

import torch.optim as optim
optimizer = optim.SGD(model.parameters(), lr=___)

easy

A. 0.01

B. 0.1

C. "0.01"

D. learning_rate

Consider this training loop snippet for a vision model:

learning_rate = 0.5
for epoch in range(3):
    loss = train_one_epoch(model, data, learning_rate)
    print(f"Epoch {epoch+1} loss: {loss:.2f}")

If the learning rate is too high, what is the most likely output behavior?

medium

A. Loss becomes zero immediately

B. Loss stays constant

C. Loss steadily decreases each epoch

D. Loss fluctuates or increases wildly

Given this code snippet, identify the error related to learning rate usage:

optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
for epoch in range(5):
    loss = train(model, data)
    optimizer.step()
    optimizer.zero_grad()

medium

A. optimizer.step() called before loss.backward()

B. Learning rate is too high for Adam optimizer

C. optimizer.zero_grad() should be called after optimizer.step()

D. Learning rate should be set inside the loop

Solution

Step 1: Understand the role of learning rate

Step 2: Connect learning rate to model updates

Final Answer:

Quick Check:

Solution

Step 1: Check the expected type for learning rate

Step 2: Identify the correct float value for 0.01

Final Answer:

Quick Check:

Solution

Step 1: Understand effect of high learning rate

Step 2: Predict loss behavior with unstable training

Final Answer:

Quick Check:

Solution

Step 1: Check optimizer usage order

Step 2: Identify missing backward call

Final Answer:

Quick Check:

Solution

Step 1: Analyze why loss does not decrease

Step 2: Choose a safer learning rate adjustment

Final Answer:

Quick Check: