Challenge - 5 Problems
Torchvision Pretrained Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
❓ Predict Output
intermediate2:00remaining
Output shape of pre-trained ResNet18 model
What is the shape of the output tensor when passing a batch of 8 RGB images of size 224x224 through a pre-trained ResNet18 model from torchvision?
PyTorch
import torch import torchvision.models as models model = models.resnet18(pretrained=True) model.eval() input_tensor = torch.randn(8, 3, 224, 224) output = model(input_tensor) output_shape = output.shape print(output_shape)
Attempts:
2 left
💡 Hint
The pre-trained ResNet18 model outputs class scores for 1000 classes.
✗ Incorrect
The pre-trained ResNet18 model outputs a tensor of shape (batch_size, 1000) representing the scores for 1000 ImageNet classes.
❓ Model Choice
intermediate2:00remaining
Choosing a model for feature extraction
You want to extract 512-dimensional feature vectors from images using a pre-trained torchvision model without the final classification layer. Which model and modification is correct?
Attempts:
2 left
💡 Hint
The final fully connected layer in ResNet18 is model.fc.
✗ Incorrect
In ResNet18, the final classification layer is model.fc. Replacing it with Identity returns the 512-dimensional features before classification.
❓ Hyperparameter
advanced2:00remaining
Correct input normalization for pre-trained models
Which normalization mean and standard deviation values should be used to preprocess images before feeding them into any torchvision pre-trained model?
Attempts:
2 left
💡 Hint
These values are based on ImageNet dataset statistics.
✗ Incorrect
All torchvision pre-trained models expect input images normalized with mean=[0.485, 0.456, 0.406] and std=[0.229, 0.224, 0.225].
❓ Metrics
advanced2:00remaining
Top-1 accuracy of pre-trained models on ImageNet validation
Which of the following is the closest reported Top-1 accuracy on ImageNet validation for the pre-trained torchvision DenseNet121 model?
Attempts:
2 left
💡 Hint
DenseNet121 accuracy is generally around mid-70s percent.
✗ Incorrect
The DenseNet121 pre-trained model achieves about 74.9% Top-1 accuracy on ImageNet validation.
🔧 Debug
expert2:00remaining
Identifying error when loading pre-trained model weights
What error will occur if you try to load a pre-trained model with torchvision.models.resnet50(pretrained=False) and then call model.load_state_dict(torch.load('resnet50_weights.pth')) where the weights file contains pre-trained weights?
Attempts:
2 left
💡 Hint
Mismatch in model architecture or missing keys causes loading errors.
✗ Incorrect
If the weights file does not exactly match the model architecture or keys, load_state_dict raises a RuntimeError with details about missing or unexpected keys.