0
0
Computer Visionml~20 mins

EfficientNet scaling in Computer Vision - Practice Problems & Coding Challenges

Choose your learning style9 modes available
Challenge - 5 Problems
🎖️
EfficientNet Scaling Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate
2:00remaining
What does compound scaling in EfficientNet do?

EfficientNet uses a compound scaling method to scale up the model. What does this method do?

AIt randomly changes the network architecture during training to find the best model.
BIt only increases the depth of the network while keeping width and resolution constant.
CIt scales the number of output classes to improve accuracy.
DIt scales depth, width, and resolution of the network uniformly using fixed coefficients.
Attempts:
2 left
💡 Hint

Think about how EfficientNet balances different model dimensions together.

Model Choice
intermediate
2:00remaining
Choosing EfficientNet variant for limited GPU memory

You want to train an EfficientNet model on a GPU with limited memory. Which variant should you choose to balance accuracy and memory use?

AEfficientNet-B0
BEfficientNet-B7
CEfficientNet-B5
DEfficientNet-B3
Attempts:
2 left
💡 Hint

Smaller variants use less memory but have lower accuracy.

Predict Output
advanced
2:00remaining
Output shape after EfficientNet scaling

Given the following code snippet using PyTorch, what is the output shape of the tensor after scaling?

Computer Vision
import torch
from torchvision.models import efficientnet_b0

model = efficientnet_b0()
input_tensor = torch.randn(1, 3, 224, 224)
output = model.features(input_tensor)
print(output.shape)
Atorch.Size([1, 1280, 14, 14])
Btorch.Size([1, 1280, 7, 7])
Ctorch.Size([1, 320, 7, 7])
Dtorch.Size([1, 1000])
Attempts:
2 left
💡 Hint

Look at the output channels (1280) and spatial size after the feature extractor in EfficientNet-B0.

Hyperparameter
advanced
2:00remaining
Effect of increasing input resolution in EfficientNet scaling

What is the main effect of increasing the input image resolution in EfficientNet's compound scaling?

AIt increases the spatial size of feature maps, improving fine detail capture but increasing computation.
BIt reduces the number of layers in the network to speed up training.
CIt decreases the number of channels in each layer to reduce overfitting.
DIt changes the activation functions to improve non-linearity.
Attempts:
2 left
💡 Hint

Think about what happens when you feed larger images into a convolutional network.

Metrics
expert
2:00remaining
Comparing accuracy and FLOPS of EfficientNet variants

Which EfficientNet variant has approximately 19 billion FLOPS and achieves around 84.0% top-1 accuracy on ImageNet?

AEfficientNet-B7
BEfficientNet-B5
CEfficientNet-B6
DEfficientNet-B4
Attempts:
2 left
💡 Hint

Recall the FLOPS and accuracy increase with variant number.