0
0
Computer Visionml~5 mins

Pre-trained models (ResNet, VGG, EfficientNet) in Computer Vision - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is a pre-trained model in computer vision?
A pre-trained model is a neural network that has already been trained on a large dataset. It can be reused to solve similar tasks without training from scratch, saving time and resources.
Click to reveal answer
intermediate
What is the main idea behind the ResNet architecture?
ResNet uses 'skip connections' or 'residual connections' that let the model learn differences from the input instead of the full transformation. This helps train very deep networks without losing information.
Click to reveal answer
intermediate
How does VGG differ from ResNet in design?
VGG uses a simple and uniform design with many layers of small 3x3 filters stacked one after another. It does not use skip connections like ResNet but focuses on depth and simplicity.
Click to reveal answer
advanced
What is special about EfficientNet compared to older models like VGG and ResNet?
EfficientNet scales the model's depth, width, and resolution together in a balanced way. This makes it more efficient, achieving better accuracy with fewer parameters and less computation.
Click to reveal answer
beginner
Why do we use pre-trained models like ResNet, VGG, and EfficientNet in new computer vision tasks?
We use them to save time and improve performance. Since they learned useful features from large datasets, they help new models learn faster and better even with less data.
Click to reveal answer
What is the key feature of ResNet that helps train very deep networks?
AVery large convolution filters
BReducing the number of layers
CUsing only fully connected layers
DSkip connections that add input to output
Which model uses many small 3x3 convolution filters stacked deeply?
AVGG
BResNet
CEfficientNet
DAlexNet
What does EfficientNet optimize to improve performance?
AOnly depth of the network
BOnly width of the network
CDepth, width, and resolution together
DOnly resolution of input images
Why are pre-trained models useful for new tasks?
AThey require more data to train
BThey start with learned features from large datasets
CThey always have fewer layers
DThey do not need any training
Which of these is NOT a characteristic of VGG?
AUse of skip connections
BDeep network with many layers
CStacked 3x3 convolution layers
DSimple and uniform architecture
Explain how ResNet's skip connections help in training deep neural networks.
Think about how adding input directly to output helps the network learn.
You got /4 concepts.
    Describe the main differences between VGG and EfficientNet architectures.
    Focus on design style and scaling approach.
    You got /4 concepts.