0
0
Computer Visionml~5 mins

Model optimization (pruning, quantization) in Computer Vision - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is model pruning in machine learning?
Model pruning is a technique that removes less important parts of a neural network, like some connections or neurons, to make the model smaller and faster without losing much accuracy.
Click to reveal answer
beginner
Explain quantization in the context of neural networks.
Quantization reduces the precision of the numbers used in a model, for example changing 32-bit floats to 8-bit integers, which makes the model smaller and faster to run, especially on devices with limited resources.
Click to reveal answer
intermediate
How does pruning help improve model performance?
Pruning removes unnecessary parts of the model, which reduces its size and speeds up predictions, making it easier to run on devices with less memory or slower processors.
Click to reveal answer
intermediate
What is a common trade-off when applying quantization to a model?
The trade-off is between model size and speed versus accuracy. Quantization makes the model smaller and faster but can slightly reduce its accuracy due to lower number precision.
Click to reveal answer
advanced
Name two common types of pruning used in model optimization.
Two common types are: 1) Weight pruning, which removes individual connections with small weights, and 2) Structured pruning, which removes entire neurons or filters to simplify the model structure.
Click to reveal answer
What does pruning mainly remove from a neural network?
ATraining data samples
BOutput layers
CInput features
DLess important connections or neurons
Quantization typically changes model numbers from:
AIntegers to floats
B32-bit floats to 8-bit integers
C8-bit integers to 32-bit floats
DStrings to numbers
Which is a benefit of model pruning?
ASpeeds up model inference
BRequires more memory
CAdds more layers
DIncreases model size
What is a possible downside of quantization?
AModel becomes slower
BModel uses more memory
CModel accuracy may slightly decrease
DModel requires more training data
Structured pruning removes:
AEntire neurons or filters
BIndividual weights only
CTraining samples
DInput features
Describe how pruning and quantization help optimize a computer vision model for deployment on mobile devices.
Think about how smaller and faster models help on phones.
You got /3 concepts.
    Explain the trade-offs involved when applying pruning and quantization to a neural network.
    Consider what you lose and gain with these techniques.
    You got /4 concepts.