Recall & Review
(Beginner) What is model quantization in machine learning?
Model quantization is a technique that reduces the precision of the numbers used to represent a model's parameters, typically from 32-bit floats to lower-bit integers, to make the model smaller and faster without much loss in accuracy.
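As a rough illustration of the idea above (not part of the card itself), here is a minimal sketch of uniform symmetric quantization from 32-bit floats to 8-bit integers in NumPy; the function names are illustrative, not from any particular library:

```python
import numpy as np

def quantize_int8(weights):
    """Uniform symmetric quantization: float32 weights -> int8 codes plus a scale."""
    scale = np.abs(weights).max() / 127.0  # map the largest magnitude to 127
    q = np.clip(np.round(weights / scale), -128, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float32 values from the int8 codes."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=1000).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print(q.dtype)  # int8: each weight now takes 1 byte instead of 4
```

The reconstruction error per weight is bounded by half the scale step, which is why accuracy usually drops only slightly.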
(Beginner) Explain model pruning in simple terms.
Model pruning means removing parts of a neural network, like some weights or neurons, that are less important. This makes the model smaller and faster while trying to keep its performance close to the original.
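A toy sketch of the "remove less important weights" idea, assuming the common magnitude criterion (smallest absolute values are least important); the helper name is made up for illustration:

```python
import numpy as np

def magnitude_prune(weights, sparsity=0.5):
    """Zero out the fraction `sparsity` of weights with the smallest magnitudes."""
    threshold = np.quantile(np.abs(weights), sparsity)
    mask = np.abs(weights) > threshold  # keep only the larger-magnitude weights
    return weights * mask, mask

rng = np.random.default_rng(0)
w = rng.normal(size=(64, 64)).astype(np.float32)
pruned, mask = magnitude_prune(w, sparsity=0.5)
print(mask.mean())  # fraction of weights kept, roughly 0.5
```

In practice, pruned models are usually fine-tuned afterwards to recover any lost accuracy.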
(Intermediate) How does quantization help in deploying models on mobile devices?
Quantization reduces the model size and speeds up computation by using fewer bits for numbers. This saves memory and battery on mobile devices, making models run faster and use less power.
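The memory saving is easy to estimate with back-of-envelope arithmetic (the 10-million-parameter figure below is just an illustrative assumption):

```python
# Storage for a hypothetical 10-million-parameter model at two precisions
params = 10_000_000
fp32_mb = params * 4 / 1e6  # 32-bit floats: 4 bytes each -> 40 MB
int8_mb = params * 1 / 1e6  # 8-bit integers: 1 byte each -> 10 MB
print(fp32_mb, int8_mb)     # 40.0 10.0 -> a 4x reduction
```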
(Intermediate) What is the main trade-off when applying pruning to a neural network?
The main trade-off is between model size and accuracy. Pruning reduces size and speeds up the model but can cause some loss in accuracy if important parts are removed.
(Intermediate) Name two common types of pruning methods.
Two common pruning methods are: 1) Magnitude-based pruning, which removes weights with small values, and 2) Structured pruning, which removes entire neurons or filters.
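To contrast the two methods above, here is an illustrative sketch of structured pruning, which drops whole output neurons (rows of a weight matrix) ranked by their L2 norms, so the matrix itself shrinks rather than just becoming sparse; the function name and the keep fraction are assumptions for the example:

```python
import numpy as np

def structured_prune(weight_matrix, keep_fraction=0.75):
    """Drop whole output neurons (rows) with the smallest L2 norms."""
    norms = np.linalg.norm(weight_matrix, axis=1)
    n_keep = int(round(keep_fraction * weight_matrix.shape[0]))
    keep = np.sort(np.argsort(norms)[-n_keep:])  # indices of the strongest rows
    return weight_matrix[keep], keep

rng = np.random.default_rng(0)
w = rng.normal(size=(32, 16)).astype(np.float32)
w_small, kept = structured_prune(w, keep_fraction=0.75)
print(w_small.shape)  # (24, 16): the layer is genuinely smaller
```

Unlike magnitude-based pruning, this yields a dense, smaller matrix that speeds up inference on ordinary hardware without sparse-matrix support.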
What does quantization primarily reduce in a neural network model?
Quantization reduces the precision of the numbers representing weights, typically from 32-bit floats to lower-bit integers.
Which of the following is a goal of pruning a model?
Pruning removes less important weights or neurons to make the model smaller and faster.
What is a common effect of quantization on model accuracy?
Quantization can slightly reduce accuracy due to the lower precision, but the loss is often minimal.
Which pruning method removes entire neurons or filters?
Structured pruning removes entire neurons or filters, unlike magnitude-based pruning which removes individual weights.
Why is model optimization important for edge devices?
Optimization reduces memory and computation, making models suitable for devices with limited resources.
Describe in your own words what model quantization and pruning are, and why they are useful.
Hint: Think about how to make a big model smaller and faster without losing much accuracy.
Explain the trade-offs involved when applying pruning to a neural network model.
Hint: Consider what happens if you remove too much from the model.