Practice

(1/5)

1. What is the main idea behind EfficientNet scaling in computer vision models?

easy

A. It uses only higher image resolution without changing the model.

B. It only increases the number of layers to improve accuracy.

C. It reduces model size by removing layers randomly.

D. It scales depth, width, and resolution together using fixed constants.

Solution

Step 1: Understand EfficientNet scaling components
EfficientNet scales three model dimensions: depth (layers), width (channels), and input resolution together.
Step 2: Recognize the use of constants
It uses constants alpha, beta, gamma with a scaling factor phi to balance these dimensions.
Final Answer:
It scales depth, width, and resolution together using fixed constants. -> Option D
Quick Check:
EfficientNet scales depth, width, resolution together [OK]

Hint: Remember: EfficientNet scales depth, width, and resolution together [OK]

Common Mistakes:

Thinking it only increases layers
Assuming it changes only resolution
Believing it randomly removes layers

2. Which formula correctly represents the compound scaling method used in EfficientNet for depth (d), width (w), and resolution (r)?

easy

A. d = phi * alpha, w = phi * beta, r = phi * gamma

B. d = alpha + phi, w = beta + phi, r = gamma + phi

C. d = alpha^phi, w = beta^phi, r = gamma^phi

D. d = alpha / phi, w = beta / phi, r = gamma / phi

Solution

Step 1: Recall EfficientNet scaling formula
EfficientNet uses exponential scaling: depth = alpha^phi, width = beta^phi, resolution = gamma^phi.
Step 2: Compare options with formula
Only d = alpha^phi, w = beta^phi, r = gamma^phi matches the exponential form with constants raised to the power phi.
Final Answer:
d = alpha^phi, w = beta^phi, r = gamma^phi -> Option C
Quick Check:
Uses exponentiation alpha^phi [OK]

Hint: Look for exponential scaling with phi as power [OK]

Common Mistakes:

Using multiplication instead of exponentiation
Adding phi instead of exponentiating
Dividing constants by phi

3. Given alpha=1.2, beta=1.1, gamma=1.15, and phi=2, what is the scaled depth (d) using EfficientNet scaling?

medium

A. 1.2^2 = 1.44

B. 1.2 * 2 = 2.4

C. 1.2 + 2 = 3.2

D. 2 / 1.2 = 1.67

Solution

Step 1: Apply the formula for depth scaling
Depth d = alpha^phi = 1.2^2 = 1.44.
Step 2: Calculate the value
1.2 squared equals 1.44, matching 1.2^2 = 1.44.
Final Answer:
1.44 -> Option A
Quick Check:
1.2^2 = 1.44 [OK]

Hint: Raise alpha to the power phi for depth [OK]

Common Mistakes:

Multiplying alpha by phi instead of exponentiating
Adding phi to alpha
Dividing phi by alpha

4. Identify the error in this Python code snippet for EfficientNet scaling:

alpha, beta, gamma, phi = 1.2, 1.1, 1.15, 2
depth = alpha * phi
width = beta ** phi
resolution = gamma ** phi

medium

A. Depth should be alpha ** phi, not alpha * phi

B. Width should be beta * phi, not beta ** phi

C. Resolution should be gamma * phi, not gamma ** phi

D. No error, the code is correct

Solution

Step 1: Review EfficientNet scaling formula
Depth should be scaled as alpha raised to phi (alpha ** phi), not multiplied.
Step 2: Check code for depth calculation
Code uses alpha * phi which is incorrect; width and resolution use exponentiation correctly.
Final Answer:
Depth should be alpha ** phi, not alpha * phi -> Option A
Quick Check:
Depth uses exponentiation (**), not multiplication (*) [OK]

Hint: Depth uses exponentiation, not multiplication [OK]

Common Mistakes:

Confusing multiplication with exponentiation
Assuming width or resolution calculations are wrong
Thinking code has no errors

5. You want to scale an EfficientNet model with phi=3, alpha=1.2, beta=1.1, gamma=1.15. Which of these sets of scaled values (depth, width, resolution) is closest to the correct scaling?

hard

A. (1.2+3, 1.1+3, 1.15+3) = (4.2, 4.1, 4.15)

B. (1.2^3, 1.1^3, 1.15^3) ≈ (1.73, 1.33, 1.52)

C. (3*1.2, 3*1.1, 3*1.15) = (3.6, 3.3, 3.45)

D. (3/1.2, 3/1.1, 3/1.15) ≈ (2.5, 2.73, 2.61)

Solution

Step 1: Apply compound scaling formula
Scale each dimension by raising constants to the power phi: depth = 1.2^3, width = 1.1^3, resolution = 1.15^3.
Step 2: Calculate approximate values
1.2^3 ≈ 1.73, 1.1^3 ≈ 1.33, 1.15^3 ≈ 1.52, matching (1.2^3, 1.1^3, 1.15^3) ≈ (1.73, 1.33, 1.52).
Final Answer:
(1.73, 1.33, 1.52) -> Option B
Quick Check:
1.2^3 ≈ 1.73, 1.1^3 ≈ 1.33, 1.15^3 ≈ 1.52 [OK]

Hint: Use powers, not multiplication or addition for scaling [OK]

Common Mistakes:

Multiplying constants by phi instead of exponentiating
Adding phi to constants
Dividing phi by constants

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.8	0.45	Model starts learning basic features, moderate accuracy
5	1	0.68	Loss decreases steadily, accuracy improves
10	0.6	0.8	Model captures complex patterns, good accuracy
15	0.4	0.87	Loss continues to decrease, accuracy nearing convergence
20	0.35	0.89	Training stabilizes with high accuracy

EfficientNet scaling in Computer Vision - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand EfficientNet scaling components

Step 2: Recognize the use of constants

Final Answer:

Quick Check:

Solution

Step 1: Recall EfficientNet scaling formula

Step 2: Compare options with formula

Final Answer:

Quick Check:

Solution

Step 1: Apply the formula for depth scaling

Step 2: Calculate the value

Final Answer:

Quick Check:

Solution

Step 1: Review EfficientNet scaling formula

Step 2: Check code for depth calculation

Final Answer:

Quick Check:

Solution

Step 1: Apply compound scaling formula

Step 2: Calculate approximate values

Final Answer:

Quick Check: