Practice

(1/5)

1. What is the main purpose of A/B testing in model deployment?

easy

A. To train a model faster using multiple GPUs

B. To compare two model versions by splitting user traffic

C. To backup model data in the cloud

D. To monitor server CPU usage during training

Solution

Step 1: Understand A/B testing concept
A/B testing involves running two versions of a model simultaneously to compare their performance.
Step 2: Identify the main goal
The goal is to split user traffic between two models to see which performs better in real conditions.
Final Answer:
To compare two model versions by splitting user traffic -> Option B
Quick Check:
A/B testing = compare models by traffic split [OK]

Hint: A/B testing means splitting users to compare models [OK]

Common Mistakes:

Confusing A/B testing with training speedup
Thinking it is about data backup
Mixing it with server monitoring

2. Which of the following is the correct way to define a traffic split for A/B testing in YAML?

easy

A. traffic: - model: v1 split: 50 - model: v2 split: 50

B. traffic: modelVersion: v1 percent: 50 modelVersion: v2 percent: 50

C. traffic: - version: v1 percent: 50 - version: v2 percent: 50

D. traffic: - modelVersion: v1 percent: 50 - modelVersion: v2 percent: 50

Solution

Step 1: Check YAML list syntax for traffic split
The correct YAML uses a list with dash (-) for each model version and keys 'modelVersion' and 'percent'.
Step 2: Validate keys and indentation
traffic: - modelVersion: v1 percent: 50 - modelVersion: v2 percent: 50 correctly uses 'modelVersion' and 'percent' with proper indentation and list format.
Final Answer:
traffic: - modelVersion: v1 percent: 50 - modelVersion: v2 percent: 50 -> Option D
Quick Check:
YAML list with modelVersion and percent = traffic: - modelVersion: v1 percent: 50 - modelVersion: v2 percent: 50 [OK]

Hint: YAML lists use dash and proper keys for traffic split [OK]

Common Mistakes:

Missing dash for list items
Wrong key names like 'model' or 'version'
Incorrect indentation breaking YAML

3. Given this Python snippet for A/B testing traffic assignment:

import random
traffic_split = {'v1': 70, 'v2': 30}
user_id = 12345
random.seed(user_id)
roll = random.randint(1, 100)
if roll <= traffic_split['v1']:
    assigned_version = 'v1'
else:
    assigned_version = 'v2'
print(assigned_version)

What will be the printed output?

medium

A. Random output each run

B. v2

C. v1

D. Error due to wrong seed usage

Solution

Step 1: Understand random seed and randint
Setting seed to user_id makes random output deterministic for that user. randint(1,100) generates a number between 1 and 100.
Step 2: Calculate roll value for user_id=12345
With seed 12345, roll is 54 (verified by running the code). Since 54 <= 70, assigned_version is 'v1'.
Final Answer:
v1 -> Option C
Quick Check:
roll=54 <= 70 means assign v1 [OK]

Hint: Seed fixes random; check roll against split [OK]

Common Mistakes:

Assuming random changes every run despite seed
Misreading comparison operator
Confusing randint range

4. You have this traffic split config for A/B testing:

traffic:
  - modelVersion: v1
    percent: 60
  - modelVersion: v2
    percent: 50

What is the main problem with this configuration?

medium

A. Percentages add up to more than 100%

B. Missing modelVersion key for v2

C. Percentages must be equal for A/B testing

D. YAML syntax error due to indentation

Solution

Step 1: Sum the traffic percentages
60% + 50% = 110%, which is more than 100% allowed for traffic split.
Step 2: Understand traffic split constraints
Traffic percentages must sum to exactly 100% to properly split user traffic between models.
Final Answer:
Percentages add up to more than 100% -> Option A
Quick Check:
Sum of percents > 100% is invalid [OK]

Hint: Traffic split percentages must total 100% [OK]

Common Mistakes:

Ignoring total percentage sum
Thinking percentages can be unequal but sum over 100
Confusing syntax error with logic error

5. You want to run an A/B test comparing model versions v1 and v2. You have 10,000 users and want to assign 70% traffic to v1 and 30% to v2. Which approach ensures consistent user assignment and fair metric tracking?

hard

A. Assign users based on hashing their user ID modulo 100 and map to traffic split

B. Assign users manually by checking their signup date

C. Assign all users to v1 for the first week, then switch all to v2

D. Randomly assign users on each request without storing assignment

Solution

Step 1: Understand consistent user assignment need
Users must always get the same model version to avoid confusing metrics and user experience.
Step 2: Evaluate assignment methods
Hashing user ID modulo 100 maps users consistently to a number 0-99, which can be split 70/30 for v1/v2.
Step 3: Reject other options
Random assignment each request causes inconsistency; switching all users breaks A/B test; manual assignment is impractical and biased.
Final Answer:
Assign users based on hashing their user ID modulo 100 and map to traffic split -> Option A
Quick Check:
Consistent hashing ensures stable A/B assignment [OK]

Hint: Use hashing on user ID for stable traffic split [OK]

Common Mistakes:

Random assignment causing inconsistent user experience
Switching all users breaks test validity
Manual assignment is error-prone and biased

Why A/B testing model versions in MLOps? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand A/B testing concept

Step 2: Identify the main goal

Final Answer:

Quick Check:

Solution

Step 1: Check YAML list syntax for traffic split

Step 2: Validate keys and indentation

Final Answer:

Quick Check:

Solution

Step 1: Understand random seed and randint

Step 2: Calculate roll value for user_id=12345

Final Answer:

Quick Check:

Solution

Step 1: Sum the traffic percentages

Step 2: Understand traffic split constraints

Final Answer:

Quick Check:

Solution

Step 1: Understand consistent user assignment need

Step 2: Evaluate assignment methods

Step 3: Reject other options

Final Answer:

Quick Check: