Practice


1. What does bias in NLP models usually mean?

easy

A. The model always predicts correctly

B. Unfair treatment of some groups by the model

C. The model runs faster on some data

D. The model uses more memory for some inputs

Solution

Step 1: Understand the meaning of bias in NLP
Bias means a model treats some groups unfairly, often because of skewed training data or design choices.
Step 2: Compare options to definition
Only option B, "Unfair treatment of some groups by the model", matches this definition. The other options describe correctness, speed, or memory use, not fairness.
Final Answer:
Unfair treatment of some groups by the model -> Option B
Quick Check:
Bias = Unfair treatment [OK]

Hint: Bias means unfairness in model predictions [OK]

Common Mistakes:

Confusing bias with model speed or memory use
Thinking bias means always correct predictions

2. Which of the following is the correct way to check fairness in an NLP model?

easy

A. Count the number of layers in the model

B. Check if the model uses GPU acceleration

C. Compare accuracy across different demographic groups

D. Measure the model's training time

Solution

Step 1: Identify fairness checking methods
Fairness is checked by comparing performance metrics such as accuracy across groups; a large gap between groups signals unequal treatment.
Step 2: Evaluate options
Only option C, "Compare accuracy across different demographic groups", measures performance per group. The other options concern architecture, hardware, or training time, which say nothing about fairness.
Final Answer:
Compare accuracy across different demographic groups -> Option C
Quick Check:
Fairness check = Compare accuracy by group [OK]

Hint: A fair model performs comparably across groups [OK]

Common Mistakes:

Confusing fairness with model speed or architecture
Ignoring group-based performance differences
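
The group-wise comparison from question 2 can be sketched in a few lines of Python. This is a minimal illustration, not a full fairness audit: the function name `accuracy_by_group` and the toy labels, predictions, and group tags are all hypothetical.

```python
from collections import defaultdict

def accuracy_by_group(y_true, y_pred, groups):
    """Return {group: accuracy} for parallel lists of labels, predictions, and group tags."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred, group in zip(y_true, y_pred, groups):
        total[group] += 1
        if truth == pred:
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}

# Hypothetical toy data: 1 = positive, 0 = negative
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 0, 1, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

per_group = accuracy_by_group(y_true, y_pred, groups)
print(per_group)  # {'A': 0.75, 'B': 0.5}
```

Here group A gets 3 of 4 predictions right and group B only 2 of 4, so the per-group view exposes a gap that a single overall accuracy figure would hide.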

3. Consider this Python code snippet checking fairness metrics:

group_accuracies = {'groupA': 0.85, 'groupB': 0.60}
if abs(group_accuracies['groupA'] - group_accuracies['groupB']) > 0.2:
    print('Fairness issue detected')
else:
    print('No fairness issue')

What will this code print?

medium

A. KeyError

B. No fairness issue

C. SyntaxError

D. Fairness issue detected

Solution

Step 1: Calculate difference in accuracies
The difference is |0.85 - 0.60| = 0.25, which is greater than 0.2.
Step 2: Evaluate the if condition
Since 0.25 > 0.2, the condition is true, so it prints 'Fairness issue detected'.
Final Answer:
Fairness issue detected -> Option D
Quick Check:
Difference 0.25 > 0.2 = Fairness issue [OK]

Hint: Check if accuracy difference > threshold for fairness [OK]

Common Mistakes:

Miscomputing the absolute difference
Confusing greater than with less than
Expecting syntax or key errors
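
The two-group check in question 3 can be extended to any number of groups by comparing the best and worst accuracy. The sketch below assumes that the largest pairwise gap is the quantity of interest; the helper names and the extra `groupC` entry are hypothetical, and the 0.2 threshold is simply carried over from the question.

```python
def max_accuracy_gap(group_accuracies):
    """Largest accuracy difference between any two groups."""
    values = list(group_accuracies.values())
    return max(values) - min(values)

def has_fairness_issue(group_accuracies, threshold=0.2):
    """True when the largest gap exceeds the chosen threshold."""
    return max_accuracy_gap(group_accuracies) > threshold

# 'groupC' is a hypothetical third group added for illustration
group_accuracies = {'groupA': 0.85, 'groupB': 0.60, 'groupC': 0.80}

gap = max_accuracy_gap(group_accuracies)
if has_fairness_issue(group_accuracies):
    print('Fairness issue detected')
else:
    print('No fairness issue')
```

With these numbers the largest gap is still the one between groupA and groupB (0.25), so the check flags a fairness issue just as the original snippet does.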

4. This code tries to calculate fairness but has a bug:

metrics = {'group1': {'accuracy': 0.9}, 'group2': {'accuracy': 0.85}}
diff = metrics['group1']['accuracy'] - metrics['group3']['accuracy']
if abs(diff) > 0.05:
    print('Bias detected')

What is the error, and how can it be fixed?

medium

A. KeyError because 'group3' does not exist; fix by checking keys first

B. SyntaxError due to missing colon; fix by adding colon

C. TypeError because accuracy is not a number; fix by converting to float

D. No error; code runs fine

Solution

Step 1: Identify the error cause
The code accesses metrics['group3'], which is not in the dictionary, causing a KeyError.
Step 2: Suggest fix
Check whether 'group3' exists in metrics before accessing it, or handle the missing key (for example with dict.get or try/except). If 'group3' is a typo for 'group2', correct the key.
Final Answer:
KeyError because 'group3' does not exist; fix by checking keys first -> Option A
Quick Check:
Missing key access = KeyError [OK]

Hint: Check dictionary keys before access to avoid KeyError [OK]

Common Mistakes:

Assuming all keys exist without checking
Confusing KeyError with SyntaxError or TypeError
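
One possible fix for the snippet in question 4 is to check the keys before computing the difference. This is a sketch under the assumption that a missing group should be reported rather than raise an exception; the helper `accuracy_diff` is hypothetical.

```python
metrics = {'group1': {'accuracy': 0.9}, 'group2': {'accuracy': 0.85}}

def accuracy_diff(metrics, first, second):
    """Return the accuracy difference, or None if either group is missing."""
    missing = [name for name in (first, second) if name not in metrics]
    if missing:
        print(f"Missing groups: {missing}")
        return None
    return metrics[first]['accuracy'] - metrics[second]['accuracy']

# 'group3' is absent, so this reports the missing key instead of raising KeyError
diff = accuracy_diff(metrics, 'group1', 'group3')
if diff is not None and abs(diff) > 0.05:
    print('Bias detected')
```

Note that comparing 'group1' with 'group2' gives a difference that sits right at the 0.05 threshold, so floating-point rounding decides the outcome of the strict comparison. A real check might use a small tolerance or a clearly separated threshold.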

5. You have an NLP sentiment model that predicts positive or negative sentiment. You notice it predicts positive sentiment 90% of the time for group A but only 60% of the time for group B, even though both groups have similar true sentiment. What is the best way to improve fairness?

hard

A. Collect more balanced training data including both groups equally

B. Increase model size to improve overall accuracy

C. Use a faster optimizer to train the model

D. Remove group B data from training to avoid confusion

Solution

Step 1: Understand the fairness problem
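The model predicts differently for groups with similar true sentiment, which indicates bias, most likely caused by unbalanced training data.
Step 2: Choose the best fix
Collecting balanced data lets the model learn from both groups equally, which typically improves fairness. A bigger model or faster optimizer does not address the imbalance, and removing group B data makes it worse.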
Final Answer:
Collect more balanced training data including both groups equally -> Option A
Quick Check:
Balanced data improves fairness [OK]

Hint: Balanced data helps fix bias in predictions [OK]

Common Mistakes:

Thinking bigger models fix bias automatically
Ignoring data imbalance as cause of unfairness
Removing data from minority groups
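
The answer to question 5 recommends collecting more balanced data. When new data cannot be collected, a common stopgap is to rebalance the existing training set by oversampling the under-represented group. The sketch below shows that substitute technique, not data collection itself; the function `oversample_to_balance`, the `group` field, and the toy examples are all hypothetical.

```python
import random
from collections import Counter, defaultdict

def oversample_to_balance(examples, seed=0):
    """Duplicate randomly chosen examples from smaller groups
    until every group is as large as the largest one."""
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for example in examples:
        by_group[example["group"]].append(example)
    target = max(len(items) for items in by_group.values())
    balanced = []
    for items in by_group.values():
        balanced.extend(items)
        # Sample with replacement to fill the shortfall (k may be 0)
        balanced.extend(rng.choices(items, k=target - len(items)))
    return balanced

# Hypothetical toy training set: 6 examples from group A, 2 from group B
train = (
    [{"text": f"a{i}", "label": i % 2, "group": "A"} for i in range(6)]
    + [{"text": f"b{i}", "label": i % 2, "group": "B"} for i in range(2)]
)

balanced = oversample_to_balance(train)
counts = Counter(example["group"] for example in balanced)
print(counts["A"], counts["B"])  # 6 6
```

Oversampling only repeats examples the model has already seen, so it adds no new information about group B. Genuinely new, representative data remains the stronger fix.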

Epoch	Loss ↓	Accuracy ↑	Observation
1	0.65	0.6	Model starts learning, bias still high
2	0.5	0.7	Loss decreases, accuracy improves, bias reducing
3	0.4	0.78	Better fairness observed, model balances accuracy and bias
4	0.35	0.82	Model converging, bias metric low
5	0.3	0.85	Final epoch, good accuracy and fairness

Bias and fairness in NLP - Model Pipeline Trace
