Practice

(1/5)

1. What does dataset bias in computer vision mean?

easy

A. The data does not fairly represent all types of images or cases

B. The model always predicts perfectly on all images

C. The dataset is too large to process

D. The images are all black and white

Solution

Step 1: Understand dataset bias meaning
Dataset bias means the data used to train a model does not cover all possible cases fairly.
Step 2: Compare options to definition
Only The data does not fairly represent all types of images or cases describes this correctly. Other options describe unrelated issues.
Final Answer:
The data does not fairly represent all types of images or cases -> Option A
Quick Check:
Dataset bias = unfair data representation [OK]

Hint: Bias means data is not fair or balanced [OK]

Common Mistakes:

Thinking bias means model is perfect
Confusing bias with dataset size
Assuming bias means image color

2. Which of the following is the correct way to check for dataset bias in a vision dataset using Python?

easy

A. Use random.shuffle(dataset) to fix bias

B. Use value_counts() on labels to see class distribution

C. Use len(dataset) without checking labels

D. Use print(dataset) only

Solution

Step 1: Identify method to check bias
Checking class distribution with value_counts() helps find imbalance in labels.
Step 2: Evaluate other options
Printing dataset or length alone doesn't show bias. Shuffling data doesn't check bias.
Final Answer:
Use value_counts() on labels to see class distribution -> Option B
Quick Check:
Check label counts = value_counts() [OK]

Hint: Check label counts to find bias [OK]

Common Mistakes:

Only printing dataset without analysis
Assuming dataset length shows bias
Thinking shuffling fixes bias

3. Given this Python code snippet analyzing a vision dataset labels:

import pandas as pd
labels = ['cat', 'dog', 'cat', 'cat', 'dog', 'bird']
label_counts = pd.Series(labels).value_counts()
print(label_counts)

What is the output?

medium

A. bird 3 cat 2 dog 1

B. cat 2 dog 3 bird 1

C. cat 3 dog 2 bird 1

D. dog 3 cat 3 bird 3

Solution

Step 1: Count occurrences of each label
Labels list has 'cat' 3 times, 'dog' 2 times, and 'bird' 1 time.
Step 2: Understand value_counts output
value_counts() returns counts sorted descending by default.
Final Answer:
cat 3\ndog 2\nbird 1 -> Option C
Quick Check:
Count labels correctly = cat:3, dog:2, bird:1 [OK]

Hint: Count each label frequency carefully [OK]

Common Mistakes:

Mixing counts of dog and cat
Assuming alphabetical order instead of count order
Miscounting occurrences

4. You have this code to check dataset bias:

labels = ['car', 'car', 'truck', 'car', 'truck']
counts = {}
for label in labels:
    counts[label] = counts.get(label, 0) + 1
print(counts)

But the output is {}. What is the likely error?

medium

A. The 'get' method is used incorrectly with wrong parameters

B. The print statement is outside the loop and missing indentation

C. The code is correct; output should be {'car': 3, 'truck': 2}

D. The dictionary 'counts' was reinitialized inside the loop

Solution

Step 1: Analyze code behavior
If 'counts' is reset inside the loop, it will be empty after loop ends.
Step 2: Identify cause of empty output
Reinitializing 'counts' inside loop clears previous counts, causing empty dict at print.
Final Answer:
The dictionary 'counts' was reinitialized inside the loop -> Option D
Quick Check:
Resetting dict inside loop empties counts [OK]

Hint: Check if dict is reset inside loop [OK]

Common Mistakes:

Thinking print indentation causes empty output
Assuming get() method is wrong
Ignoring variable scope inside loop

5. You have a vision dataset with 90% images of cats and 10% dogs. Which method best reduces dataset bias to improve model fairness?

hard

A. Collect more dog images to balance classes

B. Remove all cat images to keep only dogs

C. Train model only on cat images

D. Ignore class imbalance and train as is

Solution

Step 1: Understand dataset imbalance problem
Having 90% cats and 10% dogs causes bias favoring cats.
Step 2: Choose method to fix bias
Collecting more dog images balances classes, reducing bias and improving fairness.
Final Answer:
Collect more dog images to balance classes -> Option A
Quick Check:
Balance classes by adding data [OK]

Hint: Balance classes by adding underrepresented data [OK]

Common Mistakes:

Removing majority class loses useful data
Training on one class ignores others
Ignoring imbalance causes biased model

Why Dataset bias in vision in Computer Vision? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand dataset bias meaning

Step 2: Compare options to definition

Final Answer:

Quick Check:

Solution

Step 1: Identify method to check bias

Step 2: Evaluate other options

Final Answer:

Quick Check:

Solution

Step 1: Count occurrences of each label

Step 2: Understand value_counts output

Final Answer:

Quick Check:

Solution

Step 1: Analyze code behavior

Step 2: Identify cause of empty output

Final Answer:

Quick Check:

Solution

Step 1: Understand dataset imbalance problem

Step 2: Choose method to fix bias

Final Answer:

Quick Check: