Bird
0
0

Why might the Calinski-Harabasz Index give misleading results on clusters with very different sizes?

hard📝 Conceptual Q10 of 15
SciPy - Clustering and Distance
Why might the Calinski-Harabasz Index give misleading results on clusters with very different sizes?
AIt is sensitive to the number of features, not cluster size
BIt assumes clusters have similar variance and size
CIt only measures cluster compactness, ignoring separation
DIt requires true labels to work correctly
Step-by-Step Solution
Solution:
  1. Step 1: Understand Calinski-Harabasz assumptions

    This index assumes clusters are roughly similar in size and variance for meaningful comparison.
  2. Step 2: Explain why different sizes cause issues

    Clusters with very different sizes can distort the variance ratio, misleading the index.
  3. Final Answer:

    It assumes clusters have similar variance and size -> Option B
  4. Quick Check:

    Calinski-Harabasz assumption = D [OK]
Quick Trick: Calinski-Harabasz assumes similar cluster sizes [OK]
Common Mistakes:
  • Thinking it requires true labels
  • Believing it ignores cluster separation
  • Confusing sensitivity to features with cluster size

Want More Practice?

15+ quiz questions · All difficulty levels · Free

Free Signup - Practice All Questions
More SciPy Quizzes