0
0
ML Pythonml~5 mins

Cluster evaluation metrics in ML Python - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is the purpose of cluster evaluation metrics?
Cluster evaluation metrics help us measure how well a clustering algorithm groups data points. They tell us if clusters are tight and well-separated.
Click to reveal answer
intermediate
Explain the Silhouette Score in clustering.
The Silhouette Score measures how similar a point is to its own cluster compared to other clusters. Scores near +1 mean good clustering, near 0 means overlapping clusters, and negative means wrong cluster assignment.
Click to reveal answer
intermediate
What does the Davies-Bouldin Index indicate?
The Davies-Bouldin Index measures average similarity between clusters. Lower values mean clusters are compact and far apart, which is better.
Click to reveal answer
intermediate
Describe the difference between internal and external cluster evaluation metrics.
Internal metrics use only the data and cluster labels to evaluate quality (e.g., Silhouette Score). External metrics compare clusters to known true labels (e.g., Adjusted Rand Index).
Click to reveal answer
intermediate
What is the Adjusted Rand Index (ARI) used for?
ARI measures similarity between the clustering result and true labels, adjusting for chance. It ranges from -1 to 1, where 1 means perfect match.
Click to reveal answer
Which cluster evaluation metric ranges from -1 to 1 and measures similarity to true labels?
ASilhouette Score
BAdjusted Rand Index
CDavies-Bouldin Index
DCalinski-Harabasz Index
A high Silhouette Score indicates:
AClusters overlap heavily
BClusters are poorly formed
CClusters are well separated and compact
DClusters have many outliers
Which metric should be minimized for better clustering?
ADavies-Bouldin Index
BAdjusted Rand Index
CSilhouette Score
DHomogeneity Score
Internal cluster evaluation metrics require:
AHuman expert input
BTrue labels of data
CExternal validation data
DOnly data and cluster assignments
Which metric compares clustering results to known labels adjusting for chance grouping?
AAdjusted Rand Index
BCalinski-Harabasz Index
CDavies-Bouldin Index
DSilhouette Score
Explain how the Silhouette Score helps evaluate clustering quality.
Think about how close points are to their own cluster versus others.
You got /3 concepts.
    Describe the difference between internal and external cluster evaluation metrics with examples.
    Consider whether true labels are needed.
    You got /3 concepts.