0
0
SciPydata~5 mins

Cluster evaluation metrics in SciPy - Cheat Sheet & Quick Revision

Choose your learning style9 modes available
Recall & Review
beginner
What is the purpose of cluster evaluation metrics?
Cluster evaluation metrics help us measure how well a clustering algorithm has grouped data points. They tell us if clusters are tight and well-separated.
Click to reveal answer
beginner
Explain the Silhouette Score in clustering.
The Silhouette Score measures how similar a point is to its own cluster compared to other clusters. Scores range from -1 to 1, where a higher score means better clustering.
Click to reveal answer
intermediate
What does the Davies-Bouldin Index indicate?
The Davies-Bouldin Index measures the average similarity between clusters. Lower values mean clusters are more distinct and better separated.
Click to reveal answer
intermediate
How does the Calinski-Harabasz Index evaluate clusters?
The Calinski-Harabasz Index compares the variance within clusters to the variance between clusters. Higher values indicate better defined clusters.
Click to reveal answer
intermediate
What is the difference between internal and external cluster evaluation metrics?
Internal metrics use only the data and cluster labels to evaluate quality (e.g., Silhouette Score). External metrics compare clustering results to known true labels (e.g., Adjusted Rand Index).
Click to reveal answer
Which cluster evaluation metric ranges from -1 to 1 and measures how well points fit their clusters?
ACalinski-Harabasz Index
BDavies-Bouldin Index
CAdjusted Rand Index
DSilhouette Score
A lower value of which metric indicates better cluster separation?
ADavies-Bouldin Index
BSilhouette Score
CCalinski-Harabasz Index
DAdjusted Rand Index
Which metric compares variance within clusters to variance between clusters?
ACalinski-Harabasz Index
BSilhouette Score
CDavies-Bouldin Index
DAdjusted Rand Index
Which cluster evaluation metric requires true labels to compare clustering results?
ACalinski-Harabasz Index
BDavies-Bouldin Index
CAdjusted Rand Index
DSilhouette Score
Internal cluster evaluation metrics use:
ARandom data
BOnly data and cluster labels
CExternal validation data
DTrue labels only
Describe the Silhouette Score and how it helps evaluate clusters.
Think about how close points are to their own cluster versus others.
You got /3 concepts.
    Explain the difference between internal and external cluster evaluation metrics.
    Consider whether true labels are needed.
    You got /3 concepts.