Challenge - 5 Problems
Cluster Evaluation Master
Get all challenges correct to earn this badge!
Test your skills under time pressure!
🧠 Conceptual
intermediate2:00remaining
Understanding Silhouette Score Interpretation
Which statement best describes what a Silhouette Score close to +1 indicates about a clustering result?
Attempts:
2 left
💡 Hint
Think about what a high Silhouette Score means for the distance between points and their own cluster versus other clusters.
✗ Incorrect
A Silhouette Score near +1 means points are much closer to their own cluster than to other clusters, indicating good separation and assignment.
❓ Predict Output
intermediate2:00remaining
Output of Adjusted Rand Index Calculation
What is the output of the following Python code snippet using sklearn.metrics.adjusted_rand_score?
ML Python
from sklearn.metrics import adjusted_rand_score labels_true = [0, 0, 1, 1, 2, 2] labels_pred = [1, 1, 0, 0, 2, 2] score = adjusted_rand_score(labels_true, labels_pred) print(round(score, 2))
Attempts:
2 left
💡 Hint
Adjusted Rand Index is symmetric and equals 1 when clusterings are identical up to label permutation.
✗ Incorrect
The predicted labels are a perfect permutation of the true labels, so the Adjusted Rand Index is 1.0.
❓ Model Choice
advanced2:00remaining
Choosing the Best Metric for Clustering with Unknown Labels
You have unlabeled data and want to evaluate your clustering algorithm's quality. Which metric is most appropriate?
Attempts:
2 left
💡 Hint
Consider which metrics require true labels and which do not.
✗ Incorrect
Silhouette Score does not require true labels and measures how well clusters are separated internally.
❓ Metrics
advanced2:00remaining
Interpreting Davies-Bouldin Index Values
Which of the following statements about the Davies-Bouldin Index (DBI) is true?
Attempts:
2 left
💡 Hint
Think about what a low versus high DBI means for cluster similarity and separation.
✗ Incorrect
A lower Davies-Bouldin Index means clusters are compact and far from each other, indicating better clustering.
🔧 Debug
expert2:00remaining
Identifying the Error in Cluster Evaluation Code
What error will the following code raise when executed?
ML Python
from sklearn.metrics import silhouette_score X = [[1, 2], [2, 3], [10, 10], [11, 11]] labels = [0, 0, 1] score = silhouette_score(X, labels) print(score)
Attempts:
2 left
💡 Hint
Check if the number of labels matches the number of data points.
✗ Incorrect
The labels list has fewer elements than the data points, causing a ValueError.