Challenge - 5 Problems

🧠 Conceptual

intermediate

Understanding Silhouette Score Interpretation

Which statement best describes what a Silhouette Score close to +1 indicates about a clustering result?

A. Clusters have many outliers and points are far from cluster centers

B. Clusters are overlapping heavily and points are assigned randomly

C. Clusters are well separated and points are appropriately assigned to their clusters

D. Clusters are too small and have very few points

❓ Predict Output

intermediate

Output of Adjusted Rand Index Calculation

What is the output of the following Python code snippet using sklearn.metrics.adjusted_rand_score?

ML Python

from sklearn.metrics import adjusted_rand_score
labels_true = [0, 0, 1, 1, 2, 2]
labels_pred = [1, 1, 0, 0, 2, 2]
score = adjusted_rand_score(labels_true, labels_pred)
print(round(score, 2))

A. 1.0

B. 0.0

C. -1.0

D. 0.5

❓ Model Choice

advanced

Choosing the Best Metric for Clustering with Unknown Labels

You have unlabeled data and want to evaluate your clustering algorithm's quality. Which metric is most appropriate?

A. Adjusted Rand Index

B. Silhouette Score

C. Normalized Mutual Information

D. Accuracy

❓ Metrics

advanced

Interpreting Davies-Bouldin Index Values

Which of the following statements about the Davies-Bouldin Index (DBI) is true?

A. Lower DBI values indicate better clustering with compact and well-separated clusters

B. Higher DBI values indicate better clustering quality

C. DBI values range from 0 to 1, where 1 is perfect clustering

D. DBI is only valid for binary clustering problems

🔧 Debug

expert

Identifying the Error in Cluster Evaluation Code

What happens when the following code is executed?

ML Python

from sklearn.metrics import silhouette_score
X = [[1, 2], [2, 3], [10, 10], [11, 11]]
labels = [0, 0, 1]
score = silhouette_score(X, labels)
print(score)

A. No error, prints a float score

B. TypeError: silhouette_score() missing required positional argument

C. IndexError: list index out of range

D. ValueError: Number of labels does not match number of samples

Practice

(1/5)

1. Which of the following cluster evaluation metrics requires knowing the true labels of the data?

easy

A. Davies-Bouldin Index

B. Silhouette Score

C. Adjusted Rand Index (ARI)

D. Calinski-Harabasz Index

Cluster evaluation metrics in ML Python - Practice Problems & Coding Challenges

Solution

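Step 1: Understand metric types

Internal metrics (Silhouette Score, Davies-Bouldin Index, Calinski-Harabasz Index) are computed from the data and the predicted cluster labels alone. External metrics compare the predicted labels against known true labels.

Step 2: Identify ARI as external metric

The Adjusted Rand Index measures agreement between the predicted labels and the true labels, so it cannot be computed without ground truth.

Final Answer:

C. Adjusted Rand Index (ARI)

Quick Check:

Of the four options, only ARI takes the true labels as an input.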
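
To make the "external metric" idea concrete, here is a minimal hand-rolled sketch of the Adjusted Rand Index computed from pair counts. It is not scikit-learn's implementation; the function name `adjusted_rand_index` and the example label lists are assumptions chosen for illustration. Note that the function cannot even be called without a list of true labels.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Illustrative ARI from pair counts (hypothetical helper, not sklearn's code)."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("labels_true and labels_pred must have the same length")

    total_pairs = comb(len(labels_true), 2)
    if total_pairs == 0:
        # Fewer than two points: nothing to disagree about.
        return 1.0

    # Contingency counts: how many points share each (true, predicted) label pair.
    pair_counts = Counter(zip(labels_true, labels_pred))
    true_counts = Counter(labels_true)
    pred_counts = Counter(labels_pred)

    # Number of point pairs that are grouped together under each view.
    together_both = sum(comb(n, 2) for n in pair_counts.values())
    together_true = sum(comb(n, 2) for n in true_counts.values())
    together_pred = sum(comb(n, 2) for n in pred_counts.values())

    # Agreement expected by chance, and the maximum attainable agreement.
    expected = together_true * together_pred / total_pairs
    maximum = (together_true + together_pred) / 2

    if maximum == expected:
        # Degenerate case, e.g. every point in a single cluster in both labelings.
        return 1.0
    return (together_both - expected) / (maximum - expected)


# Assumed example data: same grouping with swapped label names vs. a mixed grouping.
labels_true = [0, 0, 0, 1, 1, 1]
ari_relabelled = adjusted_rand_index(labels_true, [1, 1, 1, 0, 0, 0])
ari_mixed = adjusted_rand_index(labels_true, [0, 1, 0, 1, 0, 1])
print(ari_relabelled, ari_mixed)
```

The score depends only on which points are grouped together, not on the label names, so a relabelled but identical grouping scores as a perfect match, while a grouping that cuts across the true clusters can fall below zero.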

Solution

Step 1: Check import source

Step 2: Check function parameters

Final Answer:

Quick Check:

Solution

Step 1: Understand Davies-Bouldin Index meaning

Step 2: Calculate score using sklearn

Final Answer:

Quick Check:
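
The meaning of the Davies-Bouldin Index can be sketched by computing it by hand. The snippet below is a simplified illustration, not scikit-learn's `davies_bouldin_score`; the helper names and the two labelings are assumptions. For each cluster it takes the worst ratio of combined within-cluster scatter to between-centroid distance, then averages those ratios.

```python
from math import dist


def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return [sum(coords) / n for coords in zip(*points)]


def davies_bouldin(X, labels):
    """Illustrative DBI (hypothetical helper); assumes cluster centroids are distinct."""
    if len(X) != len(labels):
        raise ValueError("X and labels must have the same length")

    clusters = {}
    for point, label in zip(X, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    centers = {c: centroid(pts) for c, pts in clusters.items()}
    # Scatter: mean distance from each member to its own cluster centroid.
    scatter = {
        c: sum(dist(p, centers[c]) for p in pts) / len(pts)
        for c, pts in clusters.items()
    }

    worst_ratios = []
    for i in clusters:
        # Similarity to every other cluster: large when clusters are wide or close together.
        ratios = [
            (scatter[i] + scatter[j]) / dist(centers[i], centers[j])
            for j in clusters
            if j != i
        ]
        worst_ratios.append(max(ratios))
    return sum(worst_ratios) / len(worst_ratios)


X = [[1, 2], [2, 3], [10, 10], [11, 11]]
dbi_tight = davies_bouldin(X, [0, 0, 1, 1])  # nearby points grouped together
dbi_mixed = davies_bouldin(X, [0, 1, 0, 1])  # each cluster spans both regions
print(dbi_tight, dbi_mixed)
```

Because scatter sits in the numerator and centroid separation in the denominator, the compact, well-separated labeling yields the smaller value of the two.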

Solution

Step 1: Check input lengths

Step 2: Understand silhouette_score input requirements

Final Answer:

Quick Check:
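
The input-length requirement from the steps above can be illustrated with a hand-written silhouette function that checks its inputs before doing any arithmetic. This is a sketch under stated assumptions: `mean_silhouette` is a hypothetical helper, and the error wording is made up for the example and may not match the message scikit-learn prints.

```python
from math import dist


def mean_silhouette(X, labels):
    """Illustrative mean silhouette (hypothetical helper, Euclidean distance)."""
    if len(X) != len(labels):
        # One label per sample is required; otherwise points cannot be assigned.
        raise ValueError(
            f"X has {len(X)} samples but labels has {len(labels)} entries"
        )

    clusters = {}
    for point, label in zip(X, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for point, label in zip(X, labels):
        own = clusters[label]
        if len(own) == 1:
            # Convention: a point alone in its cluster scores 0.
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(dist(point, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(dist(point, q) for q in pts) / len(pts)
            for other, pts in clusters.items()
            if other != label
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


X = [[1, 2], [2, 3], [10, 10], [11, 11]]

try:
    mean_silhouette(X, [0, 0, 1])  # three labels for four samples
    error_message = None
except ValueError as exc:
    error_message = str(exc)

score = mean_silhouette(X, [0, 0, 1, 1])  # one label per sample
print(error_message)
print(score)
```

With one label per sample the two tight, distant groups give a score close to +1; with a missing label the function stops before computing anything.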

Solution

Step 1: Identify metrics that do not require true labels

Step 2: Understand other metrics need true labels

Final Answer:

Quick Check:
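
As an example of a metric that works without true labels, the Calinski-Harabasz Index can be sketched from the data and predicted labels alone. The code below is a hand-rolled illustration, not scikit-learn's `calinski_harabasz_score`; the helper names and labelings are assumptions, and it assumes the within-cluster dispersion is non-zero.

```python
def centroid(points):
    """Coordinate-wise mean of a list of points."""
    n = len(points)
    return [sum(coords) / n for coords in zip(*points)]


def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def calinski_harabasz(X, labels):
    """Illustrative CH index (hypothetical helper); needs only X and predicted labels."""
    if len(X) != len(labels):
        raise ValueError("X and labels must have the same length")

    clusters = {}
    for point, label in zip(X, labels):
        clusters.setdefault(label, []).append(point)

    n, k = len(X), len(clusters)
    if not 2 <= k < n:
        raise ValueError("need at least 2 clusters and fewer clusters than samples")

    overall = centroid(X)
    between = 0.0  # dispersion of cluster centroids around the overall mean
    within = 0.0   # dispersion of points around their own centroid
    for pts in clusters.values():
        center = centroid(pts)
        between += len(pts) * squared_distance(center, overall)
        within += sum(squared_distance(p, center) for p in pts)

    # Assumes within > 0, i.e. clusters are not all single repeated points.
    return (between / (k - 1)) / (within / (n - k))


X = [[1, 2], [2, 3], [10, 10], [11, 11]]
ch_tight = calinski_harabasz(X, [0, 0, 1, 1])  # nearby points grouped together
ch_mixed = calinski_harabasz(X, [0, 1, 0, 1])  # each cluster spans both regions
print(ch_tight, ch_mixed)
```

Unlike the Davies-Bouldin Index, higher is better here: the ratio grows as centroids move apart and clusters tighten, so the compact labeling scores far above the mixed one.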