DBSCAN is an advanced clustering algorithm. Why can it find clusters with complex shapes?
Think about how DBSCAN decides which points belong together.
DBSCAN groups points that are close together in dense regions, allowing it to find clusters of any shape, unlike methods that rely on distance to a center.
What is the shape of the clusters found by spectral clustering on a two-moon shaped dataset?
from sklearn.datasets import make_moons from sklearn.cluster import SpectralClustering import numpy as np X, _ = make_moons(n_samples=200, noise=0.05) model = SpectralClustering(n_clusters=2, affinity='nearest_neighbors', assign_labels='kmeans') labels = model.fit_predict(X) unique_labels = np.unique(labels) len(unique_labels)
How many clusters does the model try to find?
Spectral clustering correctly finds 2 clusters matching the two moons shape.
You have data with clusters shaped like spirals. Which clustering model is best to find these complex shapes?
Think about which method can find clusters based on density rather than shape.
DBSCAN can find clusters of arbitrary shape by grouping dense regions, making it suitable for spiral clusters.
In DBSCAN, what happens if you set the epsilon (eps) parameter too high?
Think about how increasing the neighborhood radius affects grouping.
Setting eps too high causes points to be considered neighbors over large distances, merging clusters and reducing their number.
You applied an advanced clustering algorithm that finds complex-shaped clusters. Which metric best evaluates the quality of these clusters without knowing true labels?
Consider metrics that measure cluster cohesion and separation without labels.
Silhouette score measures how well clusters are separated and how tight they are, useful when true labels are unknown.