Beruflich Dokumente
Kultur Dokumente
Dissimilarity
K-Medoids
PAM CLARANS
CLARA
Spatial Data
What is Spatial Data?
● Draws a sample of the dataset and applies PAM on the sample in order to find the
medoids
● the algorithm can’t find the best solution if one of the best k-medoids is not among the
selected sample
● Improvement
○ select multiple samples
○ choose the sample(k-medoids) that has the lowest average dissimilarity of all objects in the
entire dataset
CLARANS - Randomized CLARA
Efficient and Effective - outperforms PAM Efficiency depends on the sample size
and CLARA
A good clustering on samples will not
Return higher quality clusters than PAM necessarily represent a good clustering of
and CLARA the whole data set if the sample is biased