Beruflich Dokumente
Kultur Dokumente
Clustering
ROCK Clustering Algorithm
Input:
This is done by selecting a random sample Li from each cluster Ci, then we
assign each point p to the cluster for which it has the strongest linkage with
Li.
As we said, we will consider the whole data in the process of forming clusters.
Computation of links:
1. using the similarity threshold , we can convert the similarity matrix into an
adjacency matrix (A)
2. Then we obtain a matrix indicating the number of links by calculating (A x
A ) , i.e., by multiplying the adjacency matrix A with itself
3. Suppose we have four verses contains some subjects , as follows:
P1={ judgment, faith, prayer, fair} {Milk, butter, bread, eggs}
P2={ fasting, faith, prayer}
P3={ fair, fasting, faith}
By multiplying the adjacency table with itself, we derive the following table
which shows the number of links (or common neighbors) :
we compute the goodness measure for all adjacent points ,assuming that
f() =1- / 1+
g ( Pi , Pj )
link [ Pi , Pj ]
1 2 f ( )
(n m)
n1 2 f ( ) m1 2 f ( )
etc.
Then we can obtain the following goodness measures for all adjacent
clusters:
It should be P3,P4= 0.45
g((P1,P2),P3)= 6 / (2+1)2.06 - 22.06 - 12.06 = 1.34 or 1.31
g((P1,P2),P4)= 3 / (2+1)2.06 - 22.06 - 12.06 = 0.67 or 0.66