Beruflich Dokumente
Kultur Dokumente
Manasi Datar
mpdatar@cc.usu.edu
April 22, 2004
Presentation Outline
• Core Concepts
– Segmentation
– Segmentation and clustering
– Self-Organizing Map (SOM)
– Clustering and SOM
– Segmentation and SOM
• My Idea :)
Segmentation of complex images using SOM for
clustering
– Approach: not novel… but applied to a new domain :)
– Application: pre-processing of images for knowledge
discovery
Segmentation
• What is it?
A process to distinguish meaningful objects in an
image from the background
• How is it done?
– threshold techniques
– edge-based methods
– region-based techniques
– connectivity-preserving relaxation-based methods
Segmentation Techniques
• Threshold techniques
– Make decisions based on local pixel information
– Are effective when the intensity levels of the objects
fall squarely outside the range of levels in the
background
– Spatial information is ignored
• Edge-based techniques
– Center around contour detection
– Weakness: connecting together broken contour lines
– Prone to failure in the presence of blurring
Segmentation Techniques…
contd
• Region-based Techniques
– Connected regions: grouping neighboring pixels of similar
intensity levels
– Region merging: adjacent regions are merged under some
criterion
• Overstringent criteria: fragmentation
• Lenient: blurred boundaries overlooked (overmerge)
• Segmentation as clustering
– represent an image in terms of clusters
of pixels that “belong together”
Clustering Algorithms
Make each point a separate Construct a single cluster containing
cluster all points
Until the clustering is satisfactory
Until the clustering is satisfactory Split the cluster that yields
Merge the two clusters the two components with the
with the smallest inter- cluster largest inter-cluster distance
distance End
End
Divisive
Agglomerative
Clustering for Segmentation
K-means algorithm
• A natural objective function can be obtained by assuming that we know there are
k clusters, where k is known.
• Each cluster is assumed to have a center; we write the center of the ith cluster as
ci. The i
th
element to be clustered is described by a feature vector xi
• We now assume that elements are close to the center of their cluster, yielding the
objective function
• Note: if the allocation of points to clusters is known, it is easy to compute the best
center for each cluster
Clustering for Segmentation…
contd
K-means Algorithm
• Choose k data points to act as cluster centers
• Until the cluster centers are unchanged
– Allocate each data point to cluster whose center is nearest
– Now ensure that every cluster has at least one data point
{possible techniques for doing this include supplying empty clusters with a
point chosen at random from points far from their cluster center}
• End
Clustering for Segmentation
• The winning processing element and its neighbors are allowed to adjust
their weights, while the remaining ones are left unchanged
Input Topologic
imag al map
e
SOM for clustering
• Properties of SOM
– Classification
– Visualization
Rectified
Corrections SOM
Landsat 7 image
Conventional
Classifier Classified image
Post
Classified image
processing
• Vesanto, J.: Data Mining Techniques Based on the Self-Organizing Map. MSc
Thesis, Helsinki University of Technology, Espoo, Finland (1997)
• Goldberg, M.; Shlien, S.: A clustering scheme for multi-spectral images. IEEE
Transactions on Systems, Man and Cybernetics SMC 8. (1978)
• Brown, D. A.; Craw I.; Lewthwaite, J.: Towards Core Image Processing with Self-
Organizing Maps. (2001)
• Mather P.: Computer Processing of Remotely Sensed Images, John Wiley &
Sons, Inc. New York, NY, USA (1999)
• Landsat data:
http://landsat.gsfc.nasa.gov/data/Browse/Cities/L7_CitiesGallery.html