
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/245507626

Clustering (Xu, R. and Wunsch, D.C.; 2008) [Book review]

Article  in  IEEE Pulse · July 2010


DOI: 10.1109/MPUL.2010.937237


1 author:

Edward Sazonov
University of Alabama



BOOK REVIEWS NEURO ENGINEERING

Four Books for the Scientist’s Library

Paul King

Clustering

Rui Xu and Donald C. Wunsch, II, Wiley-IEEE Press, 2008. ISBN-13: 978-0470276808, 358 pages, US$122.50.

The modern world is teeming with data. Computers enable us to collect, store, and transmit huge amounts of information almost effortlessly. The fields of biomedical engineering and bioinformatics are no exception. Advances in imaging, signal processing, and gene mapping techniques have created a plethora of data, much of which awaits analysis to uncover hidden information. Quite often, very little or no prior information about internal dependencies in the data is available, highlighting the need for unsupervised methods of information processing, such as clustering.

Clustering is a comprehensive book dedicated exclusively to the area of cluster analysis and finding similarities in data. The book focuses on the theory and practice of clustering and can appeal to a broad variety of readers. Applied mathematicians and computer scientists will find it useful as reference material, summarizing most of the current-day algorithms. Biomedical engineers and bioinformatics researchers will appreciate the large number of practical examples in gene data analysis, image segmentation, computer vision, and other applications. A substantial number of end-of-chapter problems make this book suitable as a textbook in graduate-level courses in science or engineering.

The book contains 11 chapters, starting from the introduction to the field of cluster analysis and proximity measures and progressing on to descriptions of advanced clustering techniques. The writing style is clear and easy to understand. Multiple illustrations and tables also contribute to the ease of reading. An extensive bibliography is a valuable resource to anyone seeking to find additional information on the subject.

A typical chapter starts with an introduction to a specific issue covered in that chapter. The introduction is followed by an extensive description of a wide variety of clustering algorithms. Clustering methodologies are described in their mathematical and algorithmic formulations and illustrated by flowcharts. Each chapter concludes with examples of applying the discussed techniques to the solution of real-world problems. These examples can make the book an invaluable resource to a beginner or a practitioner and allow for easy bridging between theory and practice.

The following is a brief overview of each chapter:

Chapter 1 introduces the reader to the basic concepts of cluster analysis, such as the definition of clusters and applications of clustering. Chapter 2 provides a review of different proximity measures for continuous, discrete, and mixed variables. Chapter 3 focuses on agglomerative and divisive hierarchical clustering. Illustrated application examples include gene expression data analysis and clustering of extensible markup language (XML) documents. Chapter 4 discusses techniques of partitional clustering. The chapter starts with formulations of the traditional k-means and expectation maximization algorithms, which are followed by more recent algorithms, such as fuzzy c-means and fuzzy c-shells. Another point of interest is the description of search-based clustering methods based on genetic algorithms, simulated annealing, and particle swarm optimization. Four different application examples highlight the use of partitional clustering in bioinformatics, computer vision, and medical diagnostics. Chapter 5 deals with the use of artificial neural networks in cluster analysis. A larger portion of this chapter is dedicated to various flavors of adaptive resonance theory, followed by a discussion of learning vector quantization, self-organizing maps, and neural gas. Application examples demonstrate the use of neural-network clustering in magnetic resonance image (MRI) segmentation, gene data analysis, and other applications. Chapter 6 progresses to a description of recent advances in kernel-based clustering and introduces such concepts as kernel principal component analysis, clustering with kernel functions, and support vector clustering. Chapter 7 introduces clustering of sequential data, in which the order in which observations are presented is as important as other features. A discussion of sequence similarity is followed by an introduction to the use of hidden Markov models and other model-based methods. Application examples here focus on genomic and biological sequence clustering. Chapter 8 discusses issues associated with clustering of very large data sets and describes random sampling, condensation-based, density-based, grid-based, and other methods, followed by several application examples. Chapter 9 illustrates application of clustering for visualization of high-dimensional data and discusses several linear (such as principal

Digital Object Identifier 10.1109/MPUL.2010.937237

74 IEEE PULSE ▼ JULY/AUGUST 2010


component analysis and independent component analysis) and nonlinear (such as nonlinear principal component analysis, multidimensional scaling, and isometric feature mapping) projection techniques. Application examples demonstrate practical use of these techniques in visualization. Chapter 10 focuses on the description of various methods for testing the validity of clustering results. Among the described methods are external, internal, and relative validity criteria and other approaches to establishing cluster validity. Chapter 11 concludes the book with the authors’ remarks.

Overall, the book clearly achieves its aim as a comprehensive reference on advanced cluster analysis. A plethora of practical examples makes it a good choice for anyone interested in applications of clustering techniques in processing of biomedical data.

Edward Sazonov
Clarkson University
Potsdam, New York

Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration

Violaine Prince and Mathieu Roche, IGI Global, 2009. ISBN: 978-1-60566-274-9. 432 pages, hardcover. US$225.00.

Natural language processing (NLP) is a subfield of computational sciences, addressing the operation and management of texts as inputs or outputs of computational devices. As such, this domain includes a large number of distinct topics depending on the particular service considered. In today’s Internet-based knowledge-sharing world, NLP is highly solicited by various scientific communities as a tremendous help for the following endeavors: information retrieval and knowledge extraction, knowledge integration into existing devices, and using and applying existing knowledge structures for services in information retrieval.

This book is written by subject matter experts, comprises six sections, and provides relevant theoretical frameworks and the latest empirical research findings in the area of BioNLP according to a linguistic granularity. As a critical mass of advanced knowledge, this book presents original applications, going beyond existing publications while opening up the road for a broader use of NLP in biomedicine. The chapter titled “Text Mining in Biomedicine” is an introduction written by Sophia Ananiadou, one of the most renowned leaders in BioNLP, who presents the grounding principles of NLP and emphasizes the needs, the requirements, the nature of the issues and their stakes, and the achievements of the state of the art. There are four core sections of the book: three that follow the NLP granularity scope, ranging from the lexical level in Section I to the sentence level in Section II, and to the discourse level in Section III, and one devoted to selected existing software tools (Section IV). Chapters belonging to these sections are numbered from II to XIX.

Section I, titled “Works at a Lexical Level: Crossroads Between NLP and Ontological Knowledge Management,” includes nine chapters and is by far the largest since the research on words, concepts, and word-to-word relations is well established and has reached a recognized maturity. The chapters are organized according to the following pattern: 1) using existing resources to perform document processing tasks, including indexation (Chapter II), categorization (Chapter III), and information retrieval (Chapter IV); indexation and categorization could be seen as prerequisites to intelligent information retrieval since they prestructure textual data according to topics, domain, keywords, or interest areas; 2) dealing with the cross-linguistic terminological problem: from a specialist’s language to general language within one tongue (Chapter V) or across different tongues (Chapter VI); 3) enriching terminology: the beginning of a strong lexical NLP involvement (Chapter VII); 4) increasing lexical NLP involvement in biomedical application (Chapter VIII).

Section II, titled “Going Beyond Words: NLP Approaches Involving the Sentence Level,” includes three chapters and concentrates on research that looks at words in context and investigates broader linguistic granularity to overcome ambiguity of terms. Chapters VII and VIII have shown the effects of ambiguity: intrinsic ambiguity (localized in ontologies themselves) and induced ambiguity detected in texts. The sentence context might erase the effects of lexical ambiguities in some cases. Chapter IX, named “Information Extraction of Protein Phosphorylation from Biomedical Literature,” presents a rule-based system to capture the lexical, syntactical, and semantic constraints found in sentences expressing phosphorylation information from MEDLINE abstracts, since isolated words or phrases could potentially be thematic clues but can by no means account for the different stages of a process. Chapter X, “CorTag: A Language for a Contextual Tagging of the Words Within Their Sentence,” complements the preceding one, presents a design to extract contextual knowledge from sentences, and deals with knowledge discovery and how to induce relations between concepts recognized in texts. Chapter XI, titled “Analyzing the Text of Clinical Literature for Question Answering,” introduces a method, a design, and a system to not only deal with complex information at the sentence level but also present argumentative articulation between fragments of sentences and intersection between the sentence-level and the discourse-level investigations. This chapter is also highly recommended to readers who need an accurate survey of current question answering systems.

Section III, titled “Pragmatics, Discourse Structures and Segment Level as the Last Stage in the NLP Offer to Biomedicine,” groups three chapters and presents works on mining techniques that focus more on structures than on words, on the use of neural-network architecture to improve retrieval, and the need in clinical domains, taking the reader from the theoretical to the practical applications of mining techniques. Chapter XII, “Discourse Processing for Text Mining,” presents discourse processing, the nature and effects of text structures, as well as the broad panel of intersentence relations

Digital Object Identifier 10.1109/MPUL.2010.937238



