Abstract
Discriminant analysis is used to evaluate group separation and to develop rules for assigning observations to groups. Cluster analysis is concerned with group identification. The goal of cluster analysis is to partition a set of observations into a distinct number of unknown groups or clusters in such a manner that all observations within a group are similar, while observations in different groups are not similar. If data are represented as an n x p matrix Y = [y ij ] where
the goal of cluster analysis is to develop a classification scheme that will partition the rows of Y into k distinct groups (clusters). The rows of the matrix usually represent items or objects. To uncover the groupings in the data, a measure of nearness, also called a proximity measure needs to be defined. Two natural measures of nearness are the degree of distance or “dissimilarity” and the degree of association or “similarity” between groups. The choice of the proximity measure depends on the subject matter, scale of measurement (nominal, ordinal, interval, ratio), and type of variables (continuous, categorical) being analyzed. In many applications of cluster analysis, one begins with a proximity matrix rather than a data matrix. Given the proximity matrix of order (n x n) say, the entries may represent dissimilarities [d rs ] or similarities [s rs ] between the rth and sth objects. Cluster analysis is a tool for classifying objects into groups and is not concerned with the geometric representation of the objects in a low-dimensional space. To explore the dimensionality of the space, one may use multidimensional scaling.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag New York, Inc.
About this chapter
Cite this chapter
(2002). Cluster Analysis and Multidimensional Scaling. In: Timm, N.H. (eds) Applied Multivariate Analysis. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-22771-9_9
Download citation
DOI: https://doi.org/10.1007/978-0-387-22771-9_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-95347-2
Online ISBN: 978-0-387-22771-9
eBook Packages: Springer Book Archive