Cluster Analysis and Multidimensional Scaling

doi:10.1007/978-0-387-22771-9_9

Part of the book series: Springer Texts in Statistics ((STS))

2836 Accesses

Abstract

Discriminant analysis is used to evaluate group separation and to develop rules for assigning observations to groups. Cluster analysis is concerned with group identification. The goal of cluster analysis is to partition a set of observations into a distinct number of unknown groups or clusters in such a manner that all observations within a group are similar, while observations in different groups are not similar. If data are represented as an n x p matrix Y = [y_ij] where

$$ \mathop Y\limits_{n \times p} = \left[ \begin{gathered} y'_1 \\ y'_i \\ \vdots \\ y'_n \\ \end{gathered} \right] $$

the goal of cluster analysis is to develop a classification scheme that will partition the rows of Y into k distinct groups (clusters). The rows of the matrix usually represent items or objects. To uncover the groupings in the data, a measure of nearness, also called a proximity measure needs to be defined. Two natural measures of nearness are the degree of distance or “dissimilarity” and the degree of association or “similarity” between groups. The choice of the proximity measure depends on the subject matter, scale of measurement (nominal, ordinal, interval, ratio), and type of variables (continuous, categorical) being analyzed. In many applications of cluster analysis, one begins with a proximity matrix rather than a data matrix. Given the proximity matrix of order (n x n) say, the entries may represent dissimilarities [d_rs] or similarities [s_rs] between the r^th and s^th objects. Cluster analysis is a tool for classifying objects into groups and is not concerned with the geometric representation of the objects in a low-dimensional space. To explore the dimensionality of the space, one may use multidimensional scaling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Editor information

Editors and Affiliations

Department of Education in Psychology School of Education, University of Pittsburgh, Pittsburgh, PA, 15260
Neil H. Timm (Professor) (Professor)

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

(2002). Cluster Analysis and Multidimensional Scaling. In: Timm, N.H. (eds) Applied Multivariate Analysis. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-22771-9_9

Download citation

DOI: https://doi.org/10.1007/978-0-387-22771-9_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-95347-2
Online ISBN: 978-0-387-22771-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics