Skip to main content

Cluster Analysis and Multidimensional Scaling

  • Chapter
Applied Multivariate Analysis

Part of the book series: Springer Texts in Statistics ((STS))

  • 2836 Accesses

Abstract

Discriminant analysis is used to evaluate group separation and to develop rules for assigning observations to groups. Cluster analysis is concerned with group identification. The goal of cluster analysis is to partition a set of observations into a distinct number of unknown groups or clusters in such a manner that all observations within a group are similar, while observations in different groups are not similar. If data are represented as an n x p matrix Y = [y ij ] where

$$ \mathop Y\limits_{n \times p} = \left[ \begin{gathered} y'_1 \\ y'_i \\ \vdots \\ y'_n \\ \end{gathered} \right] $$

the goal of cluster analysis is to develop a classification scheme that will partition the rows of Y into k distinct groups (clusters). The rows of the matrix usually represent items or objects. To uncover the groupings in the data, a measure of nearness, also called a proximity measure needs to be defined. Two natural measures of nearness are the degree of distance or “dissimilarity” and the degree of association or “similarity” between groups. The choice of the proximity measure depends on the subject matter, scale of measurement (nominal, ordinal, interval, ratio), and type of variables (continuous, categorical) being analyzed. In many applications of cluster analysis, one begins with a proximity matrix rather than a data matrix. Given the proximity matrix of order (n x n) say, the entries may represent dissimilarities [d rs ] or similarities [s rs ] between the rth and sth objects. Cluster analysis is a tool for classifying objects into groups and is not concerned with the geometric representation of the objects in a low-dimensional space. To explore the dimensionality of the space, one may use multidimensional scaling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag New York, Inc.

About this chapter

Cite this chapter

(2002). Cluster Analysis and Multidimensional Scaling. In: Timm, N.H. (eds) Applied Multivariate Analysis. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-22771-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-22771-9_9

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-95347-2

  • Online ISBN: 978-0-387-22771-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics