Skip to main content

Metric Considerations in Clustering: Implications for Algorithms

  • Chapter
Multivariate Statistical Modeling and Data Analysis

Part of the book series: Theory and Decision Library ((TDLB,volume 8))

Abstract

Given measurements on p variables for each of n individuals, aspects of the problem of clustering the individuals are considered. Special attention is given to models based upon mixtures of distributions, esp. multivariate normal distributions. The relationship between the orientation(s) of the clusters and the nature of the within-cluster covariance matrices is reviewed, as is the inadequacy of transformation to principal components based on the overall (total) covariance matrix of the whole (mixed) sample. The nature of certain iterative algorithms is discussed; variations which result from allowing different covariance matrices within clusters are studied.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Akaike, H. (1983). ‘Statistical Inference and Measurement of Entropy.’ In G.E.P. Box, T. Leonard, and C.-F. Wu (eds.), Scientific Inference, Data Analysis, and Robustness, 165–189. New York: Academic Press.

    Google Scholar 

  • Akaike, H.(1985). ‘Prediction and Entropy.’ In A.C. Atkinson and S.E. Fienberg (eds.), A Celebration of Statistics: the ISI Centenary Volume, 1–24. New York: Springer-Verlag.

    Google Scholar 

  • Anderson, E. (1935). ‘The Irises of the Gaspe Peninsula,’ Bulletin of the American Iris Society 59, 2–5.

    Google Scholar 

  • Anderson, T.W.(1984). An Introduct ion to Multivariate Statistical Analysis, 2nd ed. New York: John Wiley and Sons.

    Google Scholar 

  • Ball, G.H., and Hall, D.J.(1967). ‘A Clustering Technique for Summarizing Multivariate Data,’ Behavioral Science 12, 153–155.

    Article  Google Scholar 

  • Bryant, P. and Williamson, J.A. (1978). ‘Asymptotic Behavior of Classification Maximum Likelihood Estimates,’ Biometrika 65, 273–281.

    Article  MATH  Google Scholar 

  • Chernoff, H. (1972). ‘Metric Considerations in Cluster Analysis,’ Proc. 6th Berkeley Symposium on Mathematical Statistics and Probability II, 621–630. Berkeley: University of California Press.

    Google Scholar 

  • Dixon, W.J., and Massey, F.J. (1969). Introduction to Statistical Analysis, 3rd ed. New York: McGraw-Hill.

    Google Scholar 

  • Fisher, R.A. (1936). ‘The Use of Multiple Measurements in Taxonomic Problems,’ Annals of Eugenics 7, 179–188.

    Article  Google Scholar 

  • Johnson, R.A., and Wichern, D.W. (1982). Applied Multivariate Statistical Analysis. New York: Prentice Hall.

    MATH  Google Scholar 

  • Kashyap, R.L. (1982). ‘Optimal Choice of AR and MA Parts in Autoregressive Moving Average Models, IEEE Transactions on Pattern Analys is and Machine Intelligence 4, 99–104.

    Article  MATH  Google Scholar 

  • MacQueen, J. (1966). ‘Some Methods for Classification and Analysis of Multivariate Observations.’ In Proc. 5th Berkeley Symposium on Mathematical Statistics and Probability I, 281–297. Berkeley: University of California Press.

    Google Scholar 

  • McLachlan, G.J. (1982). ‘The Classification and Mixture Maximum Likelihood Approaches to Cluster Analysis.’ In P.R. Krishnaiah and L.N. Kanal (eds.), Handbook of Statistics 2 (Classification, Pattern Recognition and Reduction of Dimensionality), 199–208. New York: North Holland.

    Google Scholar 

  • Marriott, F.H.C. (1975). ‘Separating Mixtures of Normal Distributions,’ Biometrics 31, 767–769.

    Article  MATH  Google Scholar 

  • Sclove, S.L.(1977). ‘Population Mixture Models and Clustering Algorithms,’ Communications in Statistics (A) 6, 417–434.

    Article  MathSciNet  Google Scholar 

  • Solomon, H. (1977). ‘Data Dependent Clustering Techniques,’ In J. Van Ryzin (ed.), Classification and Clustering, 155–174. New York: Academic Press.

    Google Scholar 

  • Van Ryzin, J., ed.(1977). Classification and Clustering. New York: Academic Press.

    Google Scholar 

  • Wolfe, J.H. (1970). ‘Pattern Clustering by Multivariate Mixture Analysis,’ Multivariate Behavi oral Research 5, 329–350.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1987 D. Reidel Publishing Company, Dordrecht, Holland

About this chapter

Cite this chapter

Sclove, S.L. (1987). Metric Considerations in Clustering: Implications for Algorithms. In: Bozdogan, H., Gupta, A.K. (eds) Multivariate Statistical Modeling and Data Analysis. Theory and Decision Library, vol 8. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3977-6_10

Download citation

  • DOI: https://doi.org/10.1007/978-94-009-3977-6_10

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-8264-8

  • Online ISBN: 978-94-009-3977-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics