Metric Considerations in Clustering: Implications for Algorithms

Sclove, Stanley L.

doi:10.1007/978-94-009-3977-6_10

Stanley L. Sclove⁶

Part of the book series: Theory and Decision Library ((TDLB,volume 8))

156 Accesses
1 Citations

Abstract

Given measurements on p variables for each of n individuals, aspects of the problem of clustering the individuals are considered. Special attention is given to models based upon mixtures of distributions, esp. multivariate normal distributions. The relationship between the orientation(s) of the clusters and the nature of the within-cluster covariance matrices is reviewed, as is the inadequacy of transformation to principal components based on the overall (total) covariance matrix of the whole (mixed) sample. The nature of certain iterative algorithms is discussed; variations which result from allowing different covariance matrices within clusters are studied.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Akaike, H. (1983). ‘Statistical Inference and Measurement of Entropy.’ In G.E.P. Box, T. Leonard, and C.-F. Wu (eds.), Scientific Inference, Data Analysis, and Robustness, 165–189. New York: Academic Press.
Google Scholar
Akaike, H.(1985). ‘Prediction and Entropy.’ In A.C. Atkinson and S.E. Fienberg (eds.), A Celebration of Statistics: the ISI Centenary Volume, 1–24. New York: Springer-Verlag.
Google Scholar
Anderson, E. (1935). ‘The Irises of the Gaspe Peninsula,’ Bulletin of the American Iris Society 59, 2–5.
Google Scholar
Anderson, T.W.(1984). An Introduct ion to Multivariate Statistical Analysis, 2nd ed. New York: John Wiley and Sons.
Google Scholar
Ball, G.H., and Hall, D.J.(1967). ‘A Clustering Technique for Summarizing Multivariate Data,’ Behavioral Science 12, 153–155.
Article Google Scholar
Bryant, P. and Williamson, J.A. (1978). ‘Asymptotic Behavior of Classification Maximum Likelihood Estimates,’ Biometrika 65, 273–281.
Article MATH Google Scholar
Chernoff, H. (1972). ‘Metric Considerations in Cluster Analysis,’ Proc. 6th Berkeley Symposium on Mathematical Statistics and Probability II, 621–630. Berkeley: University of California Press.
Google Scholar
Dixon, W.J., and Massey, F.J. (1969). Introduction to Statistical Analysis, 3rd ed. New York: McGraw-Hill.
Google Scholar
Fisher, R.A. (1936). ‘The Use of Multiple Measurements in Taxonomic Problems,’ Annals of Eugenics 7, 179–188.
Article Google Scholar
Johnson, R.A., and Wichern, D.W. (1982). Applied Multivariate Statistical Analysis. New York: Prentice Hall.
MATH Google Scholar
Kashyap, R.L. (1982). ‘Optimal Choice of AR and MA Parts in Autoregressive Moving Average Models, IEEE Transactions on Pattern Analys is and Machine Intelligence 4, 99–104.
Article MATH Google Scholar
MacQueen, J. (1966). ‘Some Methods for Classification and Analysis of Multivariate Observations.’ In Proc. 5th Berkeley Symposium on Mathematical Statistics and Probability I, 281–297. Berkeley: University of California Press.
Google Scholar
McLachlan, G.J. (1982). ‘The Classification and Mixture Maximum Likelihood Approaches to Cluster Analysis.’ In P.R. Krishnaiah and L.N. Kanal (eds.), Handbook of Statistics 2 (Classification, Pattern Recognition and Reduction of Dimensionality), 199–208. New York: North Holland.
Google Scholar
Marriott, F.H.C. (1975). ‘Separating Mixtures of Normal Distributions,’ Biometrics 31, 767–769.
Article MATH Google Scholar
Sclove, S.L.(1977). ‘Population Mixture Models and Clustering Algorithms,’ Communications in Statistics (A) 6, 417–434.
Article MathSciNet Google Scholar
Solomon, H. (1977). ‘Data Dependent Clustering Techniques,’ In J. Van Ryzin (ed.), Classification and Clustering, 155–174. New York: Academic Press.
Google Scholar
Van Ryzin, J., ed.(1977). Classification and Clustering. New York: Academic Press.
Google Scholar
Wolfe, J.H. (1970). ‘Pattern Clustering by Multivariate Mixture Analysis,’ Multivariate Behavi oral Research 5, 329–350.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Decision Sciences College of Business Administration m/c 294, University of Illinois at Chicago, Box 4348, Chicago, IL, 60680-4348, USA
Stanley L. Sclove

Authors

Stanley L. Sclove
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics, University of Virginia, Charlottesville, Virginia, USA
H. Bozdogan
Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, Ohio, USA
A. K. Gupta

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Sclove, S.L. (1987). Metric Considerations in Clustering: Implications for Algorithms. In: Bozdogan, H., Gupta, A.K. (eds) Multivariate Statistical Modeling and Data Analysis. Theory and Decision Library, vol 8. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3977-6_10

Download citation

DOI: https://doi.org/10.1007/978-94-009-3977-6_10
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-8264-8
Online ISBN: 978-94-009-3977-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics