Advertisement

An Intelligent Clustering Technique Based on Dual Scaling

  • Hans-Joachim Mucha

Summary

Methods of cluster analysis (unsupervised classification) can help you in order to “Finding groups in data”, so the suitable title of a book from Kaufman and Rousseeuw (1990). The intelligent clustering technique proposed here appears to be motivated by practical problems of analyzing mixed data. One way to deal with such problems is downgrading all data to the lowest scale level, that is, downgrading to categories without any information about their order. This new clustering technique based on dual scaling is presented in the simplest fashion possible for finding two groups (clusters) in data and for visualizing them, respectively. It is compared with well-known model-based clustering techniques. In the conclusion variations of improvement of the method are proposed.

Keywords

Misclassification Error Principal Component Analysis Plot Hierarchical Cluster Method Distance Score Time Standard Deviation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Banfield, J. D. and Raftery, A.E. (1993). Model-based Gaussian and non-Gaussian clustering. Biometrics, 49, 803–821.MathSciNetCrossRefMATHGoogle Scholar
  2. Bock, H. H. (1974). Automatische Klassifikation. Vandenhoeck & Ruprecht, Göttingen.MATHGoogle Scholar
  3. Breiman, L. (1996). Bias, variance, and arcing classifiers. Technical Report 460. Statistical Department, University of California, Berkeley.Google Scholar
  4. Fraley C. (1996). Algorithms for model-based Gaussian Hierarchical Clustering. Technical Report 311. Department of Statistics, University of Washington, Seattle.Google Scholar
  5. Hand, D. J. (1997). Intelligent data analysis: issues and opportunities. In: Advances in Intelligent Data Analysis. Liu, X. et al. (eds.), 1–14, Springer-Verlag, Heidelberg.Google Scholar
  6. Jain, A. K. and Dubes, R. C. (1988). Algorithms for clustering data. Prentice Hall, New Jersey.MATHGoogle Scholar
  7. Kaufman, L. and Rousseeuw, P. J. (1990). Finding groups in data. Wiley, New York.CrossRefGoogle Scholar
  8. Mucha, H.-J. (1992). Clusteranaly_se mit Mikrocomputern. Akademie Verlag, Berlin.Google Scholar
  9. Mucha, H.-J., Siegmund-Schultze, R., and Dübon, K. (1998). Adaptive cluster analysis techniques–software and applications. In: Data Science, Classification and Related Methods, Mucha, H.-J., Siegmund-Schultze, R., and Dübon, K. (eds.), 231–238, Springer-Verlag Tokyo.Google Scholar
  10. Nishisato, S. (1980). Analysis of Categorical Data: Dual Scaling and its Applications. University of Toronto Press, Toronto.MATHGoogle Scholar
  11. Nishisato, S. (1994). Elements of Dual Scaling: An Introduction to Practical Data Analysis. Lawrence Erlbaum Associates, Publishers, Hillsdale.Google Scholar
  12. Späth, H. (1985). Cluster dissection and analysis. Ellis Horwood, Chichester.MATHGoogle Scholar

Copyright information

© Springer Japan 2002

Authors and Affiliations

  • Hans-Joachim Mucha
    • 1
  1. 1.Weierstrass Institute for Applied Analysis and Stochastics (WIAS)BerlinGermany

Personalised recommendations