Skip to main content

Cluster Validity Measures Based on the Minimum Description Length Principle

  • Conference paper
Knowledge-Based and Intelligent Information and Engineering Systems (KES 2011)

Abstract

Determining the number of clusters is a crucial problem in cluster analysis. Cluster validity measures are one way to try to find the optimum number of clusters, especially for prototype-based clustering. However, no validity measure turns out to work well in all cases. In this paper, we propose an approach to determine the number of cluster based on the minimum description length principle which does not need high computational costs and is also applicable in the context of fuzzy clustering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rissanen, J.: Modeling by shortest data description. Automatica 14, 445–471 (1978)

    Article  MATH  Google Scholar 

  2. Gruenwald, P.D.: The Minimum Description Length Principle. The MIT Press, Cambridge (2007)

    Google Scholar 

  3. Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)

    Book  MATH  Google Scholar 

  4. Höppner, F., Klawonn, F., Kruse, R., Runkler, T.: Fuzzy Cluster Analysis. John Wiley& Sons, Chichester (1999)

    MATH  Google Scholar 

  5. Babuska, R.: Fuzzy Modeling for Control. Kluwer Academic Publishers, Boston (1998)

    Book  Google Scholar 

  6. Böhm, C., Goebl, S., Oswald, A., Plant, C., Plavinski, M., Wackersreuther, B.: Integrative parameter-free clustering of data with mixed type attributes. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS, vol. 6118, pp. 38–47. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  7. Kyrgyzov, I.O., Kyrgyzov, O.O., Maître, H., Campedel, M.: Kernel mdl to determine the number of clusters. In: Perner, P. (ed.) MLDM 2007. LNCS (LNAI), vol. 4571, pp. 203–217. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  8. Banerjee, A., Langford, J.: An objective evaluation criterion for clustering. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2004, pp. 515–520. ACM, New York (2004)

    Chapter  Google Scholar 

  9. Navarro, D., Lee, M.: An application of minimum description length clustering to partitioning learning curves. In: Proceedings of the 2005 IEEE International Symposium on Information Theory (2005)

    Google Scholar 

  10. Gruenwald, P.D., Myung, I., Pitt, M.A.: Advances in minimum description length: theory and applications. The MIT Press, Cambridge, Massachusetts (2005)

    Google Scholar 

  11. Hu, X., Xu, L.: Investigation on several model selection criteria for determining the number of cluster. Neural Inform. Proces. - Lett. and Reviews 4 (2004)

    Google Scholar 

  12. Berthold, M., Borgelt, C., Höppner, F., Klawonn, F.: Guide to Intelligent Data Analysis: How to Intelligently Make Sense of Real Data. Springer, London (2010)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Georgieva, O., Tschumitschew, K., Klawonn, F. (2011). Cluster Validity Measures Based on the Minimum Description Length Principle. In: König, A., Dengel, A., Hinkelmann, K., Kise, K., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2011. Lecture Notes in Computer Science(), vol 6881. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23851-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23851-2_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23850-5

  • Online ISBN: 978-3-642-23851-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics