Abstract
Determining the number of clusters is a crucial problem in cluster analysis. Cluster validity measures are one way to try to find the optimum number of clusters, especially for prototype-based clustering. However, no validity measure turns out to work well in all cases. In this paper, we propose an approach to determine the number of cluster based on the minimum description length principle which does not need high computational costs and is also applicable in the context of fuzzy clustering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rissanen, J.: Modeling by shortest data description. Automatica 14, 445–471 (1978)
Gruenwald, P.D.: The Minimum Description Length Principle. The MIT Press, Cambridge (2007)
Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Höppner, F., Klawonn, F., Kruse, R., Runkler, T.: Fuzzy Cluster Analysis. John Wiley& Sons, Chichester (1999)
Babuska, R.: Fuzzy Modeling for Control. Kluwer Academic Publishers, Boston (1998)
Böhm, C., Goebl, S., Oswald, A., Plant, C., Plavinski, M., Wackersreuther, B.: Integrative parameter-free clustering of data with mixed type attributes. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS, vol. 6118, pp. 38–47. Springer, Heidelberg (2010)
Kyrgyzov, I.O., Kyrgyzov, O.O., Maître, H., Campedel, M.: Kernel mdl to determine the number of clusters. In: Perner, P. (ed.) MLDM 2007. LNCS (LNAI), vol. 4571, pp. 203–217. Springer, Heidelberg (2007)
Banerjee, A., Langford, J.: An objective evaluation criterion for clustering. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2004, pp. 515–520. ACM, New York (2004)
Navarro, D., Lee, M.: An application of minimum description length clustering to partitioning learning curves. In: Proceedings of the 2005 IEEE International Symposium on Information Theory (2005)
Gruenwald, P.D., Myung, I., Pitt, M.A.: Advances in minimum description length: theory and applications. The MIT Press, Cambridge, Massachusetts (2005)
Hu, X., Xu, L.: Investigation on several model selection criteria for determining the number of cluster. Neural Inform. Proces. - Lett. and Reviews 4 (2004)
Berthold, M., Borgelt, C., Höppner, F., Klawonn, F.: Guide to Intelligent Data Analysis: How to Intelligently Make Sense of Real Data. Springer, London (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Georgieva, O., Tschumitschew, K., Klawonn, F. (2011). Cluster Validity Measures Based on the Minimum Description Length Principle. In: König, A., Dengel, A., Hinkelmann, K., Kise, K., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2011. Lecture Notes in Computer Science(), vol 6881. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23851-2_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-23851-2_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23850-5
Online ISBN: 978-3-642-23851-2
eBook Packages: Computer ScienceComputer Science (R0)