Abstract
This paper describes a validity index for fuzzy clustering: Pattern Distances Ratio (PDR) and a cluster number selection procedure using that index.
As other validity indices, solution presented in this paper may be used when a need for assessing of clustering or fuzzy clustering result adequacy arises. Most common example of such situation is when clustering algorithm that requires certain parameter, for example number of clusters, is selected but we lack a priori knowledge of this parameter and we would use educated guesses in concert with trial and error procedures. Validity index may allow to automate such process whenever it is necessary or convenient. In particular, it might ease incorporation of fuzzy clustering into more complex, intelligent systems.
The validity index presented in this paper might be seen as measuring the goodness of clustering of individual examples. When it is averaged over the clustered set, it bears some resemblance to the validity indices based on notions of compactness and separation. During experiments it was used as a cluster number selection criterion for fuzzy c-means. Those experiments showed that PDR can perform well in this role but a special selection procedure should be followed, instead of usual minimum search. The procedure is also described.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kruse, R., Döring, C., Lesot, M.J.: Fundamentals of fuzzy clustering. In: de Oliveira, J.V., Pedrycz, W. (eds.) Advances in Fuzzy Clustering and its Applications, pp. 3–30. John Wiley & Sons (April 2007)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms (Advanced Applications in Pattern Recognition). Springer (July 1981)
Pal, N.R., Bezdek, J.C.: On cluster validity for the fuzzy c-means model. IEEE Transactions on Fuzzy Systems 3(3), 370–379 (1995)
Brodowski, S., Podolak, I.T.: Hierarchical estimator. Expert Systems with Applications 38(10), 12237–12248 (2011)
Brodowski, S.: A validity criterion for fuzzy clustering. In: Jędrzejowicz, P., Nguyen, N.T., Hoang, K. (eds.) ICCCI 2011, Part I. LNCS, vol. 6922, pp. 113–122. Springer, Heidelberg (2011)
Höppner, F.: Fuzzy cluster analysis: methods for classification, data analysis, and image recognition. John Wiley (1999)
Akaike, H.: A new look at the statistical model identification. IEEE Transactions on Automatic Control 19(6), 716–723 (1974)
Bozdogan, H.: Model selection and akaike’s information criterion (aic): The general theory and its analytical extensions. Psychometrika 52, 345–370 (1987), doi:10.1007/BF02294361
Hu, X., Xu, L.: Investigation on several model selection criteria for determining the number of cluster. Neural Information Processing- Letters and Reviews 4 (2004)
Schwarz, G.: Estimating the dimension of a model. The Annals of Statistics 6(2), 461–464 (1978)
Cho, S.B., Yoo, S.H.: Fuzzy bayesian validation for cluster analysis of yeast cell-cycle data. Pattern Recognition 39(12), 2405–2414 (2006)
Wang, W., Zhang, Y.: On fuzzy cluster validity indices. Fuzzy Sets and Systems 158(19), 2095–2117 (2007)
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recognition 37(3), 487–501 (2004)
Wu, K.L., Yang, M.S.: A cluster validity index for fuzzy clustering. Pattern Recognition Letters 26(9), 1275–1291 (2005)
Rezaee, B.: A cluster validity index for fuzzy clustering. Fuzzy Sets and Systems 161(23), 3014–3025 (2010); Theme: Information processing
Tsekouras, G.E., Sarimveis, H.: A new approach for measuring the validity of the fuzzy c-means algorithm. Advances in Engineering Software 35(8-9), 567–575 (2004)
Kim, M., Ramakrishna, R.: New indices for cluster validity assessment. Pattern Recognition Letters 26(15), 2353–2363 (2005)
Dave, R.N.: Validating fuzzy partitions obtained through c-shells clustering. Pattern Recognition Letters 17(6), 613–623 (1996)
Kim, Y.I., Kim, D.W., Lee, D., Lee, K.H.: A cluster validation index for gk cluster analysis based on relative degree of sharing. Information Sciences 168(1-4), 225–242 (2004)
Yu, J., Li, C.X.: Novel cluster validity index for fcm algorithm. J. Comput. Sci. Technol. 21(1), 137–140 (2006)
Falasconi, M., Gutierrez, A., Pardo, M., Sberveglieri, G., Marco, S.: A stability based validity method for fuzzy clustering. Pattern Recognition 43(4), 1292–1305 (2010)
Wu, K.L., Yang, M.S., Hsieh, J.N.: Robust cluster validity indexes. Pattern Recognition 42(11), 2541–2550 (2009)
Asuncion, A., Newman, D.: UCI machine learning repository (2007)
Hartigan, J.A.: Modal Blocks in Dentition of West Coast Mammals. Systematic Biology 25(2), 149–160 (1976)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Brodowski, S. (2014). A Validity Criterion for Fuzzy Clustering. In: Nguyen, NT., Le-Thi, H.A. (eds) Transactions on Computational Intelligence XIII. Lecture Notes in Computer Science, vol 8342. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54455-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-54455-2_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54454-5
Online ISBN: 978-3-642-54455-2
eBook Packages: Computer ScienceComputer Science (R0)