Abstract
In the field of cluster analysis, the fuzzy k-means, k-modes and k-prototypes algorithms were designed for numerical, categorical and mixed data sets respectively. However, all the above algorithms assume that each feature of the samples plays an uniform contribution for cluster analysis. To consider the particular contributions of different features, a novel feature weighted fuzzy clustering algorithm is proposed in this paper, in which the ReliefF algorithm is used to assign the weights for every feature. By weighting the features of samples, the above three clustering algorithms can be unified, and better classification results can be also achieved. The experimental results with various real data sets illustrate the effectiveness of the proposed algorithm.
This work was supported by National Natural Science Foundation of China (No.60202004) and the Key Project of Chinese Ministry of Education (No.104173).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
He, Q.: Advance of the theory and application of fuzzy clustering analysis. Fuzzy System and Fuzzy Mathematics 12(2), 89–94 (1998) (In Chinese)
Huang, Z., Ng, M.K.: A fuzzy k-modes algorithm for clustering categorical data. IEEE Trans. on Fuzzy Systems 7(4), 446–452 (1999)
Huang, Z.: A fast clustering algorithm to cluster very large categorical data sets in data Mining. In: Proceedings of the SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, Dept. of Computer Science, The University of British Columbia, Canada, pp. 1–8 (1997)
Kononenko, I.: Estimating attributes: Analysis and extensions of Relief. In: Proceedings of the 7th European Conference on Machine Learning, pp. 171–182. Springer, Berlin (1994)
Kira, K., Rendell, L.A.: A practical approach to feature selection. In: Proceedings of the 9th International Workshop on Machine Leaning, San Francisco, pp. 249–256 (1992)
Li, J., Gao, X., Jiao, L.: A CSA-Based Clustering Algorithm for Large Data Sets With Mixed Numeric and Categorical Values. Acta Electronica Sinica 32(3), 357–362 (2004)
Duda, R.O., Hart, P.E.: Pattern classification and scene analysis, New York (1973)
Hathaway, R.J., Bezdek, J.C.: Nerf C-means: Non-Euclidean relation fuzzy clustering. Pattern recognition 27(3), 429–437 (1994)
Michalski, R.S., Stepp, R.E.: Automated construction of classifications: Conceptual clustering versus numerical taxonomy. IEEE Trans. on PAMI (5), 396–410 (1983)
Jollois, F.X., Nadif, M.: Clustering large categorical data. In: Advances in Knowedge Discovery and Data Mining, pp. 257–263. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, J., Gao, X., Jiao, L. (2005). A New Feature Weighted Fuzzy Clustering Algorithm. In: Ślęzak, D., Wang, G., Szczuka, M., Düntsch, I., Yao, Y. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2005. Lecture Notes in Computer Science(), vol 3641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11548669_43
Download citation
DOI: https://doi.org/10.1007/11548669_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28653-0
Online ISBN: 978-3-540-31825-5
eBook Packages: Computer ScienceComputer Science (R0)