A Possibilistic c-means Clustering Model with Cluster Size Estimation
Most c-means clustering models have serious difficulties when facing clusters of different sizes and severely outlier data. The possibilistic c-means (PCM) algorithm can handle both problems to some extent. However, its recommended initialization using a terminal partition produced by the probabilistic fuzzy c-means does not work when severe outliers are present. This paper proposes a possibilistic c-means clustering model that uses only two parameters independently of the number of clusters, which is able to correctly handle the above mentioned obstacles. Numerical evaluation involving synthetic and standard test data sets prove the advantages of the proposed clustering model.
KeywordsFuzzy c-means clustering Possibilistic c-means clustering Cluster size sensitivity Outlier sensitivity
- 1.Anderson, E.: The irises of the Gaspe Peninsula. Bull. Am. Iris Soc. 59, 2–5 (1935)Google Scholar
- 2.Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. http://archive.ics.uci.edu/ml/datasets.html
- 11.Miyamoto, S., Kurosawa, N.: Controlling cluster volume sizes in fuzzy \(c\)-means clustering. In: SCIS and ISIS, Yokohama, Japan, pp. 1–4 (2004)Google Scholar
- 12.Pal, N.R., Pal, K., Bezdek, J.C.: A mixed \(c\)-means clustering model. In: Proceedings of IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), pp. 11–21 (1997)Google Scholar