Abstract
Soft clustering plays an important role in many real world applications. Fuzzy clustering, rough clustering, evidential clustering and many other approaches are used effectively to overcome the rigidness of crisp clustering. Each approach has its own unique features that set it apart from others. In this paper, we propose an enhanced rough clustering approach by combining the strengths of rough clustering and evidential clustering. The rough K-means algorithm is augmented with an ability to determine outliers in datasets using the concepts from the Evidential c-means algorithm. Different experiments are carried on various datasets and it is found that the modified rough K-means can effectively detect outliers with relatively smaller computational complexity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Joshi, A., Krishnapuram, R.: Robust fuzzy clustering methods to support web mining. In: Proc. Workshop in Data Mining and knowledge Discovery, SIGMOD, pp. 15–22 (1998)
Bezdek, J.C., Hathaway, R.J.: Optimization of fuzzy clustering criteria using genetic algorithms. In: International Conference on Evolutionary Computation, pp. 589–594 (1994)
Pedrycz, W., Waletzky, J.: Fuzzy clustering with partial supervision. IEEE Transactions on Systems, Man, and Cybernetics, Part B 27(5), 787–795 (1997)
Lingras, P., West, C.: Interval set clustering of web users with rough k-means. Journal of Intelligent Information Systems 23, 5–16 (2004)
Peters, G.: Some refinements of rough k-means clustering. Pattern Recognition 39(8), 1481–1491 (2006)
Peters, J.F., Skowron, A., Suraj, Z., Rzasa, W., Borkowski, M.: Clustering: A rough set approach to constructing information granules. In: Soft Computing and Distributed Processing, pp. 57–61 (2002)
Masson, M., Denoeux, T.: Ecm: An evidential version of the fuzzy c-means algorithm. Pattern Recognition 41, 1384–1397 (2008)
Dave, R.N.: Clustering relational data containing noise and outliers. Pattern Recogn. Lett. 12, 657–664 (1991)
Hawkins, D.: Identification of outliers (1980)
Saad, M.F., Alimi, A.M.: Modified fuzzy possibilistic c-means. In: Proceedings of the International Multi Conference of Engineers and Computer Scientists (2009)
Eskin, E., Arnold, A., Prerau, M., Portnoy, L., Stolfo, S.: A geometric framework for unsupervised anomaly detection detecting intrusions in unlabeled data. Data Mining for Security Applications 19 (2002)
Mahoney, M.V., Chan, P.K.: Learning rules for anomaly detection of hostile network traffic. In: Proceedings of the 3rd IEEE International Conference on Data Mining, vol. 601. IEEE Computer Society (2003)
He, Z., Xu, X., Deng, S.: Discovering cluster-based local outliers 24, 1641–1650 (2003)
Varun, B.A., Vipin, K.: Anomaly detection: A survey. ACM Computing Surveys 41(3), 1641–1650 (2003)
Joshi, M., Lingras, P.: Evidential clustering or rough clustering: The choice is yours. In: Li, T., Nguyen, H.S., Wang, G., Grzymala-Busse, J., Janicki, R., Hassanien, A.E., Yu, H. (eds.) RSKT 2012. LNCS, vol. 7414, pp. 123–128. Springer, Heidelberg (2012)
Hartigan, J.A., Wong, M.A.: Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics) 28(1), 100–108 (1979), http://dx.doi.org/10.2307/2346830
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L., Neyman, J. (eds.) Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press (1967)
Lingras, P., Hogo, M., Snorek, M.: Interval set clustering of web users using modified kohonen self-organizing maps based on the properties of rough sets. In: Web Intelli. and Agent Sys., vol. 2 (August 2004)
Joshi, M., Lingras, P.: Evolutionary and iterative crisp and rough clustering ii: Experiments. In: Chaudhury, S., Mitra, S., Murthy, C.A., Sastry, P.S., Pal, S.K. (eds.) PReMI 2009. LNCS, vol. 5909, pp. 621–627. Springer, Heidelberg (2009)
Lingras, P.: Evolutionary rough k-means clustering. In: Wen, P., Li, Y., Polkowski, L., Yao, Y., Tsumoto, S., Wang, G. (eds.) RSKT 2009. LNCS, vol. 5589, pp. 68–75. Springer, Heidelberg (2009)
Lingras, P., Chen, M., Miao, D.: Precision of rough set clustering. In: Chan, C.-C., Grzymala-Busse, J.W., Ziarko, W.P. (eds.) RSCTC 2008. LNCS (LNAI), vol. 5306, pp. 369–378. Springer, Heidelberg (2008)
Denoeux, T., Masson, M.: Evclus: Evidential clustering of proximity data. IEEE Transactions on Systems Man and Cybernetics 34(1), 95–109 (2004)
Lingras, P., Chen, M., Miao, D.: Rough multi-category decision theoretic framework. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds.) RSKT 2008. LNCS (LNAI), vol. 5009, pp. 676–683. Springer, Heidelberg (2008)
Windham, M.P.: Numerical classification of proximity data with assignment measures. Journal of Classification 2, 157–172 (1985)
Frank, A., Asuncion, A.: UCI machine learning repository (2010), http://archive.ics.uci.edu/ml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Joshi, M., Lingras, P. (2013). Enhancing Rough Clustering with Outlier Detection Based on Evidential Clustering. In: Ciucci, D., Inuiguchi, M., Yao, Y., Ślęzak, D., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2013. Lecture Notes in Computer Science(), vol 8170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41218-9_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-41218-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41217-2
Online ISBN: 978-3-642-41218-9
eBook Packages: Computer ScienceComputer Science (R0)