Skip to main content

Enhancing Rough Clustering with Outlier Detection Based on Evidential Clustering

  • Conference paper
Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing (RSFDGrC 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8170))

Abstract

Soft clustering plays an important role in many real world applications. Fuzzy clustering, rough clustering, evidential clustering and many other approaches are used effectively to overcome the rigidness of crisp clustering. Each approach has its own unique features that set it apart from others. In this paper, we propose an enhanced rough clustering approach by combining the strengths of rough clustering and evidential clustering. The rough K-means algorithm is augmented with an ability to determine outliers in datasets using the concepts from the Evidential c-means algorithm. Different experiments are carried on various datasets and it is found that the modified rough K-means can effectively detect outliers with relatively smaller computational complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Joshi, A., Krishnapuram, R.: Robust fuzzy clustering methods to support web mining. In: Proc. Workshop in Data Mining and knowledge Discovery, SIGMOD, pp. 15–22 (1998)

    Google Scholar 

  2. Bezdek, J.C., Hathaway, R.J.: Optimization of fuzzy clustering criteria using genetic algorithms. In: International Conference on Evolutionary Computation, pp. 589–594 (1994)

    Google Scholar 

  3. Pedrycz, W., Waletzky, J.: Fuzzy clustering with partial supervision. IEEE Transactions on Systems, Man, and Cybernetics, Part B 27(5), 787–795 (1997)

    Article  Google Scholar 

  4. Lingras, P., West, C.: Interval set clustering of web users with rough k-means. Journal of Intelligent Information Systems 23, 5–16 (2004)

    Article  MATH  Google Scholar 

  5. Peters, G.: Some refinements of rough k-means clustering. Pattern Recognition 39(8), 1481–1491 (2006)

    Article  MATH  Google Scholar 

  6. Peters, J.F., Skowron, A., Suraj, Z., Rzasa, W., Borkowski, M.: Clustering: A rough set approach to constructing information granules. In: Soft Computing and Distributed Processing, pp. 57–61 (2002)

    Google Scholar 

  7. Masson, M., Denoeux, T.: Ecm: An evidential version of the fuzzy c-means algorithm. Pattern Recognition 41, 1384–1397 (2008)

    Article  MATH  Google Scholar 

  8. Dave, R.N.: Clustering relational data containing noise and outliers. Pattern Recogn. Lett. 12, 657–664 (1991)

    Article  Google Scholar 

  9. Hawkins, D.: Identification of outliers (1980)

    Google Scholar 

  10. Saad, M.F., Alimi, A.M.: Modified fuzzy possibilistic c-means. In: Proceedings of the International Multi Conference of Engineers and Computer Scientists (2009)

    Google Scholar 

  11. Eskin, E., Arnold, A., Prerau, M., Portnoy, L., Stolfo, S.: A geometric framework for unsupervised anomaly detection detecting intrusions in unlabeled data. Data Mining for Security Applications 19 (2002)

    Google Scholar 

  12. Mahoney, M.V., Chan, P.K.: Learning rules for anomaly detection of hostile network traffic. In: Proceedings of the 3rd IEEE International Conference on Data Mining, vol. 601. IEEE Computer Society (2003)

    Google Scholar 

  13. He, Z., Xu, X., Deng, S.: Discovering cluster-based local outliers 24, 1641–1650 (2003)

    Google Scholar 

  14. Varun, B.A., Vipin, K.: Anomaly detection: A survey. ACM Computing Surveys 41(3), 1641–1650 (2003)

    Google Scholar 

  15. Joshi, M., Lingras, P.: Evidential clustering or rough clustering: The choice is yours. In: Li, T., Nguyen, H.S., Wang, G., Grzymala-Busse, J., Janicki, R., Hassanien, A.E., Yu, H. (eds.) RSKT 2012. LNCS, vol. 7414, pp. 123–128. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  16. Hartigan, J.A., Wong, M.A.: Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics) 28(1), 100–108 (1979), http://dx.doi.org/10.2307/2346830

    MATH  Google Scholar 

  17. MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L., Neyman, J. (eds.) Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press (1967)

    Google Scholar 

  18. Lingras, P., Hogo, M., Snorek, M.: Interval set clustering of web users using modified kohonen self-organizing maps based on the properties of rough sets. In: Web Intelli. and Agent Sys., vol. 2 (August 2004)

    Google Scholar 

  19. Joshi, M., Lingras, P.: Evolutionary and iterative crisp and rough clustering ii: Experiments. In: Chaudhury, S., Mitra, S., Murthy, C.A., Sastry, P.S., Pal, S.K. (eds.) PReMI 2009. LNCS, vol. 5909, pp. 621–627. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  20. Lingras, P.: Evolutionary rough k-means clustering. In: Wen, P., Li, Y., Polkowski, L., Yao, Y., Tsumoto, S., Wang, G. (eds.) RSKT 2009. LNCS, vol. 5589, pp. 68–75. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  21. Lingras, P., Chen, M., Miao, D.: Precision of rough set clustering. In: Chan, C.-C., Grzymala-Busse, J.W., Ziarko, W.P. (eds.) RSCTC 2008. LNCS (LNAI), vol. 5306, pp. 369–378. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  22. Denoeux, T., Masson, M.: Evclus: Evidential clustering of proximity data. IEEE Transactions on Systems Man and Cybernetics 34(1), 95–109 (2004)

    Article  Google Scholar 

  23. Lingras, P., Chen, M., Miao, D.: Rough multi-category decision theoretic framework. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds.) RSKT 2008. LNCS (LNAI), vol. 5009, pp. 676–683. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  24. Windham, M.P.: Numerical classification of proximity data with assignment measures. Journal of Classification 2, 157–172 (1985)

    Article  Google Scholar 

  25. Frank, A., Asuncion, A.: UCI machine learning repository (2010), http://archive.ics.uci.edu/ml

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Joshi, M., Lingras, P. (2013). Enhancing Rough Clustering with Outlier Detection Based on Evidential Clustering. In: Ciucci, D., Inuiguchi, M., Yao, Y., Ślęzak, D., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing. RSFDGrC 2013. Lecture Notes in Computer Science(), vol 8170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41218-9_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41218-9_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41217-2

  • Online ISBN: 978-3-642-41218-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics