IF-CLARANS: Intuitionistic Fuzzy Algorithm for Big Data Clustering

Shili, Hechmi; Romdhane, Lotfi Ben

doi:10.1007/978-3-319-91476-3_4

Hechmi Shili^16,17 &
Lotfi Ben Romdhane¹⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 854))

Included in the following conference series:

International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems

1085 Accesses

Abstract

Clustering method is one of the most important and basic technique for data mining which aims to group a collection of samples into clusters based on similarity. Clustering Big datasets has always been a serious challenge due to its high dimensionality and complexity. In this paper, we propose a novel clustering algorithm which aims to introduce the concept of intuitionistic fuzzy set theory onto the framework of CLARANS for handling uncertainty in the context of mining Big datasets. We also suggest a new scalable approximation to compute the maximum number of neighbors. Our experimental evaluation on real data sets shows that the proposed algorithm can obtain satisfactory clustering results and outperforms other current methods. The clusters quality was evaluated by three well-known metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aboubi, Y., Drias, H., Kamel, N.: BAT-CLARA: BAT-inspired algorithm for clustering LARge applications. In: 8th IFAC Conference on Manufacturing Modelling, Management and Control, MIM 2016, vol. 49, pp. 243–248 (2016)
Article Google Scholar
Ng, R.T., Han, J.: CLARANS: a method for clustering objects for spatial data mining. IEEE Trans. Knowl. Data Eng. 14, 1003–1016 (2002)
Article Google Scholar
Lorbeer, B., et al.: Variations on the clustering algorithm BIRCH. Big Data Res. 2214–5796 (2017)
Google Scholar
Lathiya, P., Rani, R.: Improved CURE clustering for big data using Hadoop and Mapreduce. In: International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, pp. 1–5 (2016)
Google Scholar
Rezaee, B.: A cluster validity index for fuzzy clustering. Fuzzy Sets Syst. 161, 3014–3025 (2010)
Article MathSciNet Google Scholar
Dutta, M., Mahanta, A.K., Pujari, A.K.: QROCK: a quick version of the ROCK algorithm for clustering of categorical data. Pattern Recogn. Lett. 26, 2364–2373 (2005)
Article Google Scholar
Mahesh Kumar, K., Rama Mohan Reddy, A.: A density based algorithm for discovering clusters in large spatial databases with noise. Pattern Recogn. 58, 39–48 (2016)
Article Google Scholar
Ankerst, M., et al.: OPTICS: ordering points to identify clustering structure. In: Proceedings of ACM SIGMOD Conference on Management of Data. ACM Press, Philadelphia (1999)
Google Scholar
Saxena, A., Prasad, M., Gupta, A., Bharill, N., Patel, O.P., Tiwari, A., Er, M.J., Ding, W., Lin, C.-T.: A review of clustering techniques and developments. Neurocomputing 267, 664–681 (2017)
Article Google Scholar
Berkhin, P.: Survey of Clustering Data Mining techniques. Accrue Software Inc., San Jose (2000)
Google Scholar
Yu, H., Zhi, X., Fan, J.: Image segmentation based on weak fuzzy partition entropy. Neurocomputing 168, 994–1010 (2015)
Article Google Scholar
Bezdek, J.C. (ed.): Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
MATH Google Scholar
Zadeh, L.A.: Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans. Syst. Man Cybern. 3, 28–44 (1973)
Article MathSciNet Google Scholar
Deschrijver, G., Cornelis, C., Kerre, E.E.: On the representation of intuitionistic fuzzy t-norms and t-conorms. IEEE Trans. Fuzzy Syst. 12, 45–61 (2004)
Article Google Scholar
Yuan, X., Li, H., Zhang, C.: The theory of intuitionistic fuzzy sets based on the intuitionistic fuzzy special sets. Inf. Sci. 277, 284–298 (2014)
Article MathSciNet Google Scholar
Halkidi, M., Gunopulos, D., Vazirgiannis, M., et al.: A clustering framework based on subjective and objective validity criteria. ACM Trans. Knowl. Disc. Data 1(4), 1–25 (2008)
Article Google Scholar
Zhang, H.-M., Xu, Z.-S., Chen, Q.: On clustering approach to intuitionistic fuzzy sets. Control Decis. 22, 882 (2007)
MathSciNet MATH Google Scholar
Dhillon, I., Guan, Y., Kulis, B.: Kernel k-means: spectral clustering and normalized cuts. In: Proceeding of KDD, Proceedings of 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 551–556 (2004)
Google Scholar
Dhillon, I., Modha, D.: Concept decompositions for large sparse text data using clustering. Mach. Learn. 42(1–2), 143–175 (2001)
Article Google Scholar
de Amorim, R.C., Mirkin, B.: Minkowski metric, feature weighting and anomalous cluster initializing in k-means clustering. Pattern Recogn. 45(3), 1061–1075 (2012)
Article Google Scholar
Pelleg, D., Moore, A.W.: X-means: extending k-means with efficient estimation of the number of clusters. In: Proceedings of 17th International Conference on Machine Learning, pp. 727–734. Morgan Kaufmann (2000)
Google Scholar
Cai, X., Nie, F., Huang, H.: Multi-view k-means clustering on big data. In: Rossi, F. (ed.) Proceedings of 23rd International Joint Conference on Artificial Intelligence, IJCAI 2013. IJCAI/AAAI (2013)
Google Scholar
Mahesh Kumar, K., Rama Mohan Reddy, A.: An efficient k-means clustering filtering algorithm using density based initial cluster centers. Inf. Sci. 418, 286–301 (2017)
Article MathSciNet Google Scholar
Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic Theory and Applications. Prentice Hall of India Private Limited, New Delhi (2002)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Sciences, University of Monastir, Monastir, Tunisia
Hechmi Shili
Modeling of Automated Reasoning Systems (MARS), Research Laboratory LR17ES05, Higher Institute of Computer Science and Telecom (ISITCom), University of Sousse, Sousse, Tunisia
Hechmi Shili & Lotfi Ben Romdhane

Authors

Hechmi Shili
View author publications
You can also search for this author in PubMed Google Scholar
Lotfi Ben Romdhane
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hechmi Shili .

Editor information

Editors and Affiliations

Universidad de Cádiz, Cádiz, Cadiz, Spain
Jesús Medina
Universidad de Málaga, Málaga, Málaga, Spain
Manuel Ojeda-Aciego
Universidad de Granada, Granada, Spain
José Luis Verdegay
Universidad de Granada, Granada, Spain
David A. Pelta
Universidad de Málaga, Málaga, Málaga, Spain
Inma P. Cabrera
LIP6, Université Pierre et Marie Curie, CNRS, Paris, France
Bernadette Bouchon-Meunier
Iona College, New Rochelle, New York, USA
Ronald R. Yager

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shili, H., Romdhane, L.B. (2018). IF-CLARANS: Intuitionistic Fuzzy Algorithm for Big Data Clustering. In: Medina, J., et al. Information Processing and Management of Uncertainty in Knowledge-Based Systems. Theory and Foundations. IPMU 2018. Communications in Computer and Information Science, vol 854. Springer, Cham. https://doi.org/10.1007/978-3-319-91476-3_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-91476-3_4
Published: 18 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91475-6
Online ISBN: 978-3-319-91476-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics