Population-based bio-inspired algorithms for cluster ensembles optimization

Abstract

Clustering algorithms have been applied to different problems in many different real-word applications. Nevertheless, each algorithm has its own advantages and drawbacks, which can result in different solutions for the same problem. Therefore, the combination of different clustering algorithms (cluster ensembles) has emerged as an attempt to overcome the limitations of each clustering technique. The use of cluster ensembles aims to combine multiple partitions generated by different clustering algorithms into a single clustering solution (consensus partition). Recently, several approaches have been proposed in the literature in order to optimize or to improve continuously the solutions found by the cluster ensembles. As a contribution to this important subject, this paper presents an investigation of five bio-inspired techniques in the optimization of cluster ensembles (Genetic Algorithms, Particle Swarm Optimization, Ant Colony Optimization, Coral Reefs Optimization and Bee Colony Optimization). In this investigation, unlike most of the existing work, an evaluation methodology for assessing three important aspects of cluster ensembles will be presented, assessing robustness, novelty and stability of the consensus partition delivered by different optimization algorithms. In order to evaluate the feasibility of the analyzed techniques, an empirical analysis will be conducted using 20 different problems and applying two different indexes in order to examine its efficiency and feasibility. Our findings indicated that the best population-based optimization method was PSO, followed by CRO, AG, BCO and ACO, for providing robust and stable consensus partitions.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2

References

  1. Aggarwal CC, Reddy CK (2013) Data clustering: algorithms and applications. Chapman & Hall/CRC, Boca Raton

    Book  Google Scholar 

  2. Asuncin A, Newman DJ (2012) UCI machine learning repository. http://ics.uci.edu/~mlearn/MLRepository.html

  3. Azimi J, Cull P, Fern X (2009) Clustering ensembles using ants algorithm. Springer, Berlin, pp 295–304

    Google Scholar 

  4. Chatterjee S, Mukhopadhyay A (2013) Clustering ensemble: a multiobjective genetic algorithm based approach. In: International conference on computational intelligence: modeling, techniques and applications (CIMTA), pp 443–449

  5. Dorigo M (1992) Optimization, learning and natural algorithms

  6. Esmin AAA, Coelho RA (2013) Consensus clustering based on particle swarm optimization algorithm. In: 2013 IEEE international conference on systems, man, and cybernetics, pp 2280–2285. IEEE

  7. Everitt BS, Landau S, Leese M, Stahl D (2011) Optimization clustering techniques. Wiley, London, pp 111–142. https://doi.org/10.1002/9780470977811.ch5

    Book  Google Scholar 

  8. Faceli K, Lorena AC, Gama J, de Leon Ferreira de Carvalho ACP (2011) Artificial intelligence: a machine learning approach (in portuguese). LTC, Rio de Janeiro

  9. Fred A, Lourenço A (2008) Cluster ensemble methods: from single clusterings to combined solutions. Springer, Berlin, pp 3–30

    Google Scholar 

  10. Ghaemi R, Sulaiman Nb, Ibrahim H, Mustapha N (2011) A review: accuracy optimization in clustering ensembles using genetic algorithms. Artif Intell Rev 35(4):287–318

    Article  Google Scholar 

  11. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explorations 11(1)

  12. Holland JH (1992) Genetic algorithms. Sci Am 267(1):66–72

    Article  Google Scholar 

  13. Hu J, Li T, Wang H, Fujita H (2016) Hierarchical cluster ensemble model based on knowledge granulation. Knowl-Based Syst 91(C):179–188

    Article  Google Scholar 

  14. Hutter F, Hoos HH, Leyton-Brown K, Stützle T (2009) Paramils: an automatic algorithm configuration framework. J Artif Int Res 36(1):267–306

    MATH  Google Scholar 

  15. José-García A, Gómez-Flores W (2016) Automatic clustering using nature-inspired metaheuristics. Appl Soft Comput 41(C):192–213

    Article  Google Scholar 

  16. Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. Technical Report TR06, Erciyes University

  17. Karaboga D, Basturk B (2008) On the performance of artificial bee colony (abc) algorithm. Appl Soft Comput 8(1):687–697

    Article  Google Scholar 

  18. Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Neural networks, 1995. Proceedings, IEEE international conference on, vol 4, pp 1942–1948

  19. Kuncheva LI (2004) Combining pattern classifiers: methods and algorithms. Wiley, New Jersey

    Book  Google Scholar 

  20. Medeiros IG, Xavier-Jnior JC, Canuto AMP (2015) Applying the coral reefs optimization algorithm to clustering problems. In: International joint conference on neural networks (IJCNN). Proceedings of international joint conference on neural networks (IJCNN) vol 1, pp 1–8

  21. Nisha MN, Mohanavalli S, Swathika R (2013) Improving the quality of clustering using cluster ensembles. In: Proceedings of 2013 IEEE conference on information and communication technologies (ICT 2013), pp 88–92. IEEE

  22. Salcedo-Sanz S, Gallo-Marazuela D, Pastor-Sánchez A, Carro-Calvo L, Portilla-Figueras A, Prieto L (2014) Offshore wind farm design with the coral reefs optimization algorithm. Renew Energy 63:109–115

    Article  Google Scholar 

  23. Salcedo-Sanz S, Casanova-Mateo C, Pastor-Sánchez A, Sánchez-Girn M (2014) Daily global solar radiation prediction based on a hybrid coral reefs optimization 17 extreme learning machine approach. Sol Energy 105:91–98

    Article  Google Scholar 

  24. Salcedo-Sanz S, García-Díaz P, Portilla-Figueras JA, Ser JD, Gil-López S (2014) A coral reefs optimization algorithm for mobile network optimal deployment with electromagnetic pollution control criterion. Appl Soft Comput 24:239–248

    Article  Google Scholar 

  25. Salcedo-Sanz S, Pastor-Sánchez A, Ser JD, Prieto L, Geem Z (2015) A coral reefs optimization algorithm with harmony search operators for accurate wind speed prediction. Renew Energy 75:93–101

    Article  Google Scholar 

  26. Salcedo-Sanz S, Ser JD, Landa-Torres I, Gil-López S, Portilla-Figueras JA (2014) The coral reefs optimization algorithm: a novel metaheuristic for efficiently solving optimization problems, vol 2014. Sci World J

  27. Silva HM, Canuto AMP, Medeiros IG, Xavier-Júnior JC (2016) A bio-inspired optimization technique for cluster ensembles optimization. In: The 5th Brazilian conference on intelligent system (BRACIS). IEEE

  28. Simon D (2013) Evolutionary optimization algorithms: biologically inspired and population-based approaches to computer intelligence

  29. Sulaiman N, Ghaemi R, Ibrahim H, Mustapha N (2009) A survey: clustering ensembles techniques. World Acad Sci Eng Technol 38:636–645

    Google Scholar 

  30. Topchy A, Jain AK, Punch W (2003) Combining multiple weak clusterings. In: Proceedings of the IEEE international conference on data mining (ICDM172003). Melbourne, Florida, USA, pp 331–338

  31. Yang LY, Zhang JY, Wang WJ (2009) Cluster ensemble based on particle swarm optimization. In: Global congress on intelligent systems, pp 519–523. IEEE

  32. Zhong C, Yue X, Zhang Z, Lei J (2015) A clustering ensemble: two-level-refined co-association matrix with path-based transformation. Pattern Recognit 48(8):2699–2709

    Article  Google Scholar 

Download references

Acknowledgements

This work has been financially supported partially by Capes/Brazil.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Anne Canuto.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Canuto, A., Neto, A.F., Silva, H.M. et al. Population-based bio-inspired algorithms for cluster ensembles optimization. Nat Comput 19, 515–532 (2020). https://doi.org/10.1007/s11047-018-9682-1

Download citation

Keywords

  • Cluster ensemble
  • Consensus partition
  • Population-based bio-inspired optimization