Abstract
Clustering is an important methodology for data mining and data analysis. K-Means is a simple and fast algorithm for clustering data. However the performance of K-means is highly sensitive on the initial seed of the algorithm. Heuristic Kalman Algorithm (HKA) is a population based stochastic optimization technique which is an effective method for searching a near-optimal solution of a function. Although HKA has good global search characteristics, it is shown that when directly applied on clustering it performs poorly. This paper proposes a new approach KHKA, which combines the benefits of the global nature of HKA and the fast convergence of K-means. KHKA was implemented and benchmarked on synthetic and real datasets from UCI Machine Learning Repository. The results were compared with other population based, stochastic algorithms. Results show that KHKA is a promising algorithm and was able to perform better than the compared algorithms with respect to the used datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Selim, S.Z., Ismail, M.A.: K-means-type algorithms: a generalized convergence theorem and characterization of local optimality. IEEE Trans. Pattern Anal. Mach. Intell. PAMI–6, 81–87 (1984)
Sung, C., Jin, H.: A tabu-search-based heuristic for clustering. Pattern Recogn. 33, 849–858 (2000)
Toscano, R., Lyonnet, P.: A new heuristic approach for non-convex optimization problems. Inf. Sci. 180, 1955–1966 (2010). Special Issue on Intelligent Distributed Information Systems
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (2011)
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010). Award winning papers from the 19th International Conference on Pattern Recognition (ICPR)
Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag New York Inc, Secaucus, NJ, USA (2006)
Bache, K., Lichman, M.: UCI machine learning repository (2013)
Pal, S.K., Dutta Majumder, D.: Fuzzy sets and decision making approaches in vowel and speaker recognition. IEEE Trans. Syst. Man Cybern. 7, 625–629 (1977)
Maulik, U., Bandyopadhyay, S.: Genetic algorithm-based clustering technique. Pattern Recogn. 33, 1455–1465 (2000)
Chen, C.Y., Ye, F.: Particle swarm optimization algorithm and its application to clustering analysis. In: IEEE International Conference on Networking, Sensing and Control, 2004, vol. 2, pp. 789–794 (2004)
Sanderson, C.: Armadillo: an open source C++ linear algebra library for fast prototyping and computationally intensive experiments. Technical report, NICTA, Australia (2010)
Hatamlou, A., Abdullah, S., Nezamabadi-pour, H.: A combined approach for clustering based on k-means and gravitational search algorithms. Swarm Evol. Comput. 6, 47–52 (2012)
Acknowledgments
The author would like to thank Dr. Vikram Pakrashi, University College Cork, Ireland, for the valuable suggestions, discussions and constant support throughout the entire period of research.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Pakrashi, A. (2015). A New Hybrid Clustering Approach Based on Heuristic Kalman Algorithm. In: Panigrahi, B., Suganthan, P., Das, S. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2014. Lecture Notes in Computer Science(), vol 8947. Springer, Cham. https://doi.org/10.1007/978-3-319-20294-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-20294-5_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20293-8
Online ISBN: 978-3-319-20294-5
eBook Packages: Computer ScienceComputer Science (R0)