Abstract
Clustering is data mining method to divide the data objects into n number of clusters. Clustering algorithms can be used in domains such as e-commerce, bio-informatics, image segmentation, speech recognition, financial analysis, and fraud detection. There is abandon knowledge in the clustering research and applications and also various improvements are done on various clustering algorithms. This paper includes the study and survey of various concepts and clustering algorithms by experimenting on it on some data sets and then analyzed gaps and scope for enhancement and scalability of algorithms. Then improved k-means is proposed to minimize these gaps. This improved algorithm automatically finds value of number of clusters and calculates initial centroids in better way rather random selection. From the experimentation, it is found that numbers of iterations are reduced; clusters quality increased and also minimized empty clusters in proposed algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dunham MH (2006) Data mining-introduction and advanced concepts. Pearson Education
Aggarwal CC, Zhai C (2012) Survey on text clustering algorithms in mining text data. Springer, USA, pp 77–128
Mahmood A, Leckie C, Udaya P (2007) An efficient clustering scheme to exploit hierarchical data in NW traffic analysis. IEEE Tran. Knowl Data Eng 20(6):752–767
Kanungo T, Mount DM, Netanyahu NS, Wu AY, Piatko CD, Silverman R (2002) An efficient k-means clustering algorithm-analysis and implementation. IEEE Trans Pattern Anal Mach Intell 24(7)
Pham DT, Dimov SS, Nguyen CD (2005) Selection of value of k in k-means clustering. Proc Mech Mech Eng Sci 219
Fong S (2013) Opportunities and challenges of integrating bio inspired optimization and data mining algorithms. In: Swarm intelligence and bio inspired computation. Elsevier, pp 385–401
Abbasi AA, Younis M (2007) A survey of clustering algorithms for wireless sensor networks. Comput Commun 30(14), 15, 2822841
Bishnu PS, Bhattacherjee V (2012) Software fault predictions using quad tree based k-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6)
Siddiqui FU, Isa NAM (2011) Enhanced moving k-means algorithm for image segmentation. IEEE Tran Consum Electron 57(2)
Khandare AD (2015) A modified k-means algorithm for emotional intelligence mining, ICCCI-15, Coimbatore, India, pp 1–3
Harrison R, Zhong W, Altun G, Tai PC, Pan Y (2005) Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property. IEEE Trans Nanobiosci 4(3)
Jaber H, Marle F, Jankovic M (2015) Improving the collaborative decision making in the new products development project using clustering algorithm. IEEE Trans Eng Manag 62(4)
Li T-HS, Kao M-C, Kuo P-H (2016) Recognitions system for the home service related sign languages using entropy based kmeans algorithm and the ABC based HMM. IEEE Trans Syst Man Cybern Syst 46(1)
Wu X, Zhu X, Wu G-Q, Ding W (2014) Data mining on big data. IEEE Trans Knowl Data Eng 26(1)
Traganitis PA, Slavakis K, Giannakis GB (2015) Sketch and validate big data clustering. IEEE J Sel Top Signal Process 9(4)
Khandare A, Alvi AS (2016) Survey of improved k-means clustering algorithms-an improvements, shortcoming and scope for further enhancement and scalability, INDIA-2016, vol 434. AISC Springer, pp 495–503
Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3)
AM Fahim, AM Salem, FATorkey, M.A. Ramadan (2006) An efficient enhance kmeans clustering algorithm. J Zhejiang Univ Sci 7(10):1626–1633
Verma NK, Roy A (2014) Self optimal clustering techniques using optimized threshold function. IEEE Syst J 8(4)
Harb H, Makhoul A, Couturier R (2015) Enhanced k-means, ANOVA based clustering approach for similarity aggregation in underwater wireless sensor networks. IEEE Sens J 15(10)
Liang H-W, Chung W-H, Kuo S-Y (2016) Coding aided k-means clustering blind transceiver for space shift keying mimo system. IEEE Trans Wirel Commun 15(1)
Kumar R, Dwivedi R (2016) Quaternion domain kmeans clustering for the improved real time classification of E-Nose data. IEEE Sens J 16(1)
Antonenko PD, Toy S, Niederhauser DS (2012) Using cluster analysis for the data mining in educational technology research R&D
Kwak J, Lee T, Kim CO (2015) Incremental clustering algorithm based fault detection algorithm for class imbalanced process data. IEEE Trans Semicond Manuf 28(3) (Yonsei University, Seoul, Korea)
Sulaiman SN, Isa NAM (2010) Adaptive fuzzy k-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4)
Huang X, Ye Y, Zhang H (2014) Extensions of k-means type algorithms: a new clustering framework by integrating intra cluster compactness and inter cluster separation, IEEE Trans Neural Netw Learn Syst 25(8)
Xie M, Cui H, Cai Y, Huang X, Liu Y (2014) Cluster validity index for adaptive clustering algorithms. IET Commun 8(13)
Bandyopadhyay S, Coyle E (2003) An energy efficient hierarchical clustering algorithm for wireless sensor networks. In: Proceedings of the 22 annual joint conference, IEEE computer and communication societies, San Francisco, California
An F, Mattausch HJ (2013) k-means clustering algorithm for multimedia application with flexible hardware and software co-design. J Syst Archit 59(3) (Elsevier)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Khandare, A., Alvi, A.S. (2017). Clustering Algorithms: Experiment and Improvements. In: Vishwakarma, H., Akashe, S. (eds) Computing and Network Sustainability. Lecture Notes in Networks and Systems, vol 12. Springer, Singapore. https://doi.org/10.1007/978-981-10-3935-5_27
Download citation
DOI: https://doi.org/10.1007/978-981-10-3935-5_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3934-8
Online ISBN: 978-981-10-3935-5
eBook Packages: EngineeringEngineering (R0)