Clustering Algorithms: Experiment and Improvements

Khandare, Anand; Alvi, A. S.

doi:10.1007/978-981-10-3935-5_27

Anand Khandare⁴ &
A. S. Alvi⁵

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 12))

770 Accesses
2 Citations

Abstract

Clustering is data mining method to divide the data objects into n number of clusters. Clustering algorithms can be used in domains such as e-commerce, bio-informatics, image segmentation, speech recognition, financial analysis, and fraud detection. There is abandon knowledge in the clustering research and applications and also various improvements are done on various clustering algorithms. This paper includes the study and survey of various concepts and clustering algorithms by experimenting on it on some data sets and then analyzed gaps and scope for enhancement and scalability of algorithms. Then improved k-means is proposed to minimize these gaps. This improved algorithm automatically finds value of number of clusters and calculates initial centroids in better way rather random selection. From the experimentation, it is found that numbers of iterations are reduced; clusters quality increased and also minimized empty clusters in proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dunham MH (2006) Data mining-introduction and advanced concepts. Pearson Education
Google Scholar
Aggarwal CC, Zhai C (2012) Survey on text clustering algorithms in mining text data. Springer, USA, pp 77–128
Google Scholar
Mahmood A, Leckie C, Udaya P (2007) An efficient clustering scheme to exploit hierarchical data in NW traffic analysis. IEEE Tran. Knowl Data Eng 20(6):752–767
Google Scholar
Kanungo T, Mount DM, Netanyahu NS, Wu AY, Piatko CD, Silverman R (2002) An efficient k-means clustering algorithm-analysis and implementation. IEEE Trans Pattern Anal Mach Intell 24(7)
Google Scholar
Pham DT, Dimov SS, Nguyen CD (2005) Selection of value of k in k-means clustering. Proc Mech Mech Eng Sci 219
Google Scholar
Fong S (2013) Opportunities and challenges of integrating bio inspired optimization and data mining algorithms. In: Swarm intelligence and bio inspired computation. Elsevier, pp 385–401
Google Scholar
Abbasi AA, Younis M (2007) A survey of clustering algorithms for wireless sensor networks. Comput Commun 30(14), 15, 2822841
Google Scholar
Bishnu PS, Bhattacherjee V (2012) Software fault predictions using quad tree based k-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6)
Google Scholar
Siddiqui FU, Isa NAM (2011) Enhanced moving k-means algorithm for image segmentation. IEEE Tran Consum Electron 57(2)
Google Scholar
Khandare AD (2015) A modified k-means algorithm for emotional intelligence mining, ICCCI-15, Coimbatore, India, pp 1–3
Google Scholar
Harrison R, Zhong W, Altun G, Tai PC, Pan Y (2005) Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property. IEEE Trans Nanobiosci 4(3)
Google Scholar
Jaber H, Marle F, Jankovic M (2015) Improving the collaborative decision making in the new products development project using clustering algorithm. IEEE Trans Eng Manag 62(4)
Google Scholar
Li T-HS, Kao M-C, Kuo P-H (2016) Recognitions system for the home service related sign languages using entropy based kmeans algorithm and the ABC based HMM. IEEE Trans Syst Man Cybern Syst 46(1)
Google Scholar
Wu X, Zhu X, Wu G-Q, Ding W (2014) Data mining on big data. IEEE Trans Knowl Data Eng 26(1)
Google Scholar
Traganitis PA, Slavakis K, Giannakis GB (2015) Sketch and validate big data clustering. IEEE J Sel Top Signal Process 9(4)
Google Scholar
Khandare A, Alvi AS (2016) Survey of improved k-means clustering algorithms-an improvements, shortcoming and scope for further enhancement and scalability, INDIA-2016, vol 434. AISC Springer, pp 495–503
Google Scholar
Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3)
Google Scholar
AM Fahim, AM Salem, FATorkey, M.A. Ramadan (2006) An efficient enhance kmeans clustering algorithm. J Zhejiang Univ Sci 7(10):1626–1633
Google Scholar
Verma NK, Roy A (2014) Self optimal clustering techniques using optimized threshold function. IEEE Syst J 8(4)
Google Scholar
Harb H, Makhoul A, Couturier R (2015) Enhanced k-means, ANOVA based clustering approach for similarity aggregation in underwater wireless sensor networks. IEEE Sens J 15(10)
Google Scholar
Liang H-W, Chung W-H, Kuo S-Y (2016) Coding aided k-means clustering blind transceiver for space shift keying mimo system. IEEE Trans Wirel Commun 15(1)
Google Scholar
Kumar R, Dwivedi R (2016) Quaternion domain kmeans clustering for the improved real time classification of E-Nose data. IEEE Sens J 16(1)
Google Scholar
Antonenko PD, Toy S, Niederhauser DS (2012) Using cluster analysis for the data mining in educational technology research R&D
Google Scholar
Kwak J, Lee T, Kim CO (2015) Incremental clustering algorithm based fault detection algorithm for class imbalanced process data. IEEE Trans Semicond Manuf 28(3) (Yonsei University, Seoul, Korea)
Google Scholar
Sulaiman SN, Isa NAM (2010) Adaptive fuzzy k-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4)
Google Scholar
Huang X, Ye Y, Zhang H (2014) Extensions of k-means type algorithms: a new clustering framework by integrating intra cluster compactness and inter cluster separation, IEEE Trans Neural Netw Learn Syst 25(8)
Google Scholar
Xie M, Cui H, Cai Y, Huang X, Liu Y (2014) Cluster validity index for adaptive clustering algorithms. IET Commun 8(13)
Google Scholar
Bandyopadhyay S, Coyle E (2003) An energy efficient hierarchical clustering algorithm for wireless sensor networks. In: Proceedings of the 22 annual joint conference, IEEE computer and communication societies, San Francisco, California
Google Scholar
An F, Mattausch HJ (2013) k-means clustering algorithm for multimedia application with flexible hardware and software co-design. J Syst Archit 59(3) (Elsevier)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of CSE, SGB Amravati University, Amravati, India
Anand Khandare
Department of CSE, PRMIT & R, Badnera, Amravati, India
A. S. Alvi

Authors

Anand Khandare
View author publications
You can also search for this author in PubMed Google Scholar
A. S. Alvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anand Khandare .

Editor information

Editors and Affiliations

Department of Software and Systems Engineering, VIT University, Vellore, Tamil Nadu, India
H.R . Vishwakarma
Department of Electronics and Communication Engineering, ITM University, Gwalior, Madhya Pradesh, India
Shyam Akashe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khandare, A., Alvi, A.S. (2017). Clustering Algorithms: Experiment and Improvements. In: Vishwakarma, H., Akashe, S. (eds) Computing and Network Sustainability. Lecture Notes in Networks and Systems, vol 12. Springer, Singapore. https://doi.org/10.1007/978-981-10-3935-5_27

Download citation

DOI: https://doi.org/10.1007/978-981-10-3935-5_27
Published: 06 July 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3934-8
Online ISBN: 978-981-10-3935-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics