Skip to main content

Clustering Algorithms: Experiment and Improvements

  • Conference paper
  • First Online:
Computing and Network Sustainability

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 12))

Abstract

Clustering is data mining method to divide the data objects into n number of clusters. Clustering algorithms can be used in domains such as e-commerce, bio-informatics, image segmentation, speech recognition, financial analysis, and fraud detection. There is abandon knowledge in the clustering research and applications and also various improvements are done on various clustering algorithms. This paper includes the study and survey of various concepts and clustering algorithms by experimenting on it on some data sets and then analyzed gaps and scope for enhancement and scalability of algorithms. Then improved k-means is proposed to minimize these gaps. This improved algorithm automatically finds value of number of clusters and calculates initial centroids in better way rather random selection. From the experimentation, it is found that numbers of iterations are reduced; clusters quality increased and also minimized empty clusters in proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dunham MH (2006) Data mining-introduction and advanced concepts. Pearson Education

    Google Scholar 

  2. Aggarwal CC, Zhai C (2012) Survey on text clustering algorithms in mining text data. Springer, USA, pp 77–128

    Google Scholar 

  3. Mahmood A, Leckie C, Udaya P (2007) An efficient clustering scheme to exploit hierarchical data in NW traffic analysis. IEEE Tran. Knowl Data Eng 20(6):752–767

    Google Scholar 

  4. Kanungo T, Mount DM, Netanyahu NS, Wu AY, Piatko CD, Silverman R (2002) An efficient k-means clustering algorithm-analysis and implementation. IEEE Trans Pattern Anal Mach Intell 24(7)

    Google Scholar 

  5. Pham DT, Dimov SS, Nguyen CD (2005) Selection of value of k in k-means clustering. Proc Mech Mech Eng Sci 219

    Google Scholar 

  6. Fong S (2013) Opportunities and challenges of integrating bio inspired optimization and data mining algorithms. In: Swarm intelligence and bio inspired computation. Elsevier, pp 385–401

    Google Scholar 

  7. Abbasi AA, Younis M (2007) A survey of clustering algorithms for wireless sensor networks. Comput Commun 30(14), 15, 2822841

    Google Scholar 

  8. Bishnu PS, Bhattacherjee V (2012) Software fault predictions using quad tree based k-means clustering algorithm. IEEE Trans Knowl Data Eng 24(6)

    Google Scholar 

  9. Siddiqui FU, Isa NAM (2011) Enhanced moving k-means algorithm for image segmentation. IEEE Tran Consum Electron 57(2)

    Google Scholar 

  10. Khandare AD (2015) A modified k-means algorithm for emotional intelligence mining, ICCCI-15, Coimbatore, India, pp 1–3

    Google Scholar 

  11. Harrison R, Zhong W, Altun G, Tai PC, Pan Y (2005) Improved k-means clustering algorithm for exploring local protein sequence motifs representing common structural property. IEEE Trans Nanobiosci 4(3)

    Google Scholar 

  12. Jaber H, Marle F, Jankovic M (2015) Improving the collaborative decision making in the new products development project using clustering algorithm. IEEE Trans Eng Manag 62(4)

    Google Scholar 

  13. Li T-HS, Kao M-C, Kuo P-H (2016) Recognitions system for the home service related sign languages using entropy based kmeans algorithm and the ABC based HMM. IEEE Trans Syst Man Cybern Syst 46(1)

    Google Scholar 

  14. Wu X, Zhu X, Wu G-Q, Ding W (2014) Data mining on big data. IEEE Trans Knowl Data Eng 26(1)

    Google Scholar 

  15. Traganitis PA, Slavakis K, Giannakis GB (2015) Sketch and validate big data clustering. IEEE J Sel Top Signal Process 9(4)

    Google Scholar 

  16. Khandare A, Alvi AS (2016) Survey of improved k-means clustering algorithms-an improvements, shortcoming and scope for further enhancement and scalability, INDIA-2016, vol 434. AISC Springer, pp 495–503

    Google Scholar 

  17. Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3)

    Google Scholar 

  18. AM Fahim, AM Salem, FATorkey, M.A. Ramadan (2006) An efficient enhance kmeans clustering algorithm. J Zhejiang Univ Sci 7(10):1626–1633

    Google Scholar 

  19. Verma NK, Roy A (2014) Self optimal clustering techniques using optimized threshold function. IEEE Syst J 8(4)

    Google Scholar 

  20. Harb H, Makhoul A, Couturier R (2015) Enhanced k-means, ANOVA based clustering approach for similarity aggregation in underwater wireless sensor networks. IEEE Sens J 15(10)

    Google Scholar 

  21. Liang H-W, Chung W-H, Kuo S-Y (2016) Coding aided k-means clustering blind transceiver for space shift keying mimo system. IEEE Trans Wirel Commun 15(1)

    Google Scholar 

  22. Kumar R, Dwivedi R (2016) Quaternion domain kmeans clustering for the improved real time classification of E-Nose data. IEEE Sens J 16(1)

    Google Scholar 

  23. Antonenko PD, Toy S, Niederhauser DS (2012) Using cluster analysis for the data mining in educational technology research R&D

    Google Scholar 

  24. Kwak J, Lee T, Kim CO (2015) Incremental clustering algorithm based fault detection algorithm for class imbalanced process data. IEEE Trans Semicond Manuf 28(3) (Yonsei University, Seoul, Korea)

    Google Scholar 

  25. Sulaiman SN, Isa NAM (2010) Adaptive fuzzy k-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4)

    Google Scholar 

  26. Huang X, Ye Y, Zhang H (2014) Extensions of k-means type algorithms: a new clustering framework by integrating intra cluster compactness and inter cluster separation, IEEE Trans Neural Netw Learn Syst 25(8)

    Google Scholar 

  27. Xie M, Cui H, Cai Y, Huang X, Liu Y (2014) Cluster validity index for adaptive clustering algorithms. IET Commun 8(13)

    Google Scholar 

  28. Bandyopadhyay S, Coyle E (2003) An energy efficient hierarchical clustering algorithm for wireless sensor networks. In: Proceedings of the 22 annual joint conference, IEEE computer and communication societies, San Francisco, California

    Google Scholar 

  29. An F, Mattausch HJ (2013) k-means clustering algorithm for multimedia application with flexible hardware and software co-design. J Syst Archit 59(3) (Elsevier)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anand Khandare .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Khandare, A., Alvi, A.S. (2017). Clustering Algorithms: Experiment and Improvements. In: Vishwakarma, H., Akashe, S. (eds) Computing and Network Sustainability. Lecture Notes in Networks and Systems, vol 12. Springer, Singapore. https://doi.org/10.1007/978-981-10-3935-5_27

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3935-5_27

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3934-8

  • Online ISBN: 978-981-10-3935-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics