Abstract
Mining knowledge and predicting the behavior of the data have become major challenge with the advent of unprecedented escalation in the volume of the existing databases. Generally, clustering is adopted for voluminous and intricate data. In the present work, two techniques of K-means clustering, namely, K-means algorithm with random sampling (without realignment) and K-means algorithm with realignment sampling, are compared in terms of time taken and number of moves made for clustering the given data. The first one checks for any transfers between the clusters after inserting all the data. The second one is to check for any transfers between clusters for each new data inserted into cluster. The experimental results reveal that K-means clustering algorithm with realignment has performed reasonably well against K-means clustering algorithm without realignment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Narayanan, R., Ozisikyilmaz, B., Zambreno, J., Memik, G., Choudhary, A.: MineBench: a benchmark suite for data mining workloads. In: 2006 IEEE International Symposium on Workload Characterization, pp. 182–188. San Jose, CA (2006)
Anh, DT., Thanh, LH.: An efficient implementation of k-means clustering for time series data with DTW distance. Int. J. Bus. Intell. Data Min. 10(3), 213–232 (2015)
Sakthi, M., Thanamani, AS.: An effective determination of initial centroids in K-Means clustering using kernel PCA. Int. J. Comput. Sci. Inf. Technol. 2(3), 955–959 (2011)
Sangalli, L.M., Secchi, P., Vantini, S., Vitelli, L.: K-mean alignment for curve clustering. Comput. Stat. Data Anal. 54(5), 1219–1233 (2010)
Bradley, P., Fayyad, U.: Refining initial data items for k-means clustering. In: Proceedings 15th International Conference on Machine Learning (1998)
Das, H., Naik, B., Behera, H. S.: Classification of diabetes mellitus disease (DMD): a data mining (DM) approach. Prog. Comput. Anal. Netw. 539–549 (2018) (Springer, Singapore)
Zhang, R., Rudnicky, A.: A large scale clustering scheme for K-means. In: 16th International Conference on Pattern Recognition (ICPR’02) (2002)
Nepolean, G., Ganga Lakshmi, G.: An efficient K-Means clustering algorithm for reducing time complexity using uniform distribution data points. Trends Inf. Sci. Comput. (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sirisha, D., Prasad, S.S. (2020). Toward Ameliorating K-Means Clustering Algorithm. In: Das, H., Pattnaik, P., Rautaray, S., Li, KC. (eds) Progress in Computing, Analytics and Networking. Advances in Intelligent Systems and Computing, vol 1119. Springer, Singapore. https://doi.org/10.1007/978-981-15-2414-1_40
Download citation
DOI: https://doi.org/10.1007/978-981-15-2414-1_40
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2413-4
Online ISBN: 978-981-15-2414-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)