Abstract
k-means clustering is a popular clustering method.Its core task of finding the closest prototype for every input pattern involves expensive distance calculations. We present a novel algorithm for performing this task. This and other optimizations are shown to significantly improve the performance of the k-means algorithm. The resultant algorithm produces the same (except for round-off errors)results as those of the direct algorithm.
A large part of this work was done while the authors at the Information Technology Lab (ITL)of Hitachi America Ltd.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. AlSabti. Efficient Algorithms for Data Mining. Ph.D. Thesis, Syracuse University, 1998.
R.C. Dubes and A.K. Jain. Algorithms fo Clustering Data. Prentice Hall, 1988.
D. Judd, P. McKinley and A. Jain. Large-Scale Parallel Data Clustering. Proc. Int’l Conference on Pattern Recognition, August 1996.
K. Mehrotra C.K. Mohan, and S. Ranka. Elements of Artificial Neural Networks. MIT Press, 1996.
T. Zhang, R. Ramakrishnan and M. Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. Proc.of the ACM SIGMOD’f96.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
AlSabti, K., Ranka, S., Singh, V. (1999). An Efficient Space-Partitioning Based Algorithm for the K-Means Clustering. In: Zhong, N., Zhou, L. (eds) Methodologies for Knowledge Discovery and Data Mining. PAKDD 1999. Lecture Notes in Computer Science(), vol 1574. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48912-6_47
Download citation
DOI: https://doi.org/10.1007/3-540-48912-6_47
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65866-5
Online ISBN: 978-3-540-48912-2
eBook Packages: Springer Book Archive