Abstract
Like other iterative refinement clustering algorithms, the Neighborhood Expectation-Maximization (NEM) algorithm is sensitive to the initial state of cluster separation. Therefore, the study of the initialization methods is of great importance for the success of finding a better sub-optimal solution in practice. However, existing initialization methods for mixture model based clustering using EM-style algorithms do not account for the unique properties of spatial data, such as spatial autocorrelation. To that end, this paper incorporates spatial information into the initialization and compares three representative initialization methods. Our experimental results on both synthetic and real-world datasets show that the NEM algorithm usually leads to a better clustering solution if it starts with initial states returned by the spatial augmented initialization method based on K-Means.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society B(39), 1–38 (1977)
Ambroise, C., Govaert, G.: Convergence of an EM-type algorithm for spatial clustering. Pattern Recognition Letters 19(10), 919–927 (1998)
Topchy, A., Jain, A.K., Punch, W.: Clustering ensembles: Models of consensus and weak partitions. IEEE Transactions on Pattern Aanalysis and Machine Intelligence 27(12), 1866–1881 (2005)
Neal, R., Hinton, G.: A view of the EM algorithm that justifies incremental, sparse, and other variants. In: Jordan, M. (ed.) Learning in Graphical Models, pp. 355–368. Kluwer Academic Publishers, Dordrecht (1998)
Franti, P., Kivijarvi, J.: Randomised local search algorithm for the clustering problem. Pattern Analysis & Applications 3(4), 358–369 (2000)
Celeux, G., Govaert, G.: A classification EM algorithm for clustering and two stochastic versions. Computational Statistics and Data Analysis 14(3), 315–332 (1992)
Katsavounidis, I., Kuo, C., Zhang, Z.: A new initialization technique for generalized lloyd iteration. IEEE Signal Processing Letters 1(10), 144–146 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hu, T., Ouyang, J., Qu, C., Liu, C. (2009). Initialization of the Neighborhood EM Algorithm for Spatial Clustering. In: Huang, R., Yang, Q., Pei, J., Gama, J., Meng, X., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2009. Lecture Notes in Computer Science(), vol 5678. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03348-3_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-03348-3_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03347-6
Online ISBN: 978-3-642-03348-3
eBook Packages: Computer ScienceComputer Science (R0)