Abstract
Spatial data mining is the discovery of novel and interesting relationships and characteristics that may exist implicitly in spatial databases. The identification of clusters, coupled with a Geographical Information System, provides a means of information generalization. A variety of clustering approaches exist. One non-hierarchical method used in data mining applications is the medoid approach, for which many heuristics have been developed. This paper carefully analyses the complexity of hill-climbing heuristics for medoid-based spatial clustering and identifies improvements to recently suggested heuristics such as CLARANS. We propose a novel idea, stopping the heuristic search early, and demonstrate that this provides large savings in computational time while the quality of the partition remains unaffected.
This research was supported in part by a grant from the Australian Research Council.
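The abstract names hill-climbing over medoid sets and an early stop of the search but does not give the stopping rule. The following is therefore only a minimal sketch of a randomized swap heuristic in the spirit of CLARANS, not the authors' algorithm. The function names and the parameters `max_failures` and `min_gain` are hypothetical, introduced here purely for illustration, and the early stop shown (giving up after a run of non-improving swaps) is an assumed stand-in for whatever criterion the paper defines.

```python
import math
import random


def total_cost(points, medoids):
    """Sum of distances from each point to its nearest medoid."""
    return sum(min(math.dist(p, points[m]) for m in medoids) for p in points)


def hill_climb_medoids(points, k, max_failures=50, min_gain=0.0, seed=0):
    """Randomized swap-based hill climbing for k medoids.

    max_failures and min_gain are hypothetical stopping parameters chosen
    for illustration; they are not the criterion defined in the paper.
    """
    rng = random.Random(seed)
    medoids = rng.sample(range(len(points)), k)  # random initial medoid indices
    cost = total_cost(points, medoids)
    failures = 0
    # Assumed early stop: quit after max_failures consecutive non-improving tries.
    while failures < max_failures:
        slot = rng.randrange(k)                 # which medoid to replace
        candidate = rng.randrange(len(points))  # proposed replacement point
        if candidate in medoids:
            failures += 1
            continue
        trial = medoids[:slot] + [candidate] + medoids[slot + 1:]
        trial_cost = total_cost(points, trial)
        if cost - trial_cost > min_gain:
            # Accept only swaps that strictly improve the partition cost.
            medoids, cost = trial, trial_cost
            failures = 0
        else:
            failures += 1
    return sorted(medoids), cost


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    print(hill_climb_medoids(pts, k=2, max_failures=500))
```

In this sketch, lowering `max_failures` or raising `min_gain` ends the search sooner, trading computation against the risk of stopping at a poorer partition.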
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Estivill-Castro, V., Murray, A.T. (1998). Discovering associations in spatial data — An efficient medoid based approach. In: Wu, X., Kotagiri, R., Korb, K.B. (eds) Research and Development in Knowledge Discovery and Data Mining. PAKDD 1998. Lecture Notes in Computer Science, vol 1394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64383-4_10
DOI: https://doi.org/10.1007/3-540-64383-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64383-8
Online ISBN: 978-3-540-69768-8
eBook Packages: Springer Book Archive