Abstract
Many real-life applications use various kinds of clustering algorithms. Very popular and interesting are applications dealing with spatial data, like on-line map services or traffic tracking systems. A very important branch of spatial systems is telemetry. Our current research is focused on providing an efficient caching structure that will accelerate spatial queries evaluation and improve the ways of storing and processing aggregates. We use a density-based clustering algorithm to create the structure levels. The used clustering algorithm is fast and efficient but it requires a user-defined Eps parameter. As we cannot get the Eps parameter from the user for every level of the structure, we propose an Automatic Eps Calculation (AEC) algorithm which, based on the points distribution characteristics, is able to estimate the Eps parameter value. The algorithm is not limited to the telemetry-specific data and can be applied to any set of points located in a two-dimensional space. We describe in detail the algorithm operation, test results and possible algorithm improvements.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barclay, T., Slutz, D.R., Gray, J.: TerraServer: A Spatial Data Warehouse. In: Proc. ACM SIGMOD 2000, pp. 307–318 (June 2000)
http://www.lsgi.polyu.edu.hk/sTAFF/zl.li/vol_2_2/02_chen.pdf
Ester, M., Kriegel, H.-P., Sander, J., Wimmer, M.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: Proc. of 2nd International Conference on Knowledge Discovery and Data Mining (1996)
Gorawski, M., Malczok, R.: On Efficient Storing and Processing of Long Aggregate Lists. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589, pp. 190–199. Springer, Heidelberg (2005)
Papadias, D., Kalnis, P., Zhang, J., Tao, Y.: APN 2001. LNCS. Spinger, Heidelberg (2001)
Wang, X., Hamilton, H.J.: DBRS: A Density-Based Spatial Clustering Method with Random Sampling. In: PAKDD (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gorawski, M., Malczok, R. (2006). Towards Automatic Eps Calculation in Density-Based Clustering. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds) Advances in Databases and Information Systems. ADBIS 2006. Lecture Notes in Computer Science, vol 4152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827252_24
Download citation
DOI: https://doi.org/10.1007/11827252_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37899-0
Online ISBN: 978-3-540-37900-3
eBook Packages: Computer ScienceComputer Science (R0)