Skip to main content

Generalizing the Optimality of Multi-step k-Nearest Neighbor Query Processing

  • Conference paper
Advances in Spatial and Temporal Databases (SSTD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4605))

Included in the following conference series:

Abstract

Similarity search algorithms that directly rely on index structures and require a lot of distance computations are usually not applicable to databases containing complex objects and defining costly distance functions on spatial, temporal and multimedia data. Rather, the use of an adequate multi-step query processing strategy is crucial for the performance of a similarity search routine that deals with complex distance functions. Reducing the number of candidates returned from the filter step which then have to be exactly evaluated in the refinement step is fundamental for the efficiency of the query process. The state-of-the-art multi-step k-nearest neighbor (kNN) search algorithms are designed to use only a lower bounding distance estimation for candidate pruning. However, in many applications, also an upper bounding distance approximation is available that can additionally be used for reducing the number of candidates. In this paper, we generalize the traditional concept of R-optimality and introduce the notion of R I -optimality depending on the distance information I available in the filter step. We propose a new multi-step kNN search algorithm that utilizes lower- and upper bounding distance information (I lu ) in the filter step. Furthermore, we show that, in contrast to existing approaches, our proposed solution is \(R_{I_{lu}}\)-optimal. In an experimental evaluation, we demonstrate the significant performance gain over existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Guttman, A.: R-Trees: A dynamic index structure for spatial searching. In: Proc. SIGMOD, pp. 47–57 (1984)

    Google Scholar 

  2. Beckmann, N., Kriegel, H.P., Schneider, R., Seeger, B.: The R*-Tree: An efficient and robust access method for points and rectangles. In: Proc. SIGMOD, pp. 322–331 (1990)

    Google Scholar 

  3. Ciaccia, P., Patella, M., Zezula, P.: M-Tree: an efficient access method for similarity search in metric spaces. In: Proc. VLDB (1997)

    Google Scholar 

  4. Orenstein, J., Manola, F.: Probe spatial data modelling and query processing in an image database application. IEEE Trans. on Software Engineering 14(5), 611–629 (1988)

    Article  Google Scholar 

  5. Brinkhoff, T., Horn, H., Kriegel, H.P., Schneider, R.: A storage and access architecture for efficient query processing in spatial database systems. In: Abel, D.J., Ooi, B.-C. (eds.) SSD 1993. LNCS, vol. 692, Springer, Heidelberg (1993)

    Google Scholar 

  6. Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Lomet, D.B. (ed.) FODO 1993. LNCS, vol. 730, Springer, Heidelberg (1993)

    Google Scholar 

  7. Faloutsos, C., Manolopoulos, M R.a.Y.: Fast subsequence matching in time series database. In: Proc. SIGMOD (1994)

    Google Scholar 

  8. Korn, F., Sidiropoulos, N., Faloutsos, C., Siegel, E., Protopapas, Z.: Fast nearest neighbor search in medical image databases. In: Proc. VLDB (1996)

    Google Scholar 

  9. Seidl, T., Kriegel, H.P.: Optimal multi-step k-nearest neighbor search. In: Proc.SIGMOD (1998)

    Google Scholar 

  10. Hjaltason, G.R., Samet, H.: Ranking in spatial databases. In: Egenhofer, M.J., Herring, J.R. (eds.) SSD 1995. LNCS, vol. 951, Springer, Heidelberg (1995)

    Google Scholar 

  11. Samet, H.: Foundations of Multidimensional and Metric Data Structures. Morgan Kaufmann, San Francisco (2006)

    MATH  Google Scholar 

  12. Keogh, E.: Exact indexing of dynamic time warping. In: Proc. VLDB (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Dimitris Papadias Donghui Zhang George Kollios

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kriegel, HP., Kröger, P., Kunath, P., Renz, M. (2007). Generalizing the Optimality of Multi-step k-Nearest Neighbor Query Processing. In: Papadias, D., Zhang, D., Kollios, G. (eds) Advances in Spatial and Temporal Databases. SSTD 2007. Lecture Notes in Computer Science, vol 4605. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73540-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73540-3_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73539-7

  • Online ISBN: 978-3-540-73540-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics