Skip to main content

An Algorithm for Incremental Nearest Neighbor Search in High-Dimensional Data Spaces

  • Conference paper
  • First Online:
The Human Society and the Internet Internet-Related Socio-Economic Issues (HSI 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2105))

Included in the following conference series:

  • 832 Accesses

Abstract

The SPY-TEC (Spherical Pyramid-Technique) [8] was proposed as a new indexing method for high-dimensional data spaces using a special partitioning strategy that divides a d-dimensional data space into 2d spherical pyramids. Although the authors of [8] proposed an efficient algorithm for processing hyperspherical range queries, they did not propose an algorithm for processing k-nearest neighbor queries that are frequently used in similarity search. In this paper, we propose an efficient algorithm for processing exact nearest neighbor queries on the SPY-TEC by extending the incremental nearest neighbor algorithm proposed in [10]. We also introduce a metric that can be used to guide an ordered best-first traversal when finding nearest neighbors on the SPYTEC. Finally, we show that our technique significantly outperforms the related techniques in processing k-nearest neighbor queries by comparing it to the R*-tree, the X-tree, and the sequential scan through extensive experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Guttman. “R-trees: a dynamic index structure for spatial searching”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 47–57, June 1984.

    Google Scholar 

  2. A. Henrich. “The LSDh-Tree: An Access Structure for Feature Vectors”. Proc. 14th Int. Conf on Data Engineering, pages 362–369, 1998.

    Google Scholar 

  3. C. Faloutsos, R. Barber, M. Flickner, J. Hafner, W. Niblack, D. Petkovic, and W. Equiz. “Efficient and Effective Querying by Image Content”. Journal of Intelligent Information System(JIIS), 3(3):231–262, July 1994.

    Article  Google Scholar 

  4. B.C. Ooi, K.L. Tan, T.S. Chua, and W. Hsu. “Fast image retrieval using colorspatial information”. The VLDB Journal, 7(2):115–128, 1998.

    Article  Google Scholar 

  5. C.E. Jacobs, A. Finkelstein, and D.H. Salesin. “Fast Multiresolution Image Query”. Proc. of the 1995 ACM SIGGRAPH, New York, 1995.

    Google Scholar 

  6. D.A. White and R. Jain. “Similarity Indexing with the SS-tree”. Proc. 12th Int. Conf on Data Engineering, pages 516–523, 1996.

    Google Scholar 

  7. D.B. Lomet and B. Salzberg. “The hB-Tree: A Multiattribute Indexing Method with Good Guaranteed Performance”. ACM Transaction on Database Systems, 15(4):625–658, 1990.

    Article  Google Scholar 

  8. D.H. Lee and H.J. Kim. “SPY-TEC: An Efficient Indexing Method for Similarity Search in High-Dimensional Data Spaces”. Data & Knowledge Engineering, 34(1):77–97, 2000.

    Article  MATH  Google Scholar 

  9. C. Faloutsos. “Fast Searching by Content in Multimedia Databases”. Data Engineering Bulletin, 18(4), 1995.

    Google Scholar 

  10. G.R. Hjaltason and H. Samet. “Distance Browsing in Spatial Databases”. ACM Transaction on Database Systems, 24(2):265–318, 1999.

    Article  Google Scholar 

  11. J. Bentley. “Mutidimensional binary search trees used for associative searching”. Communications of the ACM, 18(9):509–517, 1975.

    Article  MATH  MathSciNet  Google Scholar 

  12. J.R. Smith and S.-F. Chang. “VisualSEEk: a fully automated content-based image query system”. ACM Multimedia 96, Boston, MA, 1996.

    Google Scholar 

  13. J.T. Robinson. “The K-D-B-tree: a Search Structure for Large Multidimensional Dynamic Indexes”. Proc. ACM SIGMOD, Ann Arbor, USA, pages 10–18, April 1981.

    Google Scholar 

  14. K.-I. Lin, H.V. Jagadish, and C. Faloutsos. “The TV-tree: An Index Structure for High-Dimensional Data”. The VLDB Journal, 3(4):517–542, 1994.

    Article  Google Scholar 

  15. L. Leithold. “Trigonometry”. Addison-Wesley, 1989.

    Google Scholar 

  16. D.H. Lee and H.J. Kim. “An Efficient Nearest Neighbor Search in High-Dimensional Data Spaces”. Seoul National University, CE Technical Report (OOPSLA-TR1028), http://oopsla.snu.ac.kr/~dhlee/OOPSLA-TR1028.ps , 2000.

  17. N. Katayama and S. Satoh. “The SR-tree: An Index Structure for High-Dimensional Nearest Neighbor Queries”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 517–542, May 1997.

    Google Scholar 

  18. N. Roussopoulos, S. Kelley, and F. Vincent. “Nearest Neighbor Queries”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 71–79, 1995.

    Google Scholar 

  19. K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. “When Is “Nearest Neighbor” Meaningful ? ”. Proc. 7th Int. Conf. on Database Teory, pages 217–235, January 1999.

    Google Scholar 

  20. S. Berchtold, C. Böhm, and H.-P. Kriegel. “The Pyramid-Technique: Towards Breaking the Curse of Dimensionality”. Proc. ACM SIGMOD Int. Conf. on Management of Data, 1998.

    Google Scholar 

  21. S. Berchtold, C. Böhm, D.A. Keim, and H.-P. Kriegel. “A Cost Model For Nearest Neighbor Search in High-Dimensional Data Space”. ACM PODS Symposium on Principles of Database Systems, Tucson, Arizona, 1997.

    Google Scholar 

  22. S. Berchtold, D.A. Keim, and H.-P. Kriegel. “The X-tree: An Indexing Structure for High-Dimensional Data”. Proc. 22nd Int. Conf. on Very Large Database, pages 28–39, September 1996.

    Google Scholar 

  23. P.M. Kelly, T.M. Cannon and D.R. Hush. “Query by image example: the CANDID approach”. Proc. SPIE Storage and Retrieval for Image and Video Databases III, 2420: 238–248, 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, DH., Lee, HD., Choi, IH., Kim, HJ. (2001). An Algorithm for Incremental Nearest Neighbor Search in High-Dimensional Data Spaces. In: Kim, W., Ling, TW., Lee, YJ., Park, SS. (eds) The Human Society and the Internet Internet-Related Socio-Economic Issues. HSI 2001. Lecture Notes in Computer Science, vol 2105. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47749-7_35

Download citation

  • DOI: https://doi.org/10.1007/3-540-47749-7_35

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42313-3

  • Online ISBN: 978-3-540-47749-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics