Skip to main content

MB+tree: A Dynamically Updatable Metric Index for Similarity Search

  • Conference paper
  • First Online:
Web-Age Information Management (WAIM 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1846))

Included in the following conference series:

Abstract

One of the common query patterns is to find approximate matches to a given query object in a large database. This kind of query processing is referred as similarity search in a metric space. In this paper, we propose a new metric index MB+tree, called Metric B+tree, which supports near neighbour searching in a generic metric space. MB+tree is aimed at reducing both the number of I/O accesses and the number of distance calculations for similarity search in large databases, while allowing dynamic data updates. In this paper, we show that a B+tree, with an auxiliary tree, can be used as a metric index. Unlike other multidimensional (spatial) access methods, using our approach, we can partition data into disjoint partitions while building/maintaining a metric index, which can lead to a significant cost reduction since the number of metric sub-spaces to be searched is reduced. In order to use MB+tree, a slicing value is proposed. With the slicing value, in addition to space division information, a near neighbour searching can be systematically converted to a range search in B+tree. Several different slicing values are considered namely, one-focus-point scheme and two-focus-point scheme. We also conducted extensive experimental studies using synthetic data. Results are reported in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Reference

  1. A.K. Jain and R.C. Dubes. Algorithms for Clustering Data. Prentice-Hall, 1998.

    Google Scholar 

  2. Tolga Bozkaya and Meral Ozsoyoglu. Distance-based indexing for high-dimensional metric spaces. Proceedings of the ACM SIGMOD Conference on Management of Data, Tucson, Arizona, May 1997, pages 357–368, May 1997.

    Google Scholar 

  3. Masajiro Iwasaki. Implementation and evaluation of metric space indices for similarity search. IPSJ Transactions on Databases, in Japanese, 40:24–44, Feb 1999.

    Google Scholar 

  4. Marco Patella Paolo Ciaccia and Pavel Zezula. M-tree: An efficient access method for similarity search in metric spaces. In Proceedings of the 23rd VLDB Conference, Athenes, Greece, 1997, pages 357–368, 1997.

    Google Scholar 

  5. Divyakant Agrawal Ravi Kanth and Ambuj K. Singh. Dimensionality reduction for similarity searching in dynamic databases. In Proceeding of ACM SIGMOD Conference on Management of Data, Seattle, Washington, 1998, pages 166–176, 1998.

    Google Scholar 

  6. S. Brin. Near neighbour search in large metric spaces. In Proceedings of the 21st VLDB Conference, Zurich, Switzerland, 1995, pages 574–584, 1995.

    Google Scholar 

  7. Jeffery K. Uhlmann. Satisfying general proximity/similary queries with metric trees. Information Processing Letters, 40:175–179, November 1991.

    Google Scholar 

  8. Masatoshi Yoshikawa Yasushi Sakurai and Shunsuke Uemura. High-dimensional nearest neighbour search based on virtual bounding rectangles. In Proceeding of International Conference on FODO’98, pages 258–267, 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ishikawa, M., Chen, H., Furuse, K., Xu Yu, J., Ohbo, N. (2000). MB+tree: A Dynamically Updatable Metric Index for Similarity Search. In: Lu, H., Zhou, A. (eds) Web-Age Information Management. WAIM 2000. Lecture Notes in Computer Science, vol 1846. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45151-X_34

Download citation

  • DOI: https://doi.org/10.1007/3-540-45151-X_34

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67627-0

  • Online ISBN: 978-3-540-45151-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics