Abstract
One of the common query patterns is to find approximate matches to a given query object in a large database. This kind of query processing is referred as similarity search in a metric space. In this paper, we propose a new metric index MB+tree, called Metric B+tree, which supports near neighbour searching in a generic metric space. MB+tree is aimed at reducing both the number of I/O accesses and the number of distance calculations for similarity search in large databases, while allowing dynamic data updates. In this paper, we show that a B+tree, with an auxiliary tree, can be used as a metric index. Unlike other multidimensional (spatial) access methods, using our approach, we can partition data into disjoint partitions while building/maintaining a metric index, which can lead to a significant cost reduction since the number of metric sub-spaces to be searched is reduced. In order to use MB+tree, a slicing value is proposed. With the slicing value, in addition to space division information, a near neighbour searching can be systematically converted to a range search in B+tree. Several different slicing values are considered namely, one-focus-point scheme and two-focus-point scheme. We also conducted extensive experimental studies using synthetic data. Results are reported in this paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Reference
A.K. Jain and R.C. Dubes. Algorithms for Clustering Data. Prentice-Hall, 1998.
Tolga Bozkaya and Meral Ozsoyoglu. Distance-based indexing for high-dimensional metric spaces. Proceedings of the ACM SIGMOD Conference on Management of Data, Tucson, Arizona, May 1997, pages 357–368, May 1997.
Masajiro Iwasaki. Implementation and evaluation of metric space indices for similarity search. IPSJ Transactions on Databases, in Japanese, 40:24–44, Feb 1999.
Marco Patella Paolo Ciaccia and Pavel Zezula. M-tree: An efficient access method for similarity search in metric spaces. In Proceedings of the 23rd VLDB Conference, Athenes, Greece, 1997, pages 357–368, 1997.
Divyakant Agrawal Ravi Kanth and Ambuj K. Singh. Dimensionality reduction for similarity searching in dynamic databases. In Proceeding of ACM SIGMOD Conference on Management of Data, Seattle, Washington, 1998, pages 166–176, 1998.
S. Brin. Near neighbour search in large metric spaces. In Proceedings of the 21st VLDB Conference, Zurich, Switzerland, 1995, pages 574–584, 1995.
Jeffery K. Uhlmann. Satisfying general proximity/similary queries with metric trees. Information Processing Letters, 40:175–179, November 1991.
Masatoshi Yoshikawa Yasushi Sakurai and Shunsuke Uemura. High-dimensional nearest neighbour search based on virtual bounding rectangles. In Proceeding of International Conference on FODO’98, pages 258–267, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ishikawa, M., Chen, H., Furuse, K., Xu Yu, J., Ohbo, N. (2000). MB+tree: A Dynamically Updatable Metric Index for Similarity Search. In: Lu, H., Zhou, A. (eds) Web-Age Information Management. WAIM 2000. Lecture Notes in Computer Science, vol 1846. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45151-X_34
Download citation
DOI: https://doi.org/10.1007/3-540-45151-X_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67627-0
Online ISBN: 978-3-540-45151-8
eBook Packages: Springer Book Archive