Abstract
A critical performance issue in searching distance-based indexing trees is that commonly multiple nodes, or data partitions, have to be further examined at each level of the tree. As a result, logarithmic search time is commonly not achievable. To solve this problem, excluded middle forest builds a tree for each data partition to be descended, so that logarithmic search time is achieved for each tree. Promising empirical results are reported from excluded middle forest. However, we observe that what determines whether a data partition has to be further examined is that data composing it, but not its location. Although each tree of the excluded middle forest can be searched in logarithmic time, the overall query performance is encumbered with the existence of multiple trees. We show analytically and empirically that multiple vantage point tree outperforms excluded middle forest.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Uhlmann JK (1991) Satisfying general proximity/similarity queries with metric trees. Inf Process Lett 40(4):175–179
Yianilos P (1999) Excluded middle vantage point forests for nearest neighbor search. In: DIMACS implementation challenge, ALENEX’99, Baltimore, MD
Bozkaya T, Ozsoyoglu M (1999) Indexing large metric spaces for similarity search queries. ACM Trans Database Syst 24(3):361–404
Bozkaya T, Ozsoyoglu M (1997) Distance-based indexing for high-dimensional metric spaces. In: Peckman JM, Ram S, Franklin M (eds) Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD ‘97, Tucson, AZ, May 13–15) ACM Press, New York, NY, pp 357–368
Bentley JL (1975) Multidimensional binary search trees used for associative searching. Commun ACM 18(9):509–517
Xu W, Thompson LP, Miranker DP Empirical evaluation of excluded middle vantage point forest on biological sequences workload. NTSS 2011, Mar 25 2011, Uppsala, Sweden
Mao R, Willard LM, Daniel PM (2010) Dimension reduction for distance-based indexing. In: Proceedings of the Third International Conference on SImilarity Search and Applications (SISAP2010), pages 25–32, Istanbul, Turkey, September 18–19, 2010
SISAP Metric spaces library (2010) http://sisap.org/Metric_Space_Library.html
SISAP test suite. http://sisap.org/Metric_Space_Library.html
CGeDBIT program. http://nhpcc.szu.edu.cn/mao/gedbit/index.html
Acknowledgment
This research was supported by the following grants: China 863: 2012AA010239; NSF-China: 61170076, U1301252; Shenzhen Foundational Research Project: JCYJ20120613161449764.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, Q., Zhang, H., Lei, F., Liu, G., Lu, M., Mao, R. (2014). Excluded Middle Forest Versus Vantage Point Tree: An Analytical and Empirical Comparison. In: Wen, Z., Li, T. (eds) Practical Applications of Intelligent Systems. Advances in Intelligent Systems and Computing, vol 279. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54927-4_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-54927-4_41
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54926-7
Online ISBN: 978-3-642-54927-4
eBook Packages: EngineeringEngineering (R0)