Abstract
The Distance Index (D-index) is a recently introduced metric indexing structure which capable of state-of-the-art performance in large scale metric search applications. In this paper we address the problem of how to balance the D-index structure for more efficient similarity search. A group of evaluation functions measuring the balance property of a D-index structure are introduced to guide the construction of the indexing structure. The optimization is formulated in a genetic representation that is effectively solved by a generic genetic algorithm (GA). Compared with the classic D-index, balanced D-index structures show a significant improvement in reduction of distance calculations while maintaining a good input-output (IO) performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yianilos, P.N.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: ACM-SIAM Symposium on Discrete Algorithms, pp. 311–321 (1993)
Uhlmann, J.K.: Satisfying general proximity/similarity queries with metric trees. Information Processing Letters 40(4), 175–179 (1991)
Ciaccia, P., Patella, M., Zezula, P.: M-tree: an efficient access method for similarity search in matric spaces. In: Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB 1997), pp. 426–435 (1997)
Vidal, E.: New formulation and improvements of the nearest-neighbor approximating and eliminating search algorithm (AESA). Pattern Recognition Letters 15(1), 1–7 (1994)
Micó, M.L., Oncina, J., Vidal, E.: A new version of the nearest-neighbor approximating and eliminating search algorithm (AESA) with linear preprocessing time and memory requirements. Pattern Recognition Letters 15(1), 9–17 (1994)
Dohnal, V., Gennaro, C., Savino, P., Zezula, P.: D-Index: distance searching index for metric sata sets. Multimedia Tools and Applications 21(1), 9–33 (2003)
Schmitt, L.M.: Theory of Genetic Algorithms. Theoretical Computer Science 259, 1–61 (2001)
Data available at http://www.net-comber.com/wordurls.html
Ortega, M., Rui, Y., et al.: Supporting ranked Boolean similarity queries in MARS. IEEE Transaction on Knowledge and Data Engineering 10(6), 905–925 (1998)
Data available at http://www.ics.uci.edu/~mlearn/MLRepository.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ban, T. (2008). Using Genetic Algorithm to Balance the D-Index Algorithm for Metric Search. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds) Neural Information Processing. ICONIP 2007. Lecture Notes in Computer Science, vol 4985. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69162-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-69162-4_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69159-4
Online ISBN: 978-3-540-69162-4
eBook Packages: Computer ScienceComputer Science (R0)