Abstract
As data volumes grow rapidly, efficient and scalable index structures for managing large-scale data are attracting increasing attention. To query and manage data in metric spaces efficiently, this paper proposes MT-Chord, an adaptive distributed index. MT-Chord integrates a Chord-based routing protocol with an M-tree-based index structure to support efficient similarity query processing in metric spaces. Each index node has multiple replicas for load balancing, and a cost model dynamically tunes the number of replicas of each index node according to its query and update pattern. Extensive experiments on three real-life datasets extracted from different data sources verify that MT-Chord is a scalable, efficient and adaptive distributed index structure for query processing in metric spaces.
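As a rough illustration of the kind of pruning that an M-tree-style index relies on for similarity (range) queries in a metric space, the following minimal sketch may help. It is not the MT-Chord algorithm, which the abstract does not specify. The names `Ball`, `range_query` and `euclidean` are hypothetical, and the flat list of regions stands in for a real tree or a set of distributed index nodes. The only fact used is the triangle inequality: a region with pivot p and covering radius c cannot contain any object within distance r of the query q when d(q, p) > c + r.

```python
import math
from dataclasses import dataclass, field


def euclidean(a, b):
    """One possible metric; any function satisfying the triangle inequality works."""
    return math.dist(a, b)


@dataclass
class Ball:
    """Hypothetical index region: a pivot object plus a covering radius.

    Every object stored in the region is assumed to lie within
    `radius` of `pivot` under the chosen metric.
    """
    pivot: tuple
    radius: float
    objects: list = field(default_factory=list)


def range_query(balls, q, r, dist=euclidean):
    """Return (objects within distance r of q, number of regions scanned)."""
    results = []
    scanned = 0
    for ball in balls:
        # Triangle-inequality pruning: if the query ball and the covering
        # ball do not intersect, no stored object can qualify.
        if dist(q, ball.pivot) > ball.radius + r:
            continue
        scanned += 1
        results.extend(o for o in ball.objects if dist(q, o) <= r)
    return results, scanned


if __name__ == "__main__":
    regions = [
        Ball(pivot=(0.0, 0.0), radius=1.5, objects=[(0.0, 0.0), (1.0, 1.0)]),
        Ball(pivot=(10.0, 10.0), radius=1.5, objects=[(10.0, 10.0), (11.0, 11.0)]),
    ]
    hits, scanned = range_query(regions, q=(0.5, 0.5), r=1.0)
    print(hits, scanned)
```

In a distributed setting, each skipped region would correspond to an index node that never needs to be contacted, which is why this test matters for routing cost as well as for local computation.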
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhu, M., Shen, D., Kou, Y., Nie, T., Yu, G. (2012). An Adaptive Distributed Index for Similarity Queries in Metric Spaces. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_22
DOI: https://doi.org/10.1007/978-3-642-32281-5_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer Science, Computer Science (R0)