Skip to main content

An Adaptive Distributed Index for Similarity Queries in Metric Spaces

  • Conference paper
Web-Age Information Management (WAIM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7418))

Included in the following conference series:

Abstract

As the amount of data is growing rapidly, efficient and scalable index structures for managing large-scale data are attracting more and more attention. To efficiently query and manage the data in metric spaces, an adaptive distributed index, MT-Chord, is proposed. MT-Chord integrates Chord based routing protocol and M-tree based index structure to support efficient similarity query processing in metric spaces. Each index node has multiple replicas for load-balance and a cost model is presented to dynamically tune the number of replicas based on the query and update pattern at the granularity of each index node. MT-Chord is a truly scalable, efficient and adaptive distributed index structure for query processing in metric spaces, which is verified by our extensive experimental studies on three real-life datasets extracted from different data sources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chen, G., Vo, H.T., Wu, S., Ooi, B.C., Özsu, M.T.: A Framework for Supporting DBMS-like Indexes in the Cloud. In: Proc. of VLDB, pp. 702–713 (2011)

    Google Scholar 

  2. Cao, Y., Chen, C., Guo, F., Jiang, D., Lin, Y.: A Cloud Data Storage System for Supporting Both OLTP and OLAP. In: Proc. of ICDE, pp. 291–302 (2011)

    Google Scholar 

  3. Ciaccia, P., Patella, M., Zezula, P.: M-Tree: An Efficient Access Method for Similarity Search in Metric Spaces. In: Proc. of VLDB, pp. 426–435 (1997)

    Google Scholar 

  4. Chiueh, T.C.: Content-based Image Indexing. In: Proc. of VLDB, pp. 582–593 (1994)

    Google Scholar 

  5. Wang, J., Wu, S., Gao, H., Li, J., Ooi, B.C.: Indexing Multi-dimensional Data in a Cloud System. In: Proc. of SIGMOD, pp. 591–602 (2010)

    Google Scholar 

  6. Tanin, E., Harwood, A., Samet, H.: Using a Distributed Quadtree Index in Peer-to-Peer Networks. VLDB J. (VLDB) 16(2), 165–178 (2007)

    Article  Google Scholar 

  7. Batko, M., Gennaro, C., Savino, P., Zezula, P.: Scalable Similarity Search in Metric Spaces. In: Proc. of DELOS, pp. 213–224 (2004)

    Google Scholar 

  8. Novak, D., Batko, M.: Metric Index: An Efficient and Scalable Solution for Similarity Search. In: Proc. of SISAP, pp. 65–73 (2009)

    Google Scholar 

  9. Novak, D., Zezula, P.: M-Chord: A Scalable Distributed Similarity Search Structure. In: Proc. of INFOSCALE, pp. 181–190 (2006)

    Google Scholar 

  10. Falchi, F., Gennaro, C., Zezula, P.: Nearest Neighbor Search in Metric Spaces Through Content-Addressable Networks. Inf. Process. Manage. 43(3), 665–683 (2007)

    Article  Google Scholar 

  11. Stoica, I., Morris, R., Karger, D.R., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications. In: Proc. of SIGCOMM, pp. 149–160 (2001)

    Google Scholar 

  12. Wu, S., Jiang, D., Ooi, B.C., Wu, K.: Efficient B-tree Based Indexing for Cloud Data Processing. In: Proc. of VLDB, pp. 1207–1218 (2010)

    Google Scholar 

  13. Jagadish, H.V., Ooi, B.C., Tan, K., Yu, C., Zhang, R.: iDistance: An Adaptive B+-Tree Based Indexing Method for Nearest Neighbor Search. ACM Trans. Database Syst. 30(2), 364–397 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhu, M., Shen, D., Kou, Y., Nie, T., Yu, G. (2012). An Adaptive Distributed Index for Similarity Queries in Metric Spaces. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32281-5_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32280-8

  • Online ISBN: 978-3-642-32281-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics