Skip to main content

Approximate XML Query Answers in DHT-Based P2P Networks

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4947))

Included in the following conference series:

  • 989 Accesses

Abstract

Due to the increasing number of independent data providers on the web, there is a growing number of web applications that require locating data sources distributed over the internet. Most of the current proposals in the literature focus on developing effective routing data synopses to answer simple XPath queries in structured or unstructured P2P networks. In this paper, we present an effective framework to support XPath queries extended with full-text search predicates over schema-less XML data distributed in a DHT-based P2P network. We construct two concise routing data synopses, termed structural summary and peer-document synopsis, to route the user query to most relevant peers that own documents that can satisfy the query. To evaluate the structural components in the query, a general query footprint derivation algorithm is developed to extract the query footprint from the query and match it with structural summaries. To improve the search performance, we adopt a lazy query evaluation strategy for evaluating the full-text search predicates in the query. Finally, we develop effective strategies to balance the data load distribution in the system. We conduct extensive experiments to show the scalability of our system, validate the efficiency and accuracy of our routing data synopses, and demonstrate the effectiveness of our load balancing schemes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abiteboul, S., Manolescu, I., Preda, N.: Constructing and Querying Peer-to-Peer Warehouses of XML Resources. In: Proc. of ICDE, Tokyo, Japan, pp. 1122–1123 (2005)

    Google Scholar 

  2. Berkeley DB. http://www.oracle.com/database/berkeley-db

  3. Bonifati, A., Matrangolo, U., Cuzzocrea, A., Jain, M.: XPath Lookup Queries in P2P Networks. In: Proc. of ACM WIDM, Washington DC, USA, pp. 48–55 (2004)

    Google Scholar 

  4. Fegaras, L., He, W., Das, G., Levine, D.: XML Query Routing in Structured P2P Systems. In: Moro, G., Bergamaschi, S., Joseph, S., Morin, J.-H., Ouksel, A.M. (eds.) DBISP2P 2005 and DBISP2P 2006. LNCS, vol. 4125, pp. 13–24. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  5. Galanis, L., Wang, Y., Jeffery, S.R., DeWitt, D.J.: Locating Data Sources in Large Distributed Systems. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 874–885. Springer, Heidelberg (2004)

    Google Scholar 

  6. Kaushik, R., Bohannon, P., Naughton, J.F., Korth, H.: Covering indexes for branching path queries. In: Proc. of SIGMOD, Madison, USA, pp. 133–144 (2002)

    Google Scholar 

  7. Kaushik, R., Bohannon, P., Naughton, J.F., Shenoy, P.: Updates for Structure Indexes. In: Bressan, S., Chaudhri, A.B., Li Lee, M., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, pp. 239–250. Springer, Heidelberg (2003)

    Google Scholar 

  8. Koloniari, G., Pitoura, E.: Content-Based Routing of Path Queries in Peer-to-Peer Systems. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 29–47. Springer, Heidelberg (2004)

    Google Scholar 

  9. Koloniari, G., Pitoura, E.: Peer-to-peer management of XML data: issues and research challenges. SIGMOD Record 34, 6–17 (2005)

    Article  Google Scholar 

  10. Qizx/open, http://www.axyana.com/qizxopen/

  11. Pastry, http://freepastry.rice.edu

  12. Stoica, I., et al.: Chord: A Scalable Peer-to-Peer Lookup Protocol for Internet Applications. IEEE/ACM Trans. on Networking 11, 17–32 (2003)

    Article  Google Scholar 

  13. Wang, Q., Jha, A.K., Ozsu, M.T.: An XML Routing Synopsis for Unstructured P2P Networks. In: Proc. of the 7th Int. Conference on Web Age Information Management Workshop(WAIMW), p. 23, Hongkong, China (2006)

    Google Scholar 

  14. XMark, http://www.xml-benchmark.org/

  15. XML Path Language (XPath) 2.0, http://www.w3.org/TR/xpath20/

  16. XQuery 1.0: An XML Query Language, http://www.w3.org/TR/xquery/

  17. Zhang, C., et al.: On Supporting Containment Queries in Relational Database Management Systems. In: Proc. of SIGMOD, Santa Barbara, USA, pp. 425–436 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Jayant R. Haritsa Ramamohanarao Kotagiri Vikram Pudi

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, W., Fegaras, L. (2008). Approximate XML Query Answers in DHT-Based P2P Networks. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds) Database Systems for Advanced Applications. DASFAA 2008. Lecture Notes in Computer Science, vol 4947. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78568-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78568-2_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78567-5

  • Online ISBN: 978-3-540-78568-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics