Abstract
Due to the increasing number of independent data providers on the web, there is a growing number of web applications that require locating data sources distributed over the internet. Most of the current proposals in the literature focus on developing effective routing data synopses to answer simple XPath queries in structured or unstructured P2P networks. In this paper, we present an effective framework to support XPath queries extended with full-text search predicates over schema-less XML data distributed in a DHT-based P2P network. We construct two concise routing data synopses, termed structural summary and peer-document synopsis, to route the user query to most relevant peers that own documents that can satisfy the query. To evaluate the structural components in the query, a general query footprint derivation algorithm is developed to extract the query footprint from the query and match it with structural summaries. To improve the search performance, we adopt a lazy query evaluation strategy for evaluating the full-text search predicates in the query. Finally, we develop effective strategies to balance the data load distribution in the system. We conduct extensive experiments to show the scalability of our system, validate the efficiency and accuracy of our routing data synopses, and demonstrate the effectiveness of our load balancing schemes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abiteboul, S., Manolescu, I., Preda, N.: Constructing and Querying Peer-to-Peer Warehouses of XML Resources. In: Proc. of ICDE, Tokyo, Japan, pp. 1122–1123 (2005)
Berkeley DB. http://www.oracle.com/database/berkeley-db
Bonifati, A., Matrangolo, U., Cuzzocrea, A., Jain, M.: XPath Lookup Queries in P2P Networks. In: Proc. of ACM WIDM, Washington DC, USA, pp. 48–55 (2004)
Fegaras, L., He, W., Das, G., Levine, D.: XML Query Routing in Structured P2P Systems. In: Moro, G., Bergamaschi, S., Joseph, S., Morin, J.-H., Ouksel, A.M. (eds.) DBISP2P 2005 and DBISP2P 2006. LNCS, vol. 4125, pp. 13–24. Springer, Heidelberg (2007)
Galanis, L., Wang, Y., Jeffery, S.R., DeWitt, D.J.: Locating Data Sources in Large Distributed Systems. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 874–885. Springer, Heidelberg (2004)
Kaushik, R., Bohannon, P., Naughton, J.F., Korth, H.: Covering indexes for branching path queries. In: Proc. of SIGMOD, Madison, USA, pp. 133–144 (2002)
Kaushik, R., Bohannon, P., Naughton, J.F., Shenoy, P.: Updates for Structure Indexes. In: Bressan, S., Chaudhri, A.B., Li Lee, M., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, pp. 239–250. Springer, Heidelberg (2003)
Koloniari, G., Pitoura, E.: Content-Based Routing of Path Queries in Peer-to-Peer Systems. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 29–47. Springer, Heidelberg (2004)
Koloniari, G., Pitoura, E.: Peer-to-peer management of XML data: issues and research challenges. SIGMOD Record 34, 6–17 (2005)
Qizx/open, http://www.axyana.com/qizxopen/
Pastry, http://freepastry.rice.edu
Stoica, I., et al.: Chord: A Scalable Peer-to-Peer Lookup Protocol for Internet Applications. IEEE/ACM Trans. on Networking 11, 17–32 (2003)
Wang, Q., Jha, A.K., Ozsu, M.T.: An XML Routing Synopsis for Unstructured P2P Networks. In: Proc. of the 7th Int. Conference on Web Age Information Management Workshop(WAIMW), p. 23, Hongkong, China (2006)
XML Path Language (XPath) 2.0, http://www.w3.org/TR/xpath20/
XQuery 1.0: An XML Query Language, http://www.w3.org/TR/xquery/
Zhang, C., et al.: On Supporting Containment Queries in Relational Database Management Systems. In: Proc. of SIGMOD, Santa Barbara, USA, pp. 425–436 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
He, W., Fegaras, L. (2008). Approximate XML Query Answers in DHT-Based P2P Networks. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds) Database Systems for Advanced Applications. DASFAA 2008. Lecture Notes in Computer Science, vol 4947. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78568-2_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-78568-2_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78567-5
Online ISBN: 978-3-540-78568-2
eBook Packages: Computer ScienceComputer Science (R0)