Abstract
In a large distributed system spanning many administrative domains, such as a Data Grid, it is desirable to maintain and query dynamic and timely information about active participants such as services, resources and user communities. However, in such a database system, the set of information tuples in the universe is partitioned over many distributed nodes, for reasons including autonomy, scalability, availability, performance and security. It is not obvious how to enable general-purpose discovery query support and collective collaborative functionality that operate on the distributed system as a whole, rather than on a given part of it. Further, it is not obvious how to deliver search results that are fresh when content is dynamic. It appears that a Peer-to-Peer (P2P) database network may be well suited to support dynamic distributed database search, for example for service discovery. In this paper, we devise the Unified Peer-to-Peer Database Framework (UPDF), which allows one to express specific applications for arbitrary query languages (e.g. XQuery, SQL) and node topologies, and for a wide range of data types, query response modes (e.g. Routed, Direct and Referral Response), neighbor selection policies, pipelining characteristics, timeouts and other scope options.
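To make the setting concrete, the following is a minimal sketch of a query that operates on a partitioned tuple set as a whole. It is not UPDF's interface: the `Node` class, the `query` method, the hop-count `radius` scope and the example hosts are all hypothetical names chosen for illustration. Each node evaluates a predicate on its local tuples, forwards the query to its neighbors within the radius, and passes results back along the path the query took, in the spirit of a routed response.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set

Tuple_ = Dict[str, str]  # one information tuple (hypothetical representation)


@dataclass
class Node:
    """Hypothetical peer holding one partition of the tuple set."""
    name: str
    tuples: List[Tuple_] = field(default_factory=list)
    neighbors: List["Node"] = field(default_factory=list)
    seen: Set[str] = field(default_factory=set)  # query ids already handled

    def query(self, query_id: str,
              predicate: Callable[[Tuple_], bool],
              radius: int) -> List[Tuple_]:
        # Suppress duplicates: a query that arrives again returns nothing.
        if query_id in self.seen:
            return []
        self.seen.add(query_id)
        # Evaluate the query against the local partition.
        results = [t for t in self.tuples if predicate(t)]
        # Forward to neighbors while the hop-count scope allows it.
        # The recursive call stands in for a network message; its return
        # value travels back along the query path.
        if radius > 0:
            for peer in self.neighbors:
                results.extend(peer.query(query_id, predicate, radius - 1))
        return results


# Three peers in a line: a - b - c (hosts are made-up examples).
a = Node("a", [{"service": "replica-catalog", "host": "a.example.org"}])
b = Node("b", [{"service": "scheduler", "host": "b.example.org"}])
c = Node("c", [{"service": "replica-catalog", "host": "c.example.org"}])
a.neighbors, b.neighbors, c.neighbors = [b], [a, c], [b]


def is_catalog(t: Tuple_) -> bool:
    return t["service"] == "replica-catalog"


near = a.query("q1", is_catalog, radius=1)  # reaches a and b only
far = a.query("q2", is_catalog, radius=2)   # reaches a, b and c
```

With a radius of 1 the query sees only the partitions held by `a` and `b`; widening the radius to 2 also reaches `c`. A real deployment would add timeouts and a neighbor selection policy, which this sketch leaves out.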
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hoschek, W. (2002). A Unified Peer-to-Peer Database Framework for Scalable Service and Resource Discovery. In: Parashar, M. (eds) Grid Computing — GRID 2002. GRID 2002. Lecture Notes in Computer Science, vol 2536. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36133-2_12
DOI: https://doi.org/10.1007/3-540-36133-2_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00133-1
Online ISBN: 978-3-540-36133-6
eBook Packages: Springer Book Archive