Skip to main content

A Unified Peer-to-Peer Database Framework for Scalable Service and Resource Discovery

  • Conference paper
  • First Online:
Grid Computing — GRID 2002 (GRID 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2536))

Included in the following conference series:

Abstract

In a large distributed system spanning many administrative domains such as a Data Grid, it is desirable to maintain and query dynamic and timely information about active participants such as services, resources and user communities. However, in such a database system, the set of information tuples in the universe is partitioned over many distributed nodes, for reasons including autonomy, scalability, availability, performance and security. It is not obvious how to enable generalpurpose discovery query support and collective collaborative functionality that operate on the distributed system as a whole, rather than on a given part of it. Further, it is not obvious how to allow for search results that are fresh, allowing dynamic content. It appears that a Peer-to-Peer (P2P) database network may be well suited to support dynamic distributed database search, for example for service discovery. In this paper, we devise the Unified Peer-to-Peer Database Framework (UPDF), which allows to express specific applications for arbitrary query languages (e.g. XQuery, SQL) and node topologies, and a wide range of data types, query response modes (e.g. Routed, Direct and Referral Response), neighbor selection policies, pipelining characteristics, timeout and other scope options.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ian Foster, Carl Kesselman, and Steve Tuecke. The Anatomy of the Grid: Enabling Scalable Virtual Organizations. Int’l. Journal of Supercomputer Applications, 15(3), 2001.

    Google Scholar 

  2. Large Hadron Collider Committee. Report of the LHC Computing Review. Technical report, CERN/LHCC/2001-004, April 2001. http://cern.ch/lhc-computingreview-public/Public/Report final.PDF.

  3. Wolfgang Hoschek. A Unified Peer-to-Peer Database Framework for XQueries over Dynamic Distributed Content and its Application for Scalable Service Discovery. PhD Thesis, Technical University of Vienna, March 2002.

    Google Scholar 

  4. Ben Segal. Grid Computing: The European Data Grid Project. In IEEE Nuclear Science Symposium and Medical Imaging Conference, Lyon, France, October 2000.

    Google Scholar 

  5. Wolfgang Hoschek, Javier Jaen-Martinez, Asad Samar, Heinz Stockinger, and Kurt Stockinger. Data Management in an International Data Grid Project. In 1st IEEE/ACM Int’l. Workshop on Grid Computing (Grid’2000), Bangalore, India, December 2000.

    Google Scholar 

  6. Dirk Düllmann, Wolfgang Hoschek, Javier Jean-Martinez, Asad Samar, Ben Segal, Heinz Stockinger, and Kurt Stockinger. Models for Replica Synchronisation and Consistency in a Data Grid. In 10th IEEE Symposium on High Performance and Distributed Computing (HPDC-10), San Francisco, California, August 2001.

    Google Scholar 

  7. Ian Foster, Carl Kesselman, Jeffrey Nick, and Steve Tuecke. The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration, January 2002. http://www.globus.org.

  8. J.D. Ullman. Information integration using logical views. In Int’l. Conf. on Database Theory (ICDT), Delphi, Greece, 1997.

    Google Scholar 

  9. Daniela Florescu, Ioana Manolescu, Donald Kossmann, and Florian Xhumari. Agora: Living with XML and Relational. In Int’l. Conf. on Very Large Data Bases (VLDB), Cairo, Egypt, February 2000.

    Google Scholar 

  10. A. Tomasic, L. Raschid, and P. Valduriez. Scaling access to heterogeneous data sources with DISCO. IEEE Transactions on Knowledge and Data Engineering, 10(5):808–823, 1998.

    Article  Google Scholar 

  11. M. Tamer Özsu and Patrick Valduriez. Principles of Distributed Database Systems. Prentice Hall, 1999.

    Google Scholar 

  12. Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschek, Adriana Iamnitchi, Carl Kesselman, Peter Kunszt, Matei Ripeanu, Bob Schwartzkopf, Heinz Stockinger, Kurt Stockinger, and Brian Tierney. Giggle: A Framework for Constructing Scalable Replica Location Services. In Proc. of the Int’l. IEEE/ACM Supercomputing Conference (SC 2002), Baltimore, USA, November 2002. IEEE Computer Society Press.

    Google Scholar 

  13. Leanne Guy, Peter Kunszt, Erwin Laure, Heinz Stockinger, and Kurt Stockinger. Replica Management in Data Grids. Technical report, Global Grid Forum InformationalDocument, GGF5, Edinburgh, Scotland, July 2002.

    Google Scholar 

  14. Heinz Stockinger, Asad Samar, Shahzad Mufzaffar, and Flavia Donno. Grid Data Mirroring Package (GDMP). Journal of Scientific Programming, 2002.

    Google Scholar 

  15. William Bell, Diana Bosio, Wolfgang Hoschek, Peter Kunszt, Gavin McCance, and Mika Silander. Project Spitfire-Towards Grid Web Service Databases. Technical report, Global Grid Forum Informational Document, GGF5, Edinburgh, Scotland, July 2002.

    Google Scholar 

  16. Nelson Minar. Peer-to-Peer is Not Always Decentralized. In The O’Reilly Peerto-Peer and Web Services Conference, Washington, D.C., November 2001.

    Google Scholar 

  17. Wolfgang Hoschek. The Web Service Discovery Architecture. In Proc. of the Int’l. IEEE/ACM Supercomputing Conference (SC 2002), Baltimore, USA, November 2002. IEEE Computer Society Press.

    Google Scholar 

  18. Wolfgang Hoschek. A Database for Dynamic Distributed Content and its Application for Service and Resource Discovery. In Int’l. IEEE Symposium on Parallel and Distributed Computing (ISPDC 2002), Iasi, Romania, July 2002.

    Google Scholar 

  19. E. Christensen, F. Curbera, G. Meredith, and S. Weerawarana. Web Services Description Language (WSDL) 1.1. W3C Note 15, 2001. http://www.w3.org/TR/wsdl.

  20. World Wide Web Consortium. XQuery 1.0: An XML Query Language. W3C Working Draft, December 2001.

    Google Scholar 

  21. Wolfgang Hoschek. A Comparison of Peer-to-Peer Query Response Modes. In Proc. of the Int’l. Conf. on Parallel and Distributed Computing and Systems (PDCS 2002), Cambridge, USA, November 2002.

    Google Scholar 

  22. Gnutella Community. Gnutella Protocol Specification v0.4. dss.clip2.com/GnutellaProtocol04.pdf.

    Google Scholar 

  23. I. Clarke, O. Sandberg, B. Wiley, and T. Hong. Freenet: A distributed anonymous information storage and retrieval system. In Workshop on Design Issues in Anonymity and Unobservability, 2000.

    Google Scholar 

  24. B. Zhao, J. Kubiatowicz, and A. Joseph. Tapestry: An infrastructure for fault-resilient wide-area location and routing. Technical report, U.C. Berkeley UCB//CSD-01-1141, 2001.

    Google Scholar 

  25. I. Stoica, R. Morris, D. Karger, M. Kaashoek, and H. Balakrishnan. Chord: A scalable peer-to-peer lookup service for internet applications. In ACM SIGCOMM, 2001.

    Google Scholar 

  26. M. van Steen, P. Homburg, and A. Tanenbaum. A wide-area distributed system. IEEE Concurrency, 1999.

    Google Scholar 

  27. Steven E. Czerwinski, Ben Y. Zhao, Todd Hodes, Anthony D. Joseph, and Randy Katz. An Architecture for a Secure Service Discovery Service. In Fifth Annual Int’l. Conf. on Mobile Computing and Networks (MobiCOM’ 99), Seattle, WA, August 1999.

    Google Scholar 

  28. J. Waldo. The Jini architecture for network-centric computing. Communications of the ACM, 42(7), July 1999.

    Google Scholar 

  29. W. Yeong, T. Howes, and S. Kille. Lightweight Directory Access Protocol. IETF RFC 1777, March 1995.

    Google Scholar 

  30. Karl Czajkowski, Steven Fitzgerald, Ian Foster, and Carl Kesselman. Grid Information Services for Distributed Resource Sharing. In Tenth IEEE Int’l. Symposium on High-Performance Distributed Computing (HPDC-10), San Francisco, California, August 2001.

    Google Scholar 

  31. International Organization for Standardization (ISO). Information Technology-Database Language SQL. Standard No. ISO/IEC 9075:1999, 1999.

    Google Scholar 

  32. Donald Kossmann. The state of the art in distributed query processing. ACM Computing Surveys, September 2000.

    Google Scholar 

  33. T. Urhan and M. Franklin. Dynamic Pipeline Scheduling for Improving Interactive Query Performance. The Very Large Database (VLDB) Journal, 2001.

    Google Scholar 

  34. Jordan Ritter. Why Gnutella Can’t Scale. No, Really. http://www.tch.org/gnutella.html.

  35. Matei Ripeanu. Peer-to-Peer Architecture Case Study: Gnutella Network. In Int’l. Conf. on Peer-to-Peer Computing (P2P2001), Linkoping, Sweden, August 2001.

    Google Scholar 

  36. S.E. Deering. Multicast Routing in a Datagram Internetwork. PhD Thesis, Stanford University, 1991.

    Google Scholar 

  37. IEEE Computer Society. Data Engineering Bulletin, 23(2), June 2000.

    Google Scholar 

  38. Jayavel Shanmugasundaram, Kristin Tufte, David J. DeWitt, Jeffrey F. Naughton, and David Maier. Architecting a Network Query Engine for Producing Partial Results. In WebDB 2000, 2000.

    Google Scholar 

  39. Zachary G. Ives, Alon Y. Halevy, and Daniel S. Weld. Integrating Network-Bound XML Data. IEEE Data Engineering Bulletin, 24(2), 2001.

    Google Scholar 

  40. J. F. Naughton, D. J. DeWitt, D. Maier, A. Aboulnaga, J. Chen, L. Galanis, J. Kang, R. Krishnamurthy, Q. Luo, N. Prakash, R. Ramamurthy, J. Shanmugasundaram, F. Tian, K. Tufte, S. Viglas, Y. Wang, C. Zhang, B. Jackson, A. Gupta, and R. Chen. The Niagara Internet Query System. IEEE Data Engineering Bulletin, 24(2), 2001.

    Google Scholar 

  41. Annita N. Wilschut and Peter M. G. Apers. Dataflowquery execution in a parallel main-memory environment. In First Int’l. Conf. on Parallel and Distributed Information Systems, December 1991.

    Google Scholar 

  42. Zachary G. Ives, Daniela Florescu, Marc T. Friedman, Alon Y. Levy, and Daniel S. Weld. An adaptive query execution system for data integration. In ACM SIGMOD Conf. On Management of Data, 1999.

    Google Scholar 

  43. Tolga Urhan and Michael J. Franklin. Xjoin, A reactively-scheduled pipelined join operator. IEEE Data Engineering Bulletin, 23(2), June 2000.

    Google Scholar 

  44. Beverly Yang and Hector Garcia-Molina. Efficient Search in Peer-to-Peer Networks. In 22nd Int’l. Conf. on Distributed Computing Systems, Vienna, Austria, July 2002.

    Google Scholar 

  45. Adriana Iamnitchi and Ian Foster. On Fully Decentralized Resource Discovery in Grid Environments. In Int’l. IEEE Workshop on Grid Computing, Denver, Colorado, November 2001.

    Google Scholar 

  46. L. Adamic, R. Lukose, A. Puniyani, and B. Huberman. Search in power-law networks. Phys. Rev, E(64), 2001.

    Google Scholar 

  47. Bernard Traversat, Mohamed Abdelaziz, Mike Duigou, Jean-Christophe Hugly, Eric Pouyoul, and Bill Yeager. Project JXTA Virtual Network, 2002. White Paper, http://www.jxta.org.

  48. Steven Waterhouse. JXTA Search: Distributed Search for Distributed Networks, 2001. White Paper, http://www.jxta.org.

  49. P. Mockapetris. Domain Names-Implementation and Specification. IETF RFC 1035, November 1987.

    Google Scholar 

  50. International Telecommunications Union. Recommendation X.500, Information technology-Open System Interconnection-The directory: Overviewof concepts, models, and services. ITU-T, November 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hoschek, W. (2002). A Unified Peer-to-Peer Database Framework for Scalable Service and Resource Discovery. In: Parashar, M. (eds) Grid Computing — GRID 2002. GRID 2002. Lecture Notes in Computer Science, vol 2536. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36133-2_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-36133-2_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00133-1

  • Online ISBN: 978-3-540-36133-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics