Skip to main content

GDIS: A Service-Based Architecture for Data Integration on Grids

  • Conference paper
Book cover On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops (OTM 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3292))

Abstract

Distributed data sources can be heterogeneous in their formats, schemas, quality, access mechanisms, ownership, access policies, and capabilities. We need models and techniques for managing different data resources in an integrated way. Data integration is the flexible and managed federation, analysis, and processing of data from different distributed sources. Data integration is becoming as important as data mining for exploiting the value of large and distributed data sets that today are available. Distributed processing infrastructures such as Grids and peer-to-peer networks can be used for data integration on geographically distributed sites. This paper presents a service-based architecture for data integration on Grids. The basic model is discussed and its implementation based on the OGSA Globus architecture is described.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bernstein, P., Giunchiglia, F., Kementsietsidis, A., Mylopoulos, J., Serafini, L., Zaihrayeu, I.: Data Management for Peer-to-Peer Computing: A Vision. In: Proc. of the 5th International Workshop on the Web and Databases, WebDB, Madison, Wisconsin (June 2002)

    Google Scholar 

  2. Calvanese, D., Damaggio, E., De Giacomo, G., Lenzerini, M., Rosati, R.: Semantic Data Integration in P2P Systems. In: Proc. of Databases, Information Systems, and Peer-to-Peer Computing, 1st International Workshop, DBISP2P, Berlin, Germany, September 7-8 (2003)

    Google Scholar 

  3. Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration. The Globus Project (2002), www.globus.org

  4. Foster, I., Tuecke, S., Unger, J.: OGSA Data Services, DAIS-WG Informational Draft, 9th Global Grid Forum, August 14 (2003)

    Google Scholar 

  5. Foster, I., Grossman, R.L.: Blueprint for the future of high-performance networking: Data integration in a bandwidth-rich world. Communications of the ACM 46(11) (November 2003)

    Google Scholar 

  6. Franconi, E., Kuper, G.M., Lopatenko, A., Serafini, L.: A Robust Logical and Computational Characterisation of Peer-to-Peer Database Systems. In: Proc. of Databases, Information Systems, and Peer-to-Peer Computing, First International Workshop, DBISP2P, Berlin Germany, September 7-8 (2003)

    Google Scholar 

  7. Halevy, A., Ives, Z., Suciu, D., Tatarinov, I.: Schema Mediation in Peer Data Management Systems. In: Proc. of the 19th IEEE Int. Conf. on Data Engineering, ICDE 2003 (2003)

    Google Scholar 

  8. Kementsietsidis, A., Arenas, M., Miller, R.J.: Mapping data in peer-to-peer systems: Semantics and algorithmic issues. In: Proc. of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2003) (June 2003)

    Google Scholar 

  9. Lenzerini, M.: Data Integration: A Theoretical Perspective. In: Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (PODS), pp. 233–246. ACM Press, New York (2002)

    Google Scholar 

  10. Levy, Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: VLDB 1996, pp. 251–262 (1996)

    Google Scholar 

  11. Malaika, S., Eisenberg, A., Melton, J.: Standards for Databases on the Grid. ACM SIGMOD Record 32(3) (September 2003)

    Google Scholar 

  12. The Globus Project. The Globus Toolkit 3, Towards the Open Grid Services Architecture, http://www.globus.org/ogsa/

  13. Alpdemir, M.N., Mukherjee, A., Paton, N.W., Watson, P., Fernandes, A., Gounaris, A., Smith, J.: OGSA-DQP: A service-based distributed query processor for the Grid. In: Proc. of UK e-Science All Hands Meeting Nottingham. EPSRC, September 2-4 (2003)

    Google Scholar 

  14. Global Grid Forum, Open Grid Services Architecture - Data Access and Integration, http://www.ogsa-dai.org.uk

  15. Paton, N., Atkinson, M., Dialani, V., Pearson, D., Storey, T., Watson, P.: Database Access and Integration Services on the Grid. UK e-Science Programme Technical Report Series Number UKeS-2002-03, National e-Science Centre, UK

    Google Scholar 

  16. Sheth, A., Larson, J.: Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Computing Surveys 22(3), 183–236 (1990)

    Article  Google Scholar 

  17. Talia, D., Trunfio, P.: Toward a Synergy Between P2P and Grids. IEEE Internet Computing 7(4), 94–96 (2003)

    Article  Google Scholar 

  18. Tuecke, S., Czajkowski, K., Foster, I., Frey, J., Graham, S., Kesselman, C.: Grid Service Specification, Draft 5 (November 2002), http://www.gridforum.org/ogsi-wg

  19. The WS-Resource Framework, http://www.globus.org/wsrf/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Comito, C., Talia, D. (2004). GDIS: A Service-Based Architecture for Data Integration on Grids. In: Meersman, R., Tari, Z., Corsaro, A. (eds) On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. OTM 2004. Lecture Notes in Computer Science, vol 3292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30470-8_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30470-8_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23664-1

  • Online ISBN: 978-3-540-30470-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics