Abstract
Distributed data sources can be heterogeneous in their formats, schemas, quality, access mechanisms, ownership, access policies, and capabilities, so models and techniques are needed for managing different data resources in an integrated way. Data integration is the flexible and managed federation, analysis, and processing of data from different distributed sources. It is becoming as important as data mining for exploiting the value of the large, distributed data sets available today. Distributed processing infrastructures such as Grids and peer-to-peer networks can be used for data integration across geographically distributed sites. This paper presents a service-based architecture for data integration on Grids. It discusses the basic model and describes its implementation based on the OGSA Globus architecture.
© 2004 Springer-Verlag Berlin Heidelberg
Cite this paper
Comito, C., Talia, D. (2004). GDIS: A Service-Based Architecture for Data Integration on Grids. In: Meersman, R., Tari, Z., Corsaro, A. (eds) On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. OTM 2004. Lecture Notes in Computer Science, vol 3292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30470-8_27
DOI: https://doi.org/10.1007/978-3-540-30470-8_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23664-1
Online ISBN: 978-3-540-30470-8
eBook Packages: Springer Book Archive