Skip to main content

Data Source Management and Selection for Dynamic Data Integration

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6162))

Abstract

Selection-dynamic data integration employs a set of known data sources attached to an integration system. For answering a given query, suitable sources are selected from this set and dynamically integrated. This procedure requires a method to determine the degree of suitability of the individual data sources within a short timeframe, eliminating conventional schema matching approaches. We developed a registry component for our DynaGrid virtual data source which analyzes data sources upon registration and constructs a catalog of schema fragments grouped by content and cohesion. Given a concrete query, it provides a ranked list of data sources capable of contributing to answering the query. In this paper, we first give an overview of dynamic data integration and the DynaGrid virtual data source. We then present the design and the functionality of the registry component and illustrate its task in the overall process of selection-dynamic data integration.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Husemann, M., Ritter, N.: A Virtual Data Source for Service Grids. In: Second Int. Conf. on Data Management in Grid and P2P Systems, September 2009, pp. 24–35 (2009)

    Google Scholar 

  2. Wiederhold, G.: Mediators in the Architecture of Future Information Systems. IEEE Computer 25(3), 38–49 (1992)

    Google Scholar 

  3. Shvaiko, P., Euzenat, J.: A Survey of Schema-Based Matching Approaches. In: Spaccapietra, S. (ed.) Journal on Data Semantics IV. LNCS, vol. 3730, pp. 146–171. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  4. Rahm, E., Bernstein, P.A.: A survey of approaches to automatic schema matching. VLDB J. 10(4), 334–350 (2001)

    Article  MATH  Google Scholar 

  5. Gounaris, A., Sakellariou, R., Comito, C., Talia, D.: Service Choreography for Data Integration on the Grid. In: Knowledge and Data Management in GRIDs, February 2007, pp. 19–33. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  6. Gorton, I., Almquist, J., Dorow, K., et al.: An Architecture for Dynamic Data Source Integration. In: 38th Hawaii Int. Conf. on System Sciences (January 2005)

    Google Scholar 

  7. Chang, K.C.C., He, B., Zhang, Z.: Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. In: CIDR, January 2005, pp. 44–55 (2005)

    Google Scholar 

  8. Al-Hussaini, L., Viglas, S., Atkinson, M.: A Service-based Approach to Schema Federation of Distributed Databases. Technical Report EES-2006-01, University of Edinburgh (November 2005)

    Google Scholar 

  9. Lacroix, Z., Parekh, K., Vidal, M.E., et al.: BioNavigation: Selecting Optimum Paths Through Biological Resources to Evaluate Ontological Navigational Queries. In: Ludäscher, B., Raschid, L. (eds.) DILS 2005. LNCS (LNBI), vol. 3615, pp. 275–283. Springer, Heidelberg (2005)

    Google Scholar 

  10. Aziz, M., Lacroix, Z.: ProtocolDB: Classifying Resources with a Domain Ontology to Support Discovery. In: 10th Int. Conf. on Information Integration and Web-based Applications Services, November 2008, pp. 462–469 (2008)

    Google Scholar 

  11. Wilkinson, M.D., Links, M.: BioMOBY: An Open Source Biological Web Services Proposal. Briefings in Bioinformatics 3(4), 331–341 (2002)

    Article  Google Scholar 

  12. Ayadi, N.Y., Lacroix, Z., Vidal, M.E.: BiOnMap: A Deductive Approach for Resource Discovery. In: 10th Int. Conf. on Information Integration and Web-based Applications Services, November 2008, pp. 477–482 (2008)

    Google Scholar 

  13. Li, J., Ma, D., Zhao, Z., et al.: An Efficient Semantic Web Services Matching Mechanism. In: Second Int. Workshop on Resource Discovery (August 2009)

    Google Scholar 

  14. Foster, I.T.: Globus Toolkit Version 4: Software for Service-Oriented Systems. In: IFIP Int. Conf. on Network and Parallel Computing, November 2005, pp. 2–13 (2005)

    Google Scholar 

  15. Antonioletti, M., Atkinson, M.P., Baxter, R., et al.: The design and implementation of Grid database services in OGSA-DAI. Concurrency - Practice and Experience 17(2-4), 357–376 (2005)

    Article  Google Scholar 

  16. Antonioletti, M., Atkinson, M., Baxter, R., et al.: OGSA-DAI: Two Years On. In: The Future of Grid Data Environments Workshop, GGF10 (March 2004)

    Google Scholar 

  17. Lenzerini, M.: Data Integration: A Theoretical Perspective. In: PODS, June 2002, pp. 233–246 (2002)

    Google Scholar 

  18. Pottinger, R., Halevy, A.Y.: MiniCon: A scalable algorithm for answering queries using views. VLDB J. 10(2-3), 182–198 (2001)

    MATH  Google Scholar 

  19. Yu, B., Li, G., Sollins, K.R., Tung, A.K.H.: Effective keyword-based selection of relational databases. In: SIGMOD, June 2007, pp. 139–150 (2007)

    Google Scholar 

  20. Fellbaum, C. (ed.): WordNet - An Electronic Lexical Database, May 1998. MIT Press, Cambridge (1998), http://wordnet.princeton.edu

    MATH  Google Scholar 

  21. Rahm, E., Do, H.H., Maßmann, S.: Matching Large XML Schemas. SIGMOD Record 33(4), 26–31 (2004)

    Article  Google Scholar 

  22. Floyd, R.W.: Algorithm 97: Shortest path. ACM Commun. 5(6), 345 (1962)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Husemann, M., Ritter, N. (2010). Data Source Management and Selection for Dynamic Data Integration. In: Lacroix, Z. (eds) Resource Discovery. RED 2009. Lecture Notes in Computer Science, vol 6162. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14415-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14415-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14414-1

  • Online ISBN: 978-3-642-14415-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics