Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Information Integration Techniques for Scientific Data

  • Amarnath Gupta
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1303

Definition

Information integration refers to the field of study of techniques attempting to combine information from disparate sources despite differing conceptual, contextual and lexical representations. One goal of information integration, commonly held by the data management community, is to combine the information in such a way that the user gets a unified view of the data. In other words, the user should see and query the data as though it is present in a common, unified schema.

In the domain of scientific applications, the problems and expectations are different. In this domain the semantics of data play a very strong role in data integration, and semantic compatibility need to be ensured as part of the data integration process. Further, straightforward view-based data integration, which works well for commercial applications, does not always suit the needs of the scientific users. Finally, scientists need to ensure that the result of any query on integrated data is...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Bowers S, Lin K, Ludaescher B. On integrating scientific resources through semantic registration. In: Proceedings of the 16th International Conference on Scientific and Statistical Database Management; 2004. p. 349.Google Scholar
  2. 2.
    Cluet S, Delobel C, Siméon J, Smaga K. Your mediators need data conversion! In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1998. p. 177–88.CrossRefGoogle Scholar
  3. 3.
    Critchlow T, Fidelis K, Ganesh M, Musick R, Slezak T. Datafoundry: information management for scientific data. IEEE Trans Inf Technol Biomed. 2000;4(1):52–7.CrossRefGoogle Scholar
  4. 4.
    Gupta A, Ludäscher B, Martone ME, Rajasekar A, Ross E, Qian X, Santini S, He H, Zaslavsky I. BIRN-M: a semantic mediator for solving real-world neuroscience problems. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003. p. 678.Google Scholar
  5. 5.
    Hakimpour F, Geppert A. Resolving semantic heterogeneity in schema integration. In: Proceedings of the 2nd International Conference on Formal Ontology in Information Systems; 2001. p. 297–308.Google Scholar
  6. 6.
    Halevy A. Why your data won’t mix. ACM Queue. 2003;3(8):50–8.CrossRefGoogle Scholar
  7. 7.
    Ludäscher B, Gupta A, Martone ME. Model-based mediation with domain maps. In: Proceedings of the 17th International Conference on Data Engineering; 2001. p. 81–90.Google Scholar
  8. 8.
    Qi Y, Candan KS, Sapino ML, Kintigh KW. Integrating and querying taxonomies with quest in the presence of conflicts. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2007. p. 1153–5.Google Scholar
  9. 9.
    Searls DB. Data integration: challenges for drug discovery. Nat Rev Drug Discov. 2005;4:45–58.CrossRefGoogle Scholar
  10. 10.
    National Research Council (US) Committee on Applied and Theoretical Statistics. Steps Toward Large-Scale Data Integration in the Sciences: Summary of a Workshop. 2010. available from http://www.ncbi.nlm.nih.gov/books/NBK45678/.

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.San Diego Supercomputer CenterUniversity of California San DiegoLa JollaUSA

Section editors and affiliations

  • Amarnath Gupta
    • 1
  1. 1.San Diego Supercomputer CenterUniv. of California San DiegoLa JollaUSA