Skip to main content

Holistic Schema Matching

  • Living reference work entry
  • First Online:
Encyclopedia of Big Data Technologies

Synonyms

Collective schema matching

Definition

Holistic schema matching aims at identifying semantically corresponding elements in multiple schemas, e.g., database schemas, web forms, or ontologies. The corresponding elements from N (>2) sources are typically grouped into clusters with up to N members. Holistic schema matching is usually applied when multiple schemas need to be combined within an integrated schema or ontology.

Overview

Holistic schema matching aims at identifying semantically corresponding elements in multiple (>2) schemas, such as database schemas, web forms, or ontologies. It is to be contrasted with the traditional pairwise schema matching (Rahm and Bernstein 2001; Euzenat and Shvaiko 2013) between two input schemas only that determines a so-called mapping consisting of a set of correspondences, i.e., pairs of elements of the input schemas (table attributes, ontology concepts) that match with each other. Holistic schema matching is applied to more than two input...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Balakrishnan S, Halevy AY, Harb B, Lee H, Madhavan J, Rostamizadeh A, Shen W, Wilder K, Wu F, Yu C (2015) Applying web tables in practice. In: Proceedings of the CIDR

    Google Scholar 

  • Barbosa L, Freire J, Silva A (2007) Organizing hidden-web databases by clustering visible web documents. In: Proceedings of the international conference on data engineering, pp 326–335

    Google Scholar 

  • Batini C, Lenzerini M, Navathe SB (1986) A comparative analysis of methodologies for database schema integration. ACM Comput Surv 18(4):323–364

    Article  Google Scholar 

  • Bodenreider O (2004) The unified medical language system (UMLS): integrating bio-medical terminology. Nucleic Acids Res 32(suppl 1):D267–D270

    Article  Google Scholar 

  • Das Sarma A, Dong X, Halevy AY (2008) Bootstrapping pay-as-you-go data integration systems. In: Proceedings of the ACM SIGMOD conference

    Google Scholar 

  • Do HH, Rahm E (2002) COMA – a system for flexible combination of schema matching approaches. In: Proceedings of the international conference on very large data bases, pp 610–621

    Google Scholar 

  • Dong X, Gabrilovich E, Heitz G, Horn W, Lao N, Murphy K, Strohmann T, Sun S, Zhang W (2014) Knowledge vault: A web-scale approach to probabilistic knowledge fusion. In: Proceedings of KDD, New York, USA, pp~601–610

    Google Scholar 

  • Eberius J, Damme P, Braunschweig K, Thiele M, Lehner W (2013) Publish-time data integration for open data platforms. In: Proceedings of the ACM workshop on open data

    Google Scholar 

  • Euzenat J, Shvaiko P (2013) Ontology matching, 2nd edn. Springer, Berlin/Heidelberg, Germany

    Google Scholar 

  • Gross A, Hartung M, Kirsten T Rahm E (2011) Mapping composition for matching large life science ontologies. In: Proceedings of the ICBO, pp 109–116

    Google Scholar 

  • Gruetze T, Boehm C, Naumann F (2012) Holistic and scalable ontology alignment for linked open data. In: Proceedings of linked data on the web

    Google Scholar 

  • He B, Chang KC (2003) Statistical schema matching across web query interfaces. In: Proceedings of the ACM SIGMOD conference, pp 217–228

    Google Scholar 

  • He H, Meng W, Yu CT, Wu Z (2004) Automatic integration of web search interfaces with WISE-integrator. VLDB J 13(3):256–273

    Article  Google Scholar 

  • Hu W, Chen J, Zhang H, Qu Y (2011) How matchable are four thousand ontologies on the semantic web. In: Proceedings of the ESWC, Springer LNCS 6643

    Google Scholar 

  • Koepcke H, Rahm E (2010) Frameworks for entity matching: a comparison. Data Knowl Eng 69(2):197–210

    Article  Google Scholar 

  • Limaye G, Sarawagi S, Chakrabarti S (2010) Annotating and searching web tables using entities, types and relationships. PVLDB 3(1–2):1338–1347

    Google Scholar 

  • Madhavan J, Bernstein PA, Doan A, Halevy AY (2005) Corpus-based schema matching. In: Proceedings of the IEEE ICDE conference

    Google Scholar 

  • Mahmoud HA, Aboulnaga A (2010) Schema clustering and retrieval for multi-domain pay-as-you-go data integration systems. In: Proceedings of the ACM SIGMOD conference

    Google Scholar 

  • Nentwig M, Soru T, Ngonga Ngomo A, Rahm E (2014) LinkLion: a link repository for the web of data. In: Proceedings of the ESWC (Satellite Events), Springer LNCS 8798

    Google Scholar 

  • Rahm E (2011) Towards large-scale schema and ontology matching. In: Schema matching and mapping. Springer, Berlin/Heidelberg, Germany

    Google Scholar 

  • Rahm E (2016) The case for holistic data integration. In: Proceedings of the ADBIS conference, Springer LNCS 9809

    Google Scholar 

  • Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350

    Article  MATH  Google Scholar 

  • Raunich S, Rahm E (2014) Target-driven merging of taxonomies with ATOM. Inf Syst 42:1–14

    Article  Google Scholar 

  • Saleem K, Bellahsene Z, Hunt E (2008) PORSCHE: performance oriented SCHEma mediation. Inf Syst 33(7–8):637–657

    Article  Google Scholar 

  • Wang J, Wang H, Wang Z, Zhu KQ (2012) Understanding tables on the web. In: Proceedings of the ER conference, Springer LNCS 7532, pp 141–155

    Google Scholar 

  • Yakout M, Ganjam K, Chakrabarti K, Chaudhuri S (2012) Infogather: entity augmentation and attribute discovery by holistic matching with web tables. In: Proceedings of the ACM SIGMOD conference, pp 97–108

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Erhard Rahm .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Rahm, E., Peukert, E. (2018). Holistic Schema Matching. In: Sakr, S., Zomaya, A. (eds) Encyclopedia of Big Data Technologies. Springer, Cham. https://doi.org/10.1007/978-3-319-63962-8_12-1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63962-8_12-1

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63962-8

  • Online ISBN: 978-3-319-63962-8

  • eBook Packages: Springer Reference MathematicsReference Module Computer Science and Engineering

Publish with us

Policies and ethics