
Computing a Canonical Hierarchical Schema

  • Conference paper
Enterprise Interoperability V

Part of the book series: Proceedings of the I-ESA Conferences ((IESACONF,volume 5))

Abstract

We present a novel approach to constructing a canonical data model from a set of hierarchical schemas. The canonical data model is a well-known pattern for enterprise integration and an integral enabler of many business applications, such as business warehousing, business intelligence, data cleansing, and sustainable business-to-business integration. Once the correspondences between schemas are known from existing schema or ontology matching, the task of building the overarching canonical schema remains. A canonical schema must be able to integrate extremely different and even conflicting structures. Furthermore, it should exhibit the structures most commonly used in the sources and be stable with respect to the order in which the sources are imported. Because of these requirements, manual construction is cumbersome and error-prone, and it becomes a major cost driver of integration projects. Our approach models this task as finding an optimal solution of a constraint satisfaction problem. Our comparison with manual integration shows that, with our prototype, the reduction in human effort quickly reaches multiple person-days as the integration task grows. With our techniques as a baseline, the data models of enterprise applications can be converged and kept in sync to reduce integration costs in the long run.
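
The abstract does not spell out the constraint model, so the following is only a toy sketch of the general idea, not the authors' formulation. It assumes that schema matching has already mapped every source element to a shared concept name, represents each source schema as a child-to-parent mapping, and picks one parent per concept so that the result is acyclic (the constraint) and agrees with as many source edges as possible (the objective). The function name `canonical_hierarchy`, the vote-counting objective, and the brute-force search are all illustrative assumptions.

```python
from collections import Counter
from itertools import product


def canonical_hierarchy(sources):
    """Pick one parent per concept so the result is a forest that agrees
    with as many source parent/child edges as possible.

    sources: list of dicts mapping child concept -> parent concept
             (a root concept simply has no entry).
    Returns (assignment, score); assignment maps concept -> parent or None.
    """
    # Count how many source schemas contain each (child, parent) edge.
    votes = Counter()
    for schema in sources:
        for child, parent in schema.items():
            votes[(child, parent)] += 1

    # Sorting makes the search order, and hence tie-breaking,
    # independent of the order in which the sources are supplied.
    nodes = sorted({n for edge in votes for n in edge})
    candidates = {
        n: sorted(p for (c, p) in votes if c == n) + [None] for n in nodes
    }

    def is_acyclic(assign):
        # Follow parent links from every node; revisiting a node means a cycle.
        for start in nodes:
            seen = set()
            node = start
            while node is not None:
                if node in seen:
                    return False
                seen.add(node)
                node = assign[node]
        return True

    best, best_score = None, -1
    # Exhaustive search: only feasible for toy inputs.
    for choice in product(*(candidates[n] for n in nodes)):
        assign = dict(zip(nodes, choice))
        if not is_acyclic(assign):
            continue
        score = sum(votes[(c, p)] for c, p in assign.items() if p is not None)
        if score > best_score:
            best, best_score = assign, score
    return best, best_score


if __name__ == "__main__":
    # Three hypothetical sources; the third nests Buyer under Address,
    # which conflicts with the other two.
    s1 = {"Address": "Buyer", "Street": "Address"}
    s2 = {"Address": "Buyer", "Street": "Address", "City": "Address"}
    s3 = {"Buyer": "Address", "Street": "Buyer"}
    print(canonical_hierarchy([s1, s2, s3]))
```

On this input the majority structure wins: Address is placed under Buyer, Street and City under Address, and the conflicting edge from the third source is dropped. Supplying the sources in a different order yields the same result, which mirrors the stability requirement stated above. A realistic implementation would replace the exhaustive search with a constraint solver.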


Notes

  1. The e-business standards are: ACORD, ANSI ASC X.12, CIDX, Crossgate’s Canonical Data Model, ISO 20022 (SWIFT), OAGI BODs, OASIS UBL (Universal Business Language), ODETTE, PapiNet, RosettaNet PIPs, SAP GDT based Message Types, SAP IDoc, TRADACOMS, UN/EDIFACT, and xCBL. The message types are: Delivery Schedule, Despatch Advice, Invoice, Purchase Order, Purchase Order Change, Purchase Order Response, and Ship Notice. The companies are: Adidas, Adobe, Aldra, Benteler, Boeing, Borders, Bosch Rexroth, CBS, Case New Holland, Danfoss, Defense Logistics Agency (DLA), EMDiesel, Egger, Erico, Ford Motor Company, Freiberger, General Motors, Heidelberger Druck, Hella, Karmann, Kaufland, MAN, Maytag, Metcash, Miele, REWE, Renault, STIHL, Sauer Danfoss, Siemens, Tegut, Texas Instruments, Valero, Volvo Car Corporation, Woehrl, Nestle, 3M, John Deere, Mahle, Procter & Gamble, Delphi Automotive, Canada Border Services Agency, Eaton Corporation, Woolworths, Volkswagen, Magyar Hipermarket, Daily Standard, Questa Web, Austria Gastro, and Hometrend.



Copyright information

© 2012 Springer-Verlag London Limited

About this paper

Cite this paper

Lemcke, J., Stuhec, G., Dietrich, M. (2012). Computing a Canonical Hierarchical Schema. In: Poler, R., Doumeingts, G., Katzy, B., Chalmeta, R. (eds) Enterprise Interoperability V. Proceedings of the I-ESA Conferences, vol 5. Springer, London. https://doi.org/10.1007/978-1-4471-2819-9_27


  • DOI: https://doi.org/10.1007/978-1-4471-2819-9_27


  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-2818-2

  • Online ISBN: 978-1-4471-2819-9

  • eBook Packages: Engineering, Engineering (R0)
