Semantic Data Integration: Tools and Architectures

Semantic Web Technologies for Intelligent Engineering Applications

Abstract

This chapter focuses on the technical aspects of semantic data integration, which provides solutions for bridging the semantic gaps between common project-level concepts and local tool concepts, as identified in the Engineering Knowledge Base (EKB). Based on use case requirements elicited from automation systems engineering, the chapter identifies the capabilities an EKB software architecture has to provide. It describes four EKB software architecture variants and their components, and discusses their advantages and drawbacks regarding the use of ontologies. A benchmark is defined to evaluate the efficiency of the EKB software architecture variants with respect to selected quality attributes, such as performance and scalability. The main results suggest that architectures relying on a relational database still outperform traditional ontology storages, while NoSQL databases outperform the alternatives in query execution.

Acknowledgments

This work was supported by the Christian Doppler Forschungsgesellschaft, the Federal Ministry of Economy, Family and Youth, and the National Foundation for Research, Technology and Development in Austria.

Corresponding author

Correspondence to Richard Mordinyi.

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Mordinyi, R., Serral, E., Ekaputra, F.J. (2016). Semantic Data Integration: Tools and Architectures. In: Biffl, S., Sabou, M. (eds) Semantic Web Technologies for Intelligent Engineering Applications. Springer, Cham. https://doi.org/10.1007/978-3-319-41490-4_8

  • DOI: https://doi.org/10.1007/978-3-319-41490-4_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41488-1

  • Online ISBN: 978-3-319-41490-4

  • eBook Packages: Computer Science, Computer Science (R0)
