Skip to main content

Translation of Heterogeneous Databases into RDF, and Application to the Construction of a SKOS Taxonomical Reference

  • Conference paper
Web Information Systems and Technologies (WEBIST 2015)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 246))

Abstract

While the data deluge accelerates, most of the data produced remains locked in deep Web databases. For the linked open data to benefit from the potential represented by this huge amount of data, it is crucial to come up with solutions to expose heterogeneous databases as linked data. The xR2RML mapping language is an endeavor towards this goal: it is designed to map various types of databases to RDF, by flexibly adapting to heterogeneous query languages and data models while remaining free from any specific language. It extends R2RML, the W3C recommendation for the mapping of relational databases to RDF, and relies on RML for the handling of various data formats.

In this paper we present xR2RML, we analyse data models of several modern databases as well as the format in which query results are returned, and we show how xR2RML translates any result data element into RDF, relying on existing languages such as XPath and JSONPath when necessary. We illustrate some features of xR2RML such as the generation of RDF collections and containers, and the ability to deal with mixed data formats. We also describe a real-world use case in which we applied xR2RML to build a SKOS thesaurus aimed at supporting studies on History of Zoology, Archaeozoology and Conservation Biology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.w3.org/2013/data/.

  2. 2.

    http://linkeddata.org/.

  3. 3.

    http://www.w3.org/2001/sw/rdb2rdf/wiki/Implementations.

  4. 4.

    http://refine.deri.ie/.

  5. 5.

    http://jenit.github.io/linked-csv/.

  6. 6.

    http://www.w3.org/2013/csvw/wiki.

  7. 7.

    https://github.com/cygri/tarql/wiki/TARQL-Mapping-Language.

  8. 8.

    http://www.couchbase.com/communities/n1ql.

  9. 9.

    http://unql.sqlite.org/index.html.

  10. 10.

    http://docs.arangodb.org/Aql/README.html.

  11. 11.

    aka. column family store, column-oriented store, etc.

  12. 12.

    http://www.w3.org/TR/turtle/.

  13. 13.

    rml:query also subsumes rml:xmlQuery and rml:queryLanguage, although none of those properties are described or exemplified in the RML language specification and articles at the time of writing.

  14. 14.

    https://github.com/frmichel/morph-xr2rml/.

  15. 15.

    http://www.cepam.cnrs.fr/zoomathia/.

  16. 16.

    http://www.w3.org/2009/08/skos-reference/skos.html.

  17. 17.

    Changes were made since the description presented in [26].

  18. 18.

    http://www.ontobee.org/browser/index.php?o=NCBITaxon.

  19. 19.

    http://datahub.io/dataset/geospecies.

  20. 20.

    http://www.ontobee.org/browser/index.php?o=ENVO.

  21. 21.

    http://www.tdwg.org/.

  22. 22.

    http://json-schema.org/.

  23. 23.

    http://www.w3.org/TR/json-ld/.

References

  1. He, B., Patel, M., Zhang, Z., Chang, K.C.C.: Accessing the deep web. Commun. ACM 50, 94–101 (2007)

    Article  Google Scholar 

  2. Das, S., Sundara, S., Cyganiak, R.: R2RML: RDB to RDF mapping language (2012)

    Google Scholar 

  3. Dimou, A., Sande, M.V., Slepicka, J., Szekely, P., Mannens, E., Knoblock, C., Walle, R.V.: Mapping hierarchical sources into RDF using the RML mapping language. In: Proceedings of ICSC 2014, pp. 151–158. IEEE (2014)

    Google Scholar 

  4. Roth, M.T., Schwartz, P.: Don’t scrap it, wrap it! A wrapper architecture for legacy data sources. In: Proceedings of VLDB 1997, pp. 266–275 (1997)

    Google Scholar 

  5. Melton, J., Michels, J.E., Josifovski, V., Kulkarni, K., Schwarz, P.: SQL/MED: a status report. ACM SIGMOD Rec. 31, 81–89 (2002)

    Article  Google Scholar 

  6. Schwarte, A., Haase, P., Hose, K., Schenkel, R., Schmidt, M.: FedX: optimization techniques for federated query processing on linked data. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 601–616. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  7. Acosta, M., Vidal, M.-E., Lampo, T., Castillo, J., Ruckhaus, E.: ANAPSID: an adaptive query processing engine for SPARQL endpoints. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 18–34. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Gaignard, A.: Distributed knowledge sharing and production through collaborative e-science platforms. PhD thesis (2013)

    Google Scholar 

  9. Spanos, D.E., Stavrou, P., Mitrou, N.: Bringing relational databases into the semantic web: a survey. Semant. Web J. 3, 169–209 (2012)

    Google Scholar 

  10. Sequeda, J., Tirmizi, S.H., Corcho, S., Miranker, D.P.: Survey of directly mapping SQL databases to the semantic web. Knowl. Eng. Rev. 26, 445–486 (2011)

    Article  Google Scholar 

  11. Michel, F., Montagnat, J., Faron-Zucker, C.: A survey of RDB to RDF translation approaches and tools Research report. ISRN I3S/RR 2013–04-FR (2014)

    Google Scholar 

  12. Bischof, S., Decker, S., Krennwallner, T., Lopes, N., Polleres, A.: Mapping between RDF and XML with XSPARQL. J. Data Semant. 1, 147–185 (2012)

    Article  Google Scholar 

  13. Fennell, P.: Schematron - more useful than you’d thought. In: Proceedings of the XML London 2014 Conference, pp. 103–112 (2014)

    Google Scholar 

  14. Breitling, F.: A standard transformation from XML to RDF via XSLT. Astron. Not. 330, 755 (2009)

    Google Scholar 

  15. Bikakis, N., Tsinaraki, C., Stavrakantonakis, I., Gioldasis, N., Christodoulakis, S.: The SPARQL2XQuery interoperability framework. CoRR abs/1311.0536 (2013)

    Google Scholar 

  16. Langegger, A., Wöß, W.: XLWrap – querying and integrating arbitrary spreadsheets with SPARQL. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 359–374. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  17. Scharffe, F., Atemezing, G., Troncy, R., Gandon, F., Villata, S., Bucher, B., Hamdi, F., Bihanic, L., Képéklian, G., Cotton, F., et al.: Enabling linked data publication with the Datalift platform. In: Proceedings of the AAAI Workshop on Semantic Cities (2012)

    Google Scholar 

  18. Dimou, A., Vander Sande, M., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: RML: a generic language for integrated RDF mappings of heterogeneous data. In: Proceedings of the 7th LDOW Workshop (2014)

    Google Scholar 

  19. Hecht, R., Jablonski, S.: NoSQL evaluation: a use case oriented survey. In: Proceedings of CSC 2011, pp. 336–341. IEEE Computer Society (2011)

    Google Scholar 

  20. Gajendran, S.K.: A survey on NoSQL databases (technical report) (2013)

    Google Scholar 

  21. Ong, K.W., Papakonstantinou, Y., Vernoux, R.: The SQL++ unifying semi-structured query language, and an expressiveness benchmark of SQL-on-Hadoop, NoSQL and NewSQL databases (submitted). CoRR abs/1405.3631 (2014)

    Google Scholar 

  22. Kolev, B., Valduriez, P., Jimenez-Peris, R., Martìnez-Bazan, N., Pereira, J.: CloudMdsQL: querying heterogeneous cloud data stores with a common language. In: Proceedings of the BDA 2014 Conference (2014)

    Google Scholar 

  23. Michel, F., Djimenou, L., Faron-Zucker, C., Montagnat, J.: xR2RML: Relational and non-relational databases to RDF mapping language (2014). Research report. ISRN I3S/RR 2014–04-FR v3

    Google Scholar 

  24. Priyatna, F., Corcho, O., Sequeda, J.: Formalisation and experiences of R2RML-based SPARQL to SQL query translation using Morph. In: Proceedings of WWW 2014 (2014)

    Google Scholar 

  25. Gargominy, P., Tercerie, S., Régnier, C., Ramage, T., Schoelinck, C., Dupont, P., Vandel, E., Daszkiewicz, P., Poncet, L.: TAXREF v8.0, référentiel taxonomique pour la France: méthodologie, mise en oeuvre et diffusion. In: Rapport SPN 2014 - 42 (2014)

    Google Scholar 

  26. Callou, C., Michel, F., Faron-Zucker, C., Martin, C., Montagnat, J.: Towards a shared reference thesaurus for studies on history of zoology, archaeozoology and conservation biology. In: ESCW 2015, Workshop Semantic Web For Scientific Heritage (SW4SH), Portoroz, Slovenia (2015)

    Google Scholar 

  27. Floratou, A., Minhas, U.F., Ozcan, F.: SQl-on-hadoop: Full circle back to shared-nothing database architectures. Proc. VLDB Endowment 7, 1295–1306 (2014)

    Article  Google Scholar 

  28. Arenas, M., Bertails, A., Prud’hommeaux, E., Sequeda, J.: A direct mapping of relational data to RDF (2012)

    Google Scholar 

  29. de Medeiros, L.F., Priyatna, F., Corcho, O.: MIRROR: automatic R2RML mapping generation from relational databases. In: Cimiano, P., Frasincar, F., Houben, G.-J., Schwabe, D. (eds.) ICWE 2015. LNCS, vol. 9114, pp. 326–343. Springer, Heidelberg (2015)

    Chapter  Google Scholar 

  30. Shvaiko, P., Euzenat, J.: A survey of schema-based matching approaches. In: Spaccapietra, S. (ed.) Journal on Data Semantics IV. LNCS, vol. 3730, pp. 146–171. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  31. Fagin, R., Haas, L.M., Hernández, M., Miller, R.J., Popa, L., Velegrakis, Y.: Clio: schema mapping creation and data exchange. In: Borgida, A.T., Chaudhri, V.K., Giorgini, P., Yu, E.S. (eds.) Conceptual Modeling: Foundations and Applications. LNCS, vol. 5600, pp. 198–236. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  32. Knoblock, C.A., Szekely, P., Ambite, J.L., Goel, A., Gupta, S., Lerman, K., Muslea, M., Taheriyan, M., Mallick, P.: Semi-automatically mapping structured sources into the semantic web. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 375–390. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  33. Sequeda, J.F., Miranker, D.P.: Ultrawrap: SPARQL execution on relational data. Web Semant.: Sci. Serv. Agents WWW 22, 19–39 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Franck Michel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Michel, F., Djimenou, L., Faron-Zucker, C., Montagnat, J. (2016). Translation of Heterogeneous Databases into RDF, and Application to the Construction of a SKOS Taxonomical Reference. In: Monfort, V., Krempels, KH., Majchrzak, T.A., Turk, Ž. (eds) Web Information Systems and Technologies. WEBIST 2015. Lecture Notes in Business Information Processing, vol 246. Springer, Cham. https://doi.org/10.1007/978-3-319-30996-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30996-5_14

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30995-8

  • Online ISBN: 978-3-319-30996-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics