Re-constructing Hidden Semantic Data Models by Querying SPARQL Endpoints

  • Conference paper
  • Database and Expert Systems Applications (DEXA 2016)

Abstract

The Linked Open Data community is constantly producing new repositories that store information from different domains. The data in these repositories follow the rules proposed by the W3C, based on standards such as the Resource Description Framework (RDF) and the SPARQL query language. The main advantage of this approach is that external developers can access the data from their own applications. This advantage is also one of the main challenges of the technology, because of the cost of exploring how the data are structured in a given repository in order to construct SPARQL queries that retrieve useful information. According to the reviewed literature, there are no applications that reconstruct the underlying semantic data model from a SPARQL endpoint. In this paper, we present an application that reconstructs the data model as an OWL (Web Ontology Language) ontology. This application, available as open source at http://github.com/estebanpua/ontology-endpoint-extraction, uses a set of SPARQL queries to discover the classes and the (object and data) properties of a given RDF database. A web application interface has also been implemented so that users can browse the classes and properties of the ontology generated from the data structure (http://khaos.uma.es/oee). The ontologies generated by this application can help users understand how the information is semantically organized, making it easier to design SPARQL queries.
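The discovery step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the query texts, the function names (`properties_query`, `bindings_to_values`, `to_turtle`) and the example IRIs are assumptions made for illustration, and the queries actually used by the tool may differ. The sketch assumes that a property whose values are IRIs is a candidate object property and that one whose values are literals is a candidate data property, and that the endpoint returns results in the standard SPARQL JSON results format.

```python
from typing import Dict, Iterable, List

OWL_PREFIX = "@prefix owl: <http://www.w3.org/2002/07/owl#> ."

# Illustrative query (an assumption): every class with at least one instance.
CLASSES_QUERY = "SELECT DISTINCT ?class WHERE { ?s a ?class }"


def properties_query(class_iri: str, object_props: bool) -> str:
    """Build a query listing properties used by instances of class_iri.

    object_props=True keeps properties with IRI values (candidate object
    properties); False keeps properties with literal values (candidate
    data properties).
    """
    value_test = "isIRI(?o)" if object_props else "isLiteral(?o)"
    return (
        "SELECT DISTINCT ?p WHERE {\n"
        f"  ?s a <{class_iri}> .\n"
        "  ?s ?p ?o .\n"
        f"  FILTER({value_test})\n"
        "}"
    )


def bindings_to_values(results: Dict, var: str) -> List[str]:
    """Extract the values bound to one variable from SPARQL JSON results."""
    rows = results["results"]["bindings"]
    return [row[var]["value"] for row in rows if var in row]


def to_turtle(
    classes: Iterable[str],
    object_props: Iterable[str],
    data_props: Iterable[str],
) -> str:
    """Serialize the discovered terms as OWL declarations in Turtle."""
    lines = [OWL_PREFIX, ""]
    lines += [f"<{iri}> a owl:Class ." for iri in classes]
    lines += [f"<{iri}> a owl:ObjectProperty ." for iri in object_props]
    lines += [f"<{iri}> a owl:DatatypeProperty ." for iri in data_props]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical class IRI, used only to show the generated query text.
    print(properties_query("http://example.org/Gene", object_props=True))
```

In this sketch the object-property query also returns rdf:type itself, because class IRIs are IRI-valued objects; a real extractor would probably filter it out. Sending the queries to an endpoint is left out so that the example runs without network access.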

Notes

  1. https://www.w3.org/TR/void/
  2. http://protege.stanford.edu/about.php
  3. http://github.com/estebanpua/ontology-endpoint-extraction
  4. https://jena.apache.org/
  5. http://owlapi.sourceforge.net/
  6. http://wiki.dbpedia.org/
  7. https://www.openstreetmap.org/

Acknowledgements

This work was partially supported by Grants TIN2014-58304-R (Ministerio de Ciencia e Innovación) and P11-TIC-7529 and P12-TIC-1519 (Plan Andaluz de Investigación, Desarrollo e Innovación).

Author information

Corresponding author

Correspondence to María Jesús García-Godoy.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

García-Godoy, M.J., López-Camacho, E., Navas-Delgado, I., Aldana-Montes, J.F. (2016). Re-constructing Hidden Semantic Data Models by Querying SPARQL Endpoints. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science, vol 9827. Springer, Cham. https://doi.org/10.1007/978-3-319-44403-1_25

  • DOI: https://doi.org/10.1007/978-3-319-44403-1_25

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44402-4

  • Online ISBN: 978-3-319-44403-1

  • eBook Packages: Computer Science (R0)
