Skip to main content

Validating Interlinks Between Linked Data Datasets with the SUMMR Methodology

  • Conference paper
  • First Online:
On the Move to Meaningful Internet Systems: OTM 2016 Conferences (OTM 2016)

Abstract

Linked Data datasets use interlinks to connect semantically similar resources across datasets. As datasets evolve, a resources locator can change which can cause interlinks that contain old resource locators, to no longer dereference and become invalid. Validating interlinks, through validating the resource locators within them, when a dataset has changed is important to ensure interlinks work as intended. In this paper we introduce the SPARQL Usage for Mapping Maintenance and Reuse (SUMMR) methodology. SUMMR is an approach for Mapping Maintenance and Reuse (MMR) that provides query templates which are based on standard SPARQL queries for MMR activities. This paper describes SUMMR and two experiments: a lab-based evaluation of SUMMR’s mapping maintenance query templates and a deployment of SUMMR in the DBpedia v.2015-10 release to detect invalid interlinks. The lab-based evaluation involved detecting interlinks that have become invalid, due to changes in resource locators and the repair of the invalid interlinks. The results show that the SUMMR templates and approach can be used to effectively detect and repair invalid interlinks. SUMMR’s query template for discovering invalid interlinks was applied to the DBpedia v.2015-10 release, which discovered 53,418 invalid interlinks in that release.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.scss.tcd.ie/~meehanal/Experiment3/.

  2. 2.

    http://dbpedia.org/sparql.

  3. 3.

    https://github.com/dbpedia/dbpedia-links.

  4. 4.

    http://wiki.dbpedia.org/Downloads2015-04.

References

  1. Volz, J., Bizer, C., Gaedke, M., Kobilarov, G.: Discovering and Maintaining Links on the Web of Data. Springer, Heidelberg (2009)

    Book  Google Scholar 

  2. Umbrich, J., Hausenblas, M., Hogan, A., Polleres, A., Decker, S.: Towards dataset dynamics: change frequency of linked open data sources. In: Proceedings of the Linked Data on the Web Workshop (LDOW2010), Raleigh, North Carolina, USA, 27 April 2010

    Google Scholar 

  3. Popitsch, N., Haslhofer, B.: DSNotify–A solution for event detection and link maintenance in dynamic datasets. Web Semant. Sci. Serv. Agents World Wide Web 9(3), 266–283 (2011)

    Article  Google Scholar 

  4. Meehan, A., Brennan, R., O’Sullivan, D.: SPARQL based mapping management. In: 2015 IEEE International Conference on Semantic Computing (ICSC), pp. 456–459. IEEE, February 2015

    Google Scholar 

  5. Euzenat, J., Shvaiko, P.: Ontology matching, vol. 18. Springer, Heidelberg (2007)

    MATH  Google Scholar 

  6. Dos Reis, J.C., Pruski, C., Reynaud-Delaître, C.: State-of-the-art on mapping maintenance and challenges towards a fully automatic approach. Expert Syst. Appl. 42(3), 1465–1478 (2015)

    Article  Google Scholar 

  7. Falconer, S., Noy, N.: Interactive techniques to support ontology matching. In: Schema Matching and Mapping, pp. 29–51. Springer, Heidelberg (2011)

    Google Scholar 

  8. Thomas, H., Brennan, R., O’Sullivan, D.: Using the OM 2 R meta-data model for ontology mapping reuse for the ontology alignment challenge–a case study. In: Ontology Matching, p. 61 (2012)

    Google Scholar 

  9. Zhdanova, A.V., Shvaiko, P.: Community-driven ontology matching. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 34–49. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  10. Aumueller, D., Do, H., Massmann, S., Rahm, E.: Schema and ontology matching with COMA++. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 906–908. ACM (2005)

    Google Scholar 

  11. Bizer, C., Schultz, A.: The R2R framework: publishing and discovering mappings on the web. In: 1st International Workshop on Consuming Linked Data (COLD 2010), Shanghai, China, November 2010

    Google Scholar 

  12. Thomas, H., Brennan, R., O’Sullivan, D.: MooM–a prototype framework for management of ontology mappings. In: 2011 IEEE International Conference on Advanced Information Networking and Applications (AINA), pp. 548–555. IEEE (2011)

    Google Scholar 

  13. Nentwig, M., Soru, T., Ngonga Ngomo, A.-C., Rahm, E.: LinkLion: a link repository for the web of data. In: Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I., Tordai, A. (eds.) ESWC 2014. LNCS, vol. 8798, pp. 439–443. Springer, Heidelberg (2014). doi:10.1007/978-3-319-11955-7_63

    Google Scholar 

  14. Khattak, A.M., Pervez, Z., Khan, W.A., Khan, A.M., Latif, K., Lee, S.Y.: Mapping evolution of dynamic web ontologies. Inf. Sci. 303, 101–119 (2015)

    Article  MathSciNet  Google Scholar 

  15. Dos Reis, J.C., Pruski, C., Da Silveira, M., Reynaud-Delaître, C.: DyKOSMap: a framework for mapping adaptation between biomedical knowledge organization systems. J. Biomed. Inf. 55, 153–173 (2015)

    Article  Google Scholar 

  16. Hartung, M., Groß, A., Rahm, E.: COnto–Diff: generation of complex evolution mappings for life science ontologies. J. Biomed. Inform. 46(1), 15–32 (2013)

    Article  Google Scholar 

  17. Meehan, A., Brennan, R., Lewis, D., O’Sullivan, D.: Mapping representation based on meta-data and SPIN for localization workflows. In: Proceedings of the Second International Workshop on Semantic Web Enterprise Adoption and Best Practice at ESWC (2014)

    Google Scholar 

  18. Noy, N.F., Griffith, N., Musen, M.A.: Collecting community-based mappings in an ontology repository. In: Sheth, A., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 371–386. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88564-1_24

    Chapter  Google Scholar 

  19. Dos Reis, J.C., Pruski, C., Da Silveira, M., Reynaud-Delaître, C.: Analyzing and supporting the mapping maintenance problem in biomedical knowledge organization systems. In: Proceedings of the Workshop on Semantic Interoperability in Medical Informatics collocated with the 9th Extended Semantic Web Conference, pp. 25–36 (2012)

    Google Scholar 

  20. Shvaiko, P., Euzenat, J.: Ontology matching: state of the art and future challenges. IEEE Trans. Knowl. Data Eng. 25(1), 158–176 (2013)

    Article  Google Scholar 

  21. Knublauch, H.: SPIN SPARQL Inferencing Notation. http://spinrdf.org/. Accessed 7 July 2016

Download references

Acknowledgements

This research is supported by Science Foundation Ireland through the CNGL Program (Grant 12/CE/I2267) in the ADAPT Centre (www.adaptcentre.ie) at Trinity College Dublin and funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No. 644055 (ALIGNED, http://www.aligned-project.eu).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alan Meehan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Meehan, A., Kontokostas, D., Freudenberg, M., Brennan, R., O’Sullivan, D. (2016). Validating Interlinks Between Linked Data Datasets with the SUMMR Methodology. In: Debruyne, C., et al. On the Move to Meaningful Internet Systems: OTM 2016 Conferences. OTM 2016. Lecture Notes in Computer Science(), vol 10033. Springer, Cham. https://doi.org/10.1007/978-3-319-48472-3_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48472-3_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48471-6

  • Online ISBN: 978-3-319-48472-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics