Skip to main content
Log in

An evaluation of ontology matching in geo-service applications

  • Published:
GeoInformatica Aims and scope Submit manuscript

Abstract

Matching between concepts describing the meaning of services representing heterogeneous information sources is a key operation in many application domains, including web service coordination, data integration, peer-to-peer information sharing, query answering, and so on. In this paper we present an evaluation of an ontology matching approach, specifically of structure-preserving semantic matching (SPSM) solution. In particular, we discuss the SPSM approach used to reduce the semantic heterogeneity problem among geo web services and we evaluate the SPSM solution on real world GIS ESRI ArcWeb services. The first experiment included matching of original web service method signatures to synthetically alterated ones. In the second experiment we compared a manual classification of our dataset to the automatic (unsupervised) classification produced by SPSM. The evaluation results demonstrate robustness and good performance of the SPSM approach on a large (ca. 700 000) number of matching tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Notes

  1. http://www.ec-gis.org/inspire/

  2. http://ec.europa.eu/environment/seis/index.htm

  3. http://ec.europa.eu/environment/water/index_en.htm

  4. http://www.gmes.info/index.php?id=home

  5. http://www.earthobservations.org/geoss.shtml

  6. http://www.opengeospatial.org/

  7. http://www.opengeospatial.org/projects/groups/semantics

  8. http://www.w3.org/TR/sawsdl/

  9. Available as open source software at http://semanticmatching.org/

  10. http://www.esri.com/

  11. http://en.wikipedia.org/wiki/SOAP

  12. http://www.arcwebservices.com/v2006/help/index.htm

  13. www.openk.org

  14. http://www.opengeospatial.org/standards/wms

  15. http://www.opengeospatial.org/standards/wfs

  16. http://www.opengeospatial.org/standards/orm

  17. http://www.opengeospatial.org/standards/common

  18. http://www.opengeospatial.org/standards/wps

  19. http://www.opengeospatial.org/standards/cat

  20. http://www.oasis-open.org/apps/group_public/download.php/23974/wsbpel-v2.0-primer.pdf

  21. http://www.wsmo.org/

  22. http://technologies.kmi.open.ac.uk/irs/

  23. http://www.w3.org/Submission/2004/SUBM-OWL-S-20041122/

  24. Examples of individual approaches addressing the matching problem can also be found on http://www.ontologymatching.org.

  25. Element level matching techniques compute correspondences by analyzing concepts in isolation, ignoring their relationships with other concepts. In turn, structure level matching techniques compute correspondences by analyzing relationships between concepts considering the structure of each ontology.

  26. http://about.seals-project.eu/

  27. http://oaei.ontologymatching.org/2009

  28. http://semanticmatching.org

  29. http://wordnet.princeton.edu/

  30. For example, in Fig. 1 all the labels are atomic (though involving multi-words), except requestMap and Vector_Layers, which are complex ones, see [15] for how these are handled.

  31. SAT4J: a satisfiability library for Java. http://www.sat4j.org/

  32. To obtain this result, we used a modified version of the SMatch package, release of 2010-10-09, available at http://semanticmatching.org/download.html: we set the Cost of each tree edit distance operation to 1.0 in the TreeEditDistance.java class, we configured the edit distance matcher threshold to 0.7 (MatcherLibrary.MatcherLibrary.stringMatchers.EditDistanceOptimized.threshold = 0.7) in the configuration file s-match-spsm-function.properties, and we configured the resulting matching file to show the similarity value (MappingRenderer=it.unitn.disi.smatch.renderers.mapping. SimpleXMLMappingRenderer) in the configuration file s-match-spsm-function.properties.

  33. http://arcweb.esri.com/arcwebonline

  34. The edit-distance between two strings is given by the minimum number of operations needed to transform one string into the other, where the operation is an insertion, deletion, or substitution of a single character.

  35. http://icame.uib.no/brown/bcm.html

  36. The BROWN CORPUS contains 1 million words, so the probability of obtaining a related word is relatively low. If, for example, a word had 100 related terms, the probability to have a related term is 1/10000. So we could say that the replacement is indeed with a “probabilistically unrelated word”.

  37. http://www.mobysaurus.com

  38. The empirical rules have been designed one by one for each of the four alteration operations. The rationale behind these empirical rules is that the change rate discriminates clearly between the cases. Reduction by 0.5 turned out to suffice based on some empirical preliminary testing.

  39. The evaluation could be refined by considering the asymmetry of the similarities [54, 43]; since similarities are asymmetric (hypernyms are usually considered less similar to hyponyms than the other way round), this empirical reduction could be 2-valued, depending on whether the relation changes to less general or more general. This line is viewed as future work.

References

  1. Aggarwal R, Verma K, Miller J, Milnor W (2004) Constraint driven web service composition in METEOR-S. In: Proc of the 1st IEEE international Conference of Services Computing (SCC), pp 23–30

  2. Antoniou G, van Harmelen F (2003) Web ontology language: OWL. Springer

  3. Aumüller D, Do H-H, Maßmann S, Rahm E (2005) Schema and ontology matching with COMA+ +. In: Proc 24th international conference on management of data (SIGMOD), software demonstration, pp 906–908

  4. Bernard L, Craglia M, Gould M, Kuhn W (2005) Towards an SDI research agenda. In: Proc of the 11th European Commission-Geographic Information (EC-GI) and GIS wORKShop, pp 147–151

  5. Breitbart Y (1990) Multidatabase interoperability. SIGMOD Rec 19(3):53–60

    Article  Google Scholar 

  6. Di L, Zhao P, Yang W, Yue P (2006) Ontology-driven automatic geospatial-processing modeling based on web-service chaining. In: Proc of the 6th Earth Science Technology Conference (ESTC)—CDROM

  7. Egenhofer MJ (2002) Toward the semantic geospatial web. In: Proc of the 10th ACM symposium on advances in geographic information systems, pp 1–4

  8. Euzenat J, Shvaiko P (2007) Ontology matching. Springer

  9. Fileto R, Liu L, Pu C, Assad ED, Medeiros CB (2003) Poesia: an ontological workflow approach for composing web services in agriculture. VLDB J 12(4):352–367

    Article  Google Scholar 

  10. Giunchiglia F, Marchese M, Zaihrayeu I (2007) Encoding classifications into lightweight ontologies. Journal of Data Semantics VIII:57–81

    Google Scholar 

  11. Giunchiglia F, McNeill F, Yatskevich M, Pane J, Besana P, Shvaiko P (2008) Approximate structure-preserving semantic matching. In: Proc of the 7th conference on Ontologies, DataBases, and Applications of Semantics (ODBASE), pp 1234–1237

  12. Giunchiglia F, Shvaiko P, Yatskevich M (2006) Discovering missing background knowledge in ontology matching. In: Proc of the 17th European Conference on Artificial Intelligence (ECAI), pp 382–386

  13. Giunchiglia F, Walsh T (1992) A theory of abstraction. Artif Intell 57(2–3):323–389

    Article  Google Scholar 

  14. Giunchiglia F, Yatskevich M, Avesani P, Shvaiko P (2009) A large scale dataset for the evaluation of ontology matching systems. The Knowledge Engineering Review Journal 24(2):137–157

    Article  Google Scholar 

  15. Giunchiglia F, Yatskevich M, Shvaiko P (2007) Semantic matching: algorithms and implementation. Journal on Data Semantics IX:1–38

    Google Scholar 

  16. Gligorov R, Aleksovski Z, ten Kate W, van Harmelen F (2007) Using google distance to weight approximate ontology matches. In: Proc of the 16th international World Wide Web conference (WWW), pp 767–775

  17. Gone M, Shade S (2007) Towards semantic composition of geospatial web services using WSMO in comparison to BPEL. In: Proc of the 5th geographic information day—young researchers forum, pp 43–63

  18. Groot R, McLaughlin J (2000) Geospatial data infrastructure: concepts, cases and good practice. Oxford University Press

  19. Hu W, Qu Y (2008) Falcon-ao: a practical ontology matching system. Journal of Web Semantics, 6(3):237–239

    Article  Google Scholar 

  20. Janowicz K, Keßler C, Schwarz M, Wilkes M, Panov I, Espeter M, Bäumer B (2007) Algorithm, implementation and application of the SIM-DL similarity server. GeoSpatial Semantics, LNCS 4853:128–145

    Article  Google Scholar 

  21. Janowicz K, Wilkes M, Lutz M (2008) Similarity-based information retrieval and its role within spatial data infrastructures. Geogr Inf Sci, LNCS 5266:151–167

    Article  Google Scholar 

  22. Jérôme Euzenat AF, Hollink L, Isaac A, Joslyn C, Malaisé V, Meilicke C, Nikolov A, Pane J, Sabou M, Scharffe F, Shvaiko P, V. Spiliopoulos, H. Stuckenschmidt, O. Šváb Zamazal, V. Svátek, dos Santos CT, Vouros G, Wang S (2009) Results of the ontology alignment evaluation initiative 2009. In: Proc of the 4th Ontology Matching (OM) workshop at the International Semantic Web Conference (ISWC), pp 73–119

  23. Kashyap V, Sheth A (1998) Semantic heterogeneity in global information systems: The role of metadata, context and ontologies. In: Papazoglou M, Schlageter G (eds) Cooperative information systems. Academic Press, pp 139–178

  24. Klein M (2001) Combining and relating ontologies: an analysis of problems and solutions. In: Proc of the workshop on ontologies and information sharing at the International Joint Conference on Artificial Intelligence (IJCAI)

  25. Klusch M, Fries B, Sycara K (2006) Automated semantic web service discovery with OWLS-MX. In: Proc of the 4th international joint conference on Autonomous Agents and Multiagent Systems (AAMAS), pp 915–922

  26. Kuhn W (2005) Geospatial semantics: why, of what, and how? Journal on Data Semantics, LNCS 3534:1–24 (special issue on Semantic-based Geographical Information Systems)

  27. Lemmens R, Wytzisk A, de By R, Granell C, Gould M, van Oosterom P (2006) Integrating semantic and syntactic descriptions to chain geographic services. IEEE Internet Comput 10(5):42–52

    Article  Google Scholar 

  28. Lutz M, Klien E (2006) Ontology-based retrieval of geographic information. Int J Geogr Inf Sci 20(3):233–260

    Article  Google Scholar 

  29. Lutz M, Lucchi R, Friis-christensen A, Ostländer N (2007) A rule-based description framework for the composition of geographic information services. Geospatial Semantics, LNCS 4853:114–127

    Article  Google Scholar 

  30. Marchese M, Vaccari L, Shvaiko P, Pane J (2008) An application of approximate ontology matching in eResponse. In: Proc of the 5th international conference on Information Systems for Crisis Response and Management (ISCRAM), pp 294–304

  31. Martin D, Burstein M, Hobbs J, Lassila O, McDermott D, McIlraith S, Narayanan S, Paolucci M, Parsia B, Payne T, Sirin E, Srinivasan N, Sycara K (2004) OWL-S: semantic markup for web services. W3C Submission

  32. Masser I (2005) Creating spatial data infrastructures. ESRI Press

  33. Mikhaiel R, Stroulia E (2006) Examining usage protocols for service discovery. In: Proc of the 4th International Conference on Service Oriented Computing (ICSOC), pp 496–502

  34. Miller GA (1995) Wordnet: a lexical database for english. Commun ACM 38(11):39–41

    Article  Google Scholar 

  35. Nebert D (2004) Developing Spatial Data Infrastructures. The SDI CookBook. Global Spatial Data Infrastructure (GSDI)

  36. Noy NF (2004) Semantic integration: a survey of ontology-based approaches. SIGMOD Rec 33(4):65–70

    Article  Google Scholar 

  37. Nyerges TL (1989) Schema integration analysis for the development of GIS databases. Int J Geogr Inf Syst 3(2):153–183

    Article  Google Scholar 

  38. Onsrud H (2007) Research and theory in advancing creating spatial data infrastructure concepts. ESRI Press

  39. Oundhakar S, Verma K, Sivashanmugam K, Sheth A, Miller J (2005) Discovery of web services in a multi-ontology and federated registry environment. International Journal of Web Services Research 2(3):1–32

    Article  Google Scholar 

  40. Paolucci M, Kawamura T, Payne TR, Sycara K (2002) Semantic matching of web services capabilities. In: Proc of the 1st International Semantic Web Conference (ISWC), pp 333–347

  41. Partyka J, Alipanah N, Khan L, Thuraisingham BM, Shekhar S (2008) Content-based ontology matching for GIS datasets. In: Proc of the 16th international symposium on Advances in Geographic Information Systems (ACM-GIS)

  42. Petrie C, Margaria T, Kuster U, Lausen H, Zaremba M (2007) Sws challenge: status, perspectives, and lessons learned so far. In: Proc of the 9th International Conference on Enterprise Information Systems (ICEIS), pp 447–452

  43. Rodríguez MA, Egenhofer MJ (2004) Comparing geospatial entity classes: an asymmetric and context-dependent similarity measure. Int J Geogr Inf Sci 18(3):229–256

    Article  Google Scholar 

  44. Roman D, Lausen UKH, de Bruijn J, Lara R, Stollberg M, Polleres A, Fensel D, Bussler C (2005) Web service modeling ontology (WSMO). Applied Ontology 1(1):77–106

    Google Scholar 

  45. Schulte S, Eckert J, Repp N, Steinmetz R (2008) An approach to evaluate and enhance the retrieval of semantic web services. In: Proc of the 5th international conference on service systems and service management, pp 237–243

  46. Sheth AP (1999) Changing focus on interoperability in information systems: from systems, syntax, structure to semantics. Interoperating Geographic Information Systems 47:5–29

    Article  Google Scholar 

  47. Shvaiko P, Euzenat J (2005) A survey of schema-based matching approaches. Journal on Data Semantics IV:146–171

    Google Scholar 

  48. Shvaiko P, Euzenat J (2008) Ten challenges for ontology matching. In: Proc 7th international conference on Ontologies, DataBases, and Applications of Semantics (ODBASE), pp 1163–1181

  49. Smith B, Mark DM (2001) Geographical categories: an ontological investigation. Int J Geogr Inf Sci 15(7):591–612

    Article  Google Scholar 

  50. Smits PC, Friis-Christensen A (2007) Resource discovery in a european spatial data infrastructure. IEEE Trans Knowl Data Eng 19(1):85–95

    Article  Google Scholar 

  51. Stroulia E, Wang Y (2005) Structural and semantic matching for assessing web-service similarity. International Journal of Cooperative Information System 14(4):407–438

    Article  Google Scholar 

  52. Syed Z, Finin T, Joshi A (2008) Wikipedia as an Ontology for Describing Documents. In: Proc of the 2nd International Conference on Weblogs and Social Media (ICWSM) pp 136–144

  53. Tanasescu V, Gugliotta A, Domingue J, Davies R, Gutiérrez-Villarías L, Rowlatt M, Richardson M, Stinčić S (2006) A semantic web services GIS based emergency management application. In: Proc of the workshop on semantic web for eGovernment at the 5th International Semantic Web Conference (ISWC), pp 959–966

  54. Tversky A (1977) Features of similarity. Psychol Rev 84(4):327–352

    Article  Google Scholar 

  55. Vaccari L, Shvaiko P, Marchese M (2009) A geo-service semantic integration in spatial data infrastructures. International Journal of Spatial Data Infrastructures Research (IJSDIR) 4:24–51

    Google Scholar 

  56. Valiente G (2002) Algorithms on trees and graphs. Springer

  57. Worboys MF, Deen MS (1991) Semantic heterogeneity in distributed geographic databases. SIGMOD Rec 20(4):30–34

    Article  Google Scholar 

  58. Zhao P, Di L (2005) Semantic web service based geospatial knowledge discovery. In: Proc of the International Geoscience and Remote Sensing Symposium (IGARSS), pp 3490–3493

Download references

Acknowledgements

We thank Fausto Giunchiglia, Fiona McNeill, Mikalai Yatskevich and Aliaksandr Autayeu for many fruitful discussions on the structure-preserving semantic matching. This work has been partly supported by the FP6 OpenKnowledge European STREP project (FP6-027253). The second author appreciates support from the Trentino as a Lab (TasLab) initiative of the European Network of the Living Labs at Informatica Trentina.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lorenzino Vaccari.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Vaccari, L., Shvaiko, P., Pane, J. et al. An evaluation of ontology matching in geo-service applications. Geoinformatica 16, 31–66 (2012). https://doi.org/10.1007/s10707-011-0125-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10707-011-0125-8

Keywords

Navigation