Abstract
The paper presents a new hybrid schema matching algorithm: Semantic Structure Matching Recommendation Algorithm (SSRMA). SSRMA is able to discover lexical correspondences without breaking the structural ones — it is capable of rejecting trivial lexical similarities, if the structural context suggests that a given matching is inadequate. The algorithm enables achieving results that are comparable to those obtained by means of state-of-the-art schema matching solutions. The presented method involves an adaptable pre-processing and flexible internal data representation, which allows to use a variety of auxiliary data (e.g., textual corpora) and to increase the accuracy of semantic matches accommodated in a given domain. In order to increase the mapping quality, the method allows to extend the input data by auxiliary information that may have the form of ontologies or textual corpora.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rahm, E., Bernstein, P.: A survey of approaches to automatic schema matching. The VLDB Journal — The International Journal on Very Large Data Bases (2001)
INDECT Homepage, http://www.indect-project.eu
van Rijsbergen, C.: Information Retrieval. Butterworths, London (1979)
Manning, C., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2009)
Abteilung Datenbanken Leipzig: Abteilung Datenbanken Leipzig am Institut für Informatik. In: COMA++ Web Edition, http://139.18.13.36:8080/coma/WebEdition
Doan, A., Domingos, P., Halevy, A.: Learning to Match the Schemas of Data Sources: A Multistrategy Approach. Machine Learning 50(3), 279–301 (2003)
Li, W., Clifton, C.: SEMINT: a tool for identifying attribute correspondences in heterogeneous databases (2000)
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic schema matching with Cupid. In: Proc 27th Int Conf On Very Large Data Bases, pp. 49–58 (2001)
Do, H., Melnik, S., Rahm, E.: Comparison of schema matching evaluations. In: NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems (2003)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity flooding — a versatile graph matching algorithm. In: Proc 18th Int Conf Data Eng. (2002)
Doan, A., Madhavan, J., Domingos, P., Halevy, A.: Learning to Map between Ontologies on the Semantic Web. VLDB Journal, Special Issue on the Semantic Web (2003)
Bouquet, P., Serafini, L., Zanobini, S.: Semantic Coordination: A New Approach and an Application. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 130–145. Springer, Heidelberg (2003)
Euzenat, J., Mochol, M., Shvaiko, P., Stuckenschmidt, H., Šváb, O., Svátek, V., van Hag, W., Yatskevich, M.: Results of the Ontology Alignment Evaluation Initiative 2010. In: ISWC workshop on Ontology Matching OM–2006, pp. 73–95 (2006)
Landauer, T., Foltz, P., Laham, D.: Introduction to Latent Semantic Analysis, 259–284 (1998)
Ciesielczyk, M., Szwabe, A., Prus-Zajączkowski, B.: Interactive Collaborative Filtering with RI-based Approximation of SVD. In: Proceedings of PACIIA 2010 (2010)
Bouquet, P., Serafini, L., Zanobini, S.: Semantic coordination: a new approach and an application, Trento, Italy (2003)
Pankowski, T.: XML data integration in SixP2P — a theoretical framework. DaMaP 2008 Proceedings of the 2008 international workshop on Data management in peer-to-peer systems (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Szwabe, A., Jachnik, A., Figaj, A., Blinkiewicz, M. (2011). Semantic Structure Matching Recommendation Algorithm. In: Dziech, A., Czyżewski, A. (eds) Multimedia Communications, Services and Security. MCSS 2011. Communications in Computer and Information Science, vol 149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21512-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-21512-4_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21511-7
Online ISBN: 978-3-642-21512-4
eBook Packages: Computer ScienceComputer Science (R0)