Abstract
CWSD (Combined Word Sense Disambiguation) is an algorithm for the automatic annotation of structured and semi-structured data sources. Instead of being targeted to textual data sources like most of the traditional WSD algorithms, CWSD can exploit knowledge from the structure of data sources together with the lexical knowledge associated with schema elements (terms in the following).
We integrated CWSD in the MOMIS system (Mediator EnvirOment forMultiple Information Sources) [1], which is an \({{\it I}^3}\) framework designed for the integration of data sources, where the lexical annotation of terms was performed manually by the user. CWSD combines a structural disambiguation algorithm, that starts the disambiguation process by using the semantic relationships extracted from the data source schemata and a WordNet Domains based disambiguation algorithm, which refines terms disambiguation by using domains information.
This work was partially supported by MUR FIRB Network Peer for Business project (http://www.dbgroup.unimo.it/nep4b) and by the IST FP6 STREP project 2006 STASIS (http://www.dbgroup.unimo.it/stasis).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bergamaschi, S., Castano, S., Beneventano, D., Vincini, M.: Semantic integration of heterogeneous information sources. Journal of Data and Knowledge Engineering 36(3), 215–249 (2001)
Gliozzo, A.M., Strapparava, C., Dagan, I.: Unsupervised and supervised exploitation of semantic domains in lexical disambiguation. Computer Speech & Language 18(3), 275–299 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bergamaschi, S., Po, L., Sorrentino, S. (2007). Automatic Annotation in Data Integration Systems. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems 2007: OTM 2007 Workshops. OTM 2007. Lecture Notes in Computer Science, vol 4805. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76888-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-76888-3_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76887-6
Online ISBN: 978-3-540-76888-3
eBook Packages: Computer ScienceComputer Science (R0)