Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4805))

Abstract

CWSD (Combined Word Sense Disambiguation) is an algorithm for the automatic annotation of structured and semi-structured data sources. Instead of being targeted to textual data sources like most of the traditional WSD algorithms, CWSD can exploit knowledge from the structure of data sources together with the lexical knowledge associated with schema elements (terms in the following).

We integrated CWSD in the MOMIS system (Mediator EnvirOment forMultiple Information Sources) [1], which is an \({{\it I}^3}\) framework designed for the integration of data sources, where the lexical annotation of terms was performed manually by the user. CWSD combines a structural disambiguation algorithm, that starts the disambiguation process by using the semantic relationships extracted from the data source schemata and a WordNet Domains based disambiguation algorithm, which refines terms disambiguation by using domains information.

This work was partially supported by MUR FIRB Network Peer for Business project (http://www.dbgroup.unimo.it/nep4b) and by the IST FP6 STREP project 2006 STASIS (http://www.dbgroup.unimo.it/stasis).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bergamaschi, S., Castano, S., Beneventano, D., Vincini, M.: Semantic integration of heterogeneous information sources. Journal of Data and Knowledge Engineering 36(3), 215–249 (2001)

    Article  MATH  Google Scholar 

  2. Gliozzo, A.M., Strapparava, C., Dagan, I.: Unsupervised and supervised exploitation of semantic domains in lexical disambiguation. Computer Speech & Language 18(3), 275–299 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Robert Meersman Zahir Tari Pilar Herrero

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bergamaschi, S., Po, L., Sorrentino, S. (2007). Automatic Annotation in Data Integration Systems. In: Meersman, R., Tari, Z., Herrero, P. (eds) On the Move to Meaningful Internet Systems 2007: OTM 2007 Workshops. OTM 2007. Lecture Notes in Computer Science, vol 4805. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76888-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-76888-3_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76887-6

  • Online ISBN: 978-3-540-76888-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics