Skip to main content

Statistical Profiles of Words for Ontology Enrichment

  • Conference paper
Soft Methods in Probability, Statistics and Data Analysis

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 16))

  • 394 Accesses

Abstract

The following paper focuses on an enrichment method for ontologies. We define similarities of possible new concepts and base the similarity and dissimilarity of concepts on the usage statistics in large corpora. The method is soft in the sense, that we define a semantically motivated heuristics for the influence of different linguistic properties influencing the similarity definition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Stumme, G., Mädche A.: FCA-Merge: A Bottom-Up Approach for Merging Ontologies JCAI ‘01 - Proceedings of the 17th International Joint Conference on Artificial Intelligence, Seattle, USA, August, 1–6, 2001, San Francisco/CA: Morgan Kaufmann, 2001

    Google Scholar 

  2. Kullback, S.: Information Theory and Statistics. John Wiley and Sons, New York, 1959.

    MATH  Google Scholar 

  3. Resnik,P.: Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language, Journal of Artificial Intelligence Research, vol. 11, 1999

    Google Scholar 

  4. Dagan I., Perreira F., Lee L.: Similarity-based Estimation of Word Cooccurence Probabilities, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, ACL’ 94, New Mexico State University, June 1994

    Google Scholar 

  5. the Cosmas corpus querying service, http://corpora.ids-mannheim.de/~cosmas/

  6. the Wortschatz corpus querying service, http://wortschatz.uni-leipzig.de/

  7. Bisson, G. and Nedellec, C. and L. Canamero: Designing clustering methods for ontology building - The Mo’K workbench, Proceedings of the Ontology Learning ECAI-2000 Workshop, August 2000

    Google Scholar 

  8. Sahlgren, M.: Vector-Based Semantic Analysis: Representing Word Meanings Based on Random Labels, Proceedings of the ESSLLI 2001 Workshop on Semantic Knowledge Acquisition and Categorisation, Helsinki, Finland, 2001

    Google Scholar 

  9. Spark-Jones K.: Readings in Information Retrieval,Morgan Kaufmann, 1997

    Google Scholar 

  10. Lagus, K.: Studying similarities in term usage with self-organizing maps. Proceedings of NordTerm 2001, Tuusula, Finland. pp. 34–45, 2001

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Faatz, A., Seeberg, C., Steinmetz, R. (2002). Statistical Profiles of Words for Ontology Enrichment. In: Grzegorzewski, P., Hryniewicz, O., Gil, M.Á. (eds) Soft Methods in Probability, Statistics and Data Analysis. Advances in Intelligent and Soft Computing, vol 16. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1773-7_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-7908-1773-7_30

  • Publisher Name: Physica, Heidelberg

  • Print ISBN: 978-3-7908-1526-9

  • Online ISBN: 978-3-7908-1773-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics