Statistical Profiles of Words for Ontology Enrichment

Faatz, Andreas; Seeberg, Cornelia; Steinmetz, Ralf

doi:10.1007/978-3-7908-1773-7_30

Andreas Faatz⁴,
Cornelia Seeberg⁴ &
Ralf Steinmetz⁴

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 16))

394 Accesses

Abstract

The following paper focuses on an enrichment method for ontologies. We define similarities of possible new concepts and base the similarity and dissimilarity of concepts on the usage statistics in large corpora. The method is soft in the sense, that we define a semantically motivated heuristics for the influence of different linguistic properties influencing the similarity definition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Stumme, G., Mädche A.: FCA-Merge: A Bottom-Up Approach for Merging Ontologies JCAI ‘01 - Proceedings of the 17th International Joint Conference on Artificial Intelligence, Seattle, USA, August, 1–6, 2001, San Francisco/CA: Morgan Kaufmann, 2001
Google Scholar
Kullback, S.: Information Theory and Statistics. John Wiley and Sons, New York, 1959.
MATH Google Scholar
Resnik,P.: Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language, Journal of Artificial Intelligence Research, vol. 11, 1999
Google Scholar
Dagan I., Perreira F., Lee L.: Similarity-based Estimation of Word Cooccurence Probabilities, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, ACL’ 94, New Mexico State University, June 1994
Google Scholar
the Cosmas corpus querying service, http://corpora.ids-mannheim.de/~cosmas/
the Wortschatz corpus querying service, http://wortschatz.uni-leipzig.de/
Bisson, G. and Nedellec, C. and L. Canamero: Designing clustering methods for ontology building - The Mo’K workbench, Proceedings of the Ontology Learning ECAI-2000 Workshop, August 2000
Google Scholar
Sahlgren, M.: Vector-Based Semantic Analysis: Representing Word Meanings Based on Random Labels, Proceedings of the ESSLLI 2001 Workshop on Semantic Knowledge Acquisition and Categorisation, Helsinki, Finland, 2001
Google Scholar
Spark-Jones K.: Readings in Information Retrieval,Morgan Kaufmann, 1997
Google Scholar
Lagus, K.: Studying similarities in term usage with self-organizing maps. Proceedings of NordTerm 2001, Tuusula, Finland. pp. 34–45, 2001
Google Scholar

Download references

Author information

Authors and Affiliations

Multimedia Communications Lab, Darmstadt University of Technology, Merckstrasse 25, 64283, Darmstadt, Germany
Andreas Faatz, Cornelia Seeberg & Ralf Steinmetz

Authors

Andreas Faatz
View author publications
You can also search for this author in PubMed Google Scholar
Cornelia Seeberg
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Steinmetz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Przemysław Grzegorzewski & Olgierd Hryniewicz &
Facultad de Ciencias, Departamento de Estadística e I.O. y D.M., Universidad de Oviedo, C/Calvo Sotelo, s/n, 33007, Oviedo, Spain
María Ángeles Gil

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Faatz, A., Seeberg, C., Steinmetz, R. (2002). Statistical Profiles of Words for Ontology Enrichment. In: Grzegorzewski, P., Hryniewicz, O., Gil, M.Á. (eds) Soft Methods in Probability, Statistics and Data Analysis. Advances in Intelligent and Soft Computing, vol 16. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1773-7_30

Download citation

DOI: https://doi.org/10.1007/978-3-7908-1773-7_30
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1526-9
Online ISBN: 978-3-7908-1773-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics