Abstract
This paper presents an enrichment method for ontologies. We define similarity measures for candidate new concepts, basing the similarity and dissimilarity of concepts on their usage statistics in large corpora. The method is soft in the sense that we define a semantically motivated heuristic for how strongly different linguistic properties influence the similarity definition.
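To make the idea of corpus-based similarity concrete, the following is a minimal sketch and not the authors' method: it builds a co-occurrence profile for each word from a tokenised corpus and compares two profiles with a weighted cosine measure. The function names `profile` and `weighted_cosine`, the window size, and the optional `weights` mapping (a stand-in for a heuristic that lets some context words count more than others) are all assumptions made for illustration.

```python
from collections import Counter
import math


def profile(tokens, target, window=2):
    """Count the words that occur within +/- `window` tokens of `target`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != target:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[tokens[j]] += 1
    return counts


def weighted_cosine(p, q, weights=None):
    """Cosine similarity of two profiles.

    `weights` is a hypothetical mapping from context word to a
    non-negative weight; words not listed get weight 1.0.
    """
    weights = weights or {}

    def w(word):
        return weights.get(word, 1.0)

    dot = sum(w(x) * p[x] * q[x] for x in p.keys() & q.keys())
    norm_p = math.sqrt(sum(w(x) * v * v for x, v in p.items()))
    norm_q = math.sqrt(sum(w(x) * v * v for x, v in q.items()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)


if __name__ == "__main__":
    corpus = "the cat drinks milk the dog drinks water the car needs fuel".split()
    cat, dog, car = (profile(corpus, t) for t in ("cat", "dog", "car"))
    print(weighted_cosine(cat, dog), weighted_cosine(cat, car))
```

In this toy corpus, "cat" and "dog" share more context words than "cat" and "car", so their profiles score as more similar. Setting a small weight for a frequent function word such as "the" reduces its influence on the score, which is one simple way such a weighting heuristic could be expressed.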
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Faatz, A., Seeberg, C., Steinmetz, R. (2002). Statistical Profiles of Words for Ontology Enrichment. In: Grzegorzewski, P., Hryniewicz, O., Gil, M.Á. (eds) Soft Methods in Probability, Statistics and Data Analysis. Advances in Intelligent and Soft Computing, vol 16. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1773-7_30
DOI: https://doi.org/10.1007/978-3-7908-1773-7_30
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1526-9
Online ISBN: 978-3-7908-1773-7
eBook Packages: Springer Book Archive