Abstract
In this paper, we propose a new methodology for semi-supervised acquisition of lexical taxonomies. Our approach is based on the theory of pretopology that offers a powerful formalism to model semantic relations and transforms a list of terms into a structured term space by combining different discriminant criteria. In order to learn a parameterized pretopological space, we define the Learning Pretopological Spaces strategy based on genetic algorithms. In particular, rare but accurate pieces of knowledge are used to parameterize the different criteria defining the pretopological term space. Then, a structuring algorithm is used to transform the pretopological space into a lexical taxonomy. Results over three standard datasets evidence improved performances against state-of-the-art associative and pattern-based approaches.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Belmandt, Z.T.: Basics of Pretopology. Hermann (2011)
Bordea, G., Buitelaar, P., Faralli, S., Navigli, R.: Semeval-2015 task 17: Taxonomy extraction evaluation (texeval). In: Proceedings of the 9th International Workshop on Semantic Evaluation. Association for Computational Linguistics (2015)
Cimiano, P., Hotho, A., Staab, S.: Learning concept hierarchies from text corpora using formal concept anaylsis. Journal of Artificial Intelligence Research 24, 305–339 (2005)
Cleuziou, G., Buscaldi, D., Levorato, V., Dias, G.: A pretopological framework for the automatic construction of lexical-semantic structures from texts. In: 20th ACM International Conference on Information and Knowledge Management (CIKM), pp. 2453–2456 (2011)
Kozareva, Z., Hovy. E.: Tailoring the automated construction of large-scale taxonomies using the web. Language Resource Evaluation 47(3) (2013)
Largeron, C., Bonnevay, S.: A pretopological approach for structural analysis. Information Sciences 144, 169–185 (2002)
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Introduction to wordnet: An on-line lexical database. International Journal of Lexicography 3(4), 235–244 (1990)
Navigli, R., Velardi, P.: Learning domain ontologies from document warehouses and dedicated websites. Computational Linguistics 30(2), 151–179 (2004)
Paaß, G., Kindermann, J., Leopold, E.: Learning prototype ontologies by hierachical latent semantic analysis. In: Workshop on Knowledge Discovery and Ontologies at the joint European Conferences on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD) (2004)
Pereira, F., Tishby, N., Lee, L.: Distributional clustering of english words. In: 31st Annual Meeting on Association for Computational Linguistics (ACL), pp. 183–190 (1993)
Van Rijsbergen, C.J.: Information Retrieval, 2nd edn. Butterworth-Heinemann, Newton (1979)
Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pp. 206–213 (1999)
Scrucca, L.: Ga: A package for genetic algorithms in r. Journal of Statistical Software 53(4), 1–37 (2013)
Snow, R., Jurafsky, D., Ng, Y.A.: Semantic taxonomy induction from heterogenous evidence. In: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pp. 801–808 (2006)
Velardi, P., Faralli, S., Navigli, R.: OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction. Computational Linguistics 39(3), 665–707 (2013)
Yang, H., Callan, J.: A metric-based framework for automatic taxonomy induction. In: Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 271–279 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Cleuziou, G., Dias, G. (2015). Learning Pretopological Spaces for Lexical Taxonomy Acquisition. In: Appice, A., Rodrigues, P., Santos Costa, V., Gama, J., Jorge, A., Soares, C. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science(), vol 9285. Springer, Cham. https://doi.org/10.1007/978-3-319-23525-7_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-23525-7_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23524-0
Online ISBN: 978-3-319-23525-7
eBook Packages: Computer ScienceComputer Science (R0)