Abstract
The Semantic Textual Similarity (STS) task aims capturing a bidirectional-graded equivalence between the pair of short texts. This work proposes a STS measure for the Portuguese Language based on Semantic Inferentialism Model (SIM) and InferenceNet.BR. We argue that the expression of inferential, causal, motivational and encyclopedic content of InferenceNet enables a more robust and efficient model for the STS task. An extrinsic evaluation in a Portuguese-language processing application - a Case-Based Reasoning system for Requirements Engineering - provided real scenario to assess how the proposed STS measure contributes to the effectiveness of NLP applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, E., Cer, D., Diab, M., Gonzalez-Agirre, A., Guo, W.: Sem 2013 shared task: Semantic Textual Similarity. In: SEM 2013: The Second Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics (2013)
Kauchak, D., Barzilay, R.: Paraphrasing for automatic evaluation. In: HLT-NAACL 2006, pp. 455–462 (2006)
Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 841–842. ACM (2010)
Albuquerque, A., Pinheiro, V., Leite, T.: Reuse of Experiences Applied to Requirements Engineering: An Approach Based on Natural Language Processing. In: Proceedings of the 24th International Conference on Software Engineering and Knowledge Engineering, SEKE 2012, São Francisco, CA (2012)
Pinheiro, V., Furtado, V., Pequeno, T., Franco, W.A.: Semi-Automated Method for Acquisition of Common-sense and Inferentialist Knowledge. Journal of the Brazilian Computer Society 19, 75–87 (2013), doi:10.1007/s13173-012-
Harris, Z.: Mathematical Structures of Language. Wiley, New York (1968)
Pinheiro, V., Pequeno, T., Furtado, V., Nogueira, D.: Semantic Inferentialist Analyser: Um Analisador Semântico de Sentençasem Linguagem Natural. In: Proceedings of the 7th Brazilian Symposium in Information and Human Language Technology, STIL, Brasil (2009)
Pinheiro, V., Pequeno, T., Furtado, V., Franco, W.: InferenceNet.Br: Expression of Inferentialist Semantic Content of the Portuguese Language. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds.) PROPOR 2010. LNCS (LNAI), vol. 6001, pp. 90–99. Springer, Heidelberg (2010)
Pinheiro, V., Furtado, V., Pequeno, T., Ferreira, C.: Towards a common sense base in Portuguese for the linked open data cloud. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds.) PROPOR 2012. LNCS (LNAI), vol. 7243, pp. 128–138. Springer, Heidelberg (2012)
Pinheiro, V., Pequeno, T., Furtado, V.: Um Analisador Semântico Inferencialis ta de Sentenças em Linguagem Natural. Linguamática 2(1), 111–130 (2010) ISSN: 1647-0818
Han, L., Kashyap, A.L., Finin, T., Mayfield, J., Weese, J.: UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems. In: Proceedings of the Second Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics (2013)
Meadow, C.T.: Text Information Retrieval Systems. Academic Press, Inc. (1992)
Mihalcea, R., Corley, C., Strapparava, C.: Corpus-based and knowledge-based measures of text semantic similarity. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 775–780. AAAI Press (2006)
Saric, F., Glavas, G., Karan, M., Snajder, J., Basic, B.: Takelab: systems for measuring semantic text similarity. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics, pp. 441–448. Association for Computational Linguistics (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Pinheiro, V., Furtado, V., Albuquerque, A. (2014). Semantic Textual Similarity of Portuguese-Language Texts: An Approach Based on the Semantic Inferentialism Model. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-09761-9_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09760-2
Online ISBN: 978-3-319-09761-9
eBook Packages: Computer ScienceComputer Science (R0)