Skip to main content

Semantic Textual Similarity of Portuguese-Language Texts: An Approach Based on the Semantic Inferentialism Model

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8775))

Abstract

The Semantic Textual Similarity (STS) task aims capturing a bidirectional-graded equivalence between the pair of short texts. This work proposes a STS measure for the Portuguese Language based on Semantic Inferentialism Model (SIM) and InferenceNet.BR. We argue that the expression of inferential, causal, motivational and encyclopedic content of InferenceNet enables a more robust and efficient model for the STS task. An extrinsic evaluation in a Portuguese-language processing application - a Case-Based Reasoning system for Requirements Engineering - provided real scenario to assess how the proposed STS measure contributes to the effectiveness of NLP applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agirre, E., Cer, D., Diab, M., Gonzalez-Agirre, A., Guo, W.: Sem 2013 shared task: Semantic Textual Similarity. In: SEM 2013: The Second Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics (2013)

    Google Scholar 

  2. Kauchak, D., Barzilay, R.: Paraphrasing for automatic evaluation. In: HLT-NAACL 2006, pp. 455–462 (2006)

    Google Scholar 

  3. Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., Demirbas, M.: Short text classification in twitter to improve information filtering. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 841–842. ACM (2010)

    Google Scholar 

  4. Albuquerque, A., Pinheiro, V., Leite, T.: Reuse of Experiences Applied to Requirements Engineering: An Approach Based on Natural Language Processing. In: Proceedings of the 24th International Conference on Software Engineering and Knowledge Engineering, SEKE 2012, São Francisco, CA (2012)

    Google Scholar 

  5. Pinheiro, V., Furtado, V., Pequeno, T., Franco, W.A.: Semi-Automated Method for Acquisition of Common-sense and Inferentialist Knowledge. Journal of the Brazilian Computer Society 19, 75–87 (2013), doi:10.1007/s13173-012-

    Article  Google Scholar 

  6. Harris, Z.: Mathematical Structures of Language. Wiley, New York (1968)

    MATH  Google Scholar 

  7. Pinheiro, V., Pequeno, T., Furtado, V., Nogueira, D.: Semantic Inferentialist Analyser: Um Analisador Semântico de Sentençasem Linguagem Natural. In: Proceedings of the 7th Brazilian Symposium in Information and Human Language Technology, STIL, Brasil (2009)

    Google Scholar 

  8. Pinheiro, V., Pequeno, T., Furtado, V., Franco, W.: InferenceNet.Br: Expression of Inferentialist Semantic Content of the Portuguese Language. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds.) PROPOR 2010. LNCS (LNAI), vol. 6001, pp. 90–99. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  9. Pinheiro, V., Furtado, V., Pequeno, T., Ferreira, C.: Towards a common sense base in Portuguese for the linked open data cloud. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds.) PROPOR 2012. LNCS (LNAI), vol. 7243, pp. 128–138. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  10. Pinheiro, V., Pequeno, T., Furtado, V.: Um Analisador Semântico Inferencialis ta de Sentenças em Linguagem Natural. Linguamática 2(1), 111–130 (2010) ISSN: 1647-0818

    Google Scholar 

  11. Han, L., Kashyap, A.L., Finin, T., Mayfield, J., Weese, J.: UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems. In: Proceedings of the Second Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics (2013)

    Google Scholar 

  12. Meadow, C.T.: Text Information Retrieval Systems. Academic Press, Inc. (1992)

    Google Scholar 

  13. Mihalcea, R., Corley, C., Strapparava, C.: Corpus-based and knowledge-based measures of text semantic similarity. In: Proceedings of the 21st National Conference on Artificial Intelligence, pp. 775–780. AAAI Press (2006)

    Google Scholar 

  14. Saric, F., Glavas, G., Karan, M., Snajder, J., Basic, B.: Takelab: systems for measuring semantic text similarity. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics, pp. 441–448. Association for Computational Linguistics (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Pinheiro, V., Furtado, V., Albuquerque, A. (2014). Semantic Textual Similarity of Portuguese-Language Texts: An Approach Based on the Semantic Inferentialism Model. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09761-9_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09760-2

  • Online ISBN: 978-3-319-09761-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics