Skip to main content

Identification of Brazilian Portuguese Causative Verbs through a Weighted Graph Classification Strategy

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2014)

Abstract

Causative verbs can assist in the identification of causative relations. Portuguese has a large number of verbs that would make the manual labelling of causative verbs an manually expensive task. This paper presents a classification strategy which uses the characteristics of causative verbs co-occurring with common nouns to classify Brazilian Portuguese verbs as either: causative or non-causative. The strategy constructs a graph where verbs extracted from text are nodes. The verbs are connected if the verbs co-occur with common nouns. The classification strategy uses the unique characteristics of links between: 1. causative verbs, 2. causative verbs and non-causative verbs and 3. non-causative verbs to predict a label (causative or non-causative) for unlabelled verbs. The proposed strategy significantly outperforms a baseline and supervised learning strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bird, S.: Nltk: The natural language toolkit. In: Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Association for Computational Linguistics, Philadelphia (2002)

    Google Scholar 

  2. de Alencar, L.F.: Aelius: Uma ferramenta para anotaçáo automática de corpora usando o nltk. In: ELC 2010, The 9th Brazilian Corpus Linguistics Meeting (2010)

    Google Scholar 

  3. Getoor, L., Diehl, C.P.: Link mining: A survey. SIGKDD Explor. Newsl. 7(2), 3–12 (2005)

    Article  Google Scholar 

  4. Girju, R.: Automatic detection of causal relations for question answering. In: Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, vol. 12, pp. 76–83. Association for Computational Linguistics (2003)

    Google Scholar 

  5. Joanis, E., Stevenson, S., James, D.: A general feature space for automatic verb classification. Natural Language Engineering 14(3), 337–367 (2008)

    Article  Google Scholar 

  6. Khoo, C.S.G., Chan, S., Niu, Y.: Extracting causal knowledge from a medical database using graphical patterns. In: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, ACL 2000, pp. 336–343. ssociation for Computational Linguistics (2000)

    Google Scholar 

  7. Merlo, P., Stevenson, S.: Automatic verb classification based on statistical distributions of argument structure. Comput. Linguist. 27(3), 373–408 (2001)

    Article  Google Scholar 

  8. Miller, G.A.: Wordnet: A lexical database for english. Communications of the ACM 38, 39–41 (1995)

    Article  Google Scholar 

  9. Stevenson, S., Joanis, E.: Semi-supervised verb class discovery using noisy features. In: Proceedings of the Seventh Conference on Natural Language Learning (2003)

    Google Scholar 

  10. Tsang, V., Stevenson, S.: Calculating semantic distance between word sense probability distributions. In: Proceedings of the Eighth Conference on Computational Natural Language Learning (CoNLL 2004), pp. 81–88 (2004)

    Google Scholar 

  11. Tsang, V., Stevenson, S.: A graph-theoretic framework for semantic distance. Comput. Linguist. 36(1), 31–69 (2010)

    Article  MathSciNet  Google Scholar 

  12. Xie, J., Szymanski, B.K.: Labelrank: A stabilized label propagation algorithm for community detection in networks. CoRR, abs/1303.0868 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Drury, B., Rossi, R.G., de Andrade Lopes, A. (2014). Identification of Brazilian Portuguese Causative Verbs through a Weighted Graph Classification Strategy. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09761-9_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09760-2

  • Online ISBN: 978-3-319-09761-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics