Abstract
Causative verbs can assist in the identification of causative relations. Portuguese has a large number of verbs that would make the manual labelling of causative verbs an manually expensive task. This paper presents a classification strategy which uses the characteristics of causative verbs co-occurring with common nouns to classify Brazilian Portuguese verbs as either: causative or non-causative. The strategy constructs a graph where verbs extracted from text are nodes. The verbs are connected if the verbs co-occur with common nouns. The classification strategy uses the unique characteristics of links between: 1. causative verbs, 2. causative verbs and non-causative verbs and 3. non-causative verbs to predict a label (causative or non-causative) for unlabelled verbs. The proposed strategy significantly outperforms a baseline and supervised learning strategies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bird, S.: Nltk: The natural language toolkit. In: Proceedings of the ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics. Association for Computational Linguistics, Philadelphia (2002)
de Alencar, L.F.: Aelius: Uma ferramenta para anotaçáo automática de corpora usando o nltk. In: ELC 2010, The 9th Brazilian Corpus Linguistics Meeting (2010)
Getoor, L., Diehl, C.P.: Link mining: A survey. SIGKDD Explor. Newsl. 7(2), 3–12 (2005)
Girju, R.: Automatic detection of causal relations for question answering. In: Proceedings of the ACL 2003 Workshop on Multilingual Summarization and Question Answering, vol. 12, pp. 76–83. Association for Computational Linguistics (2003)
Joanis, E., Stevenson, S., James, D.: A general feature space for automatic verb classification. Natural Language Engineering 14(3), 337–367 (2008)
Khoo, C.S.G., Chan, S., Niu, Y.: Extracting causal knowledge from a medical database using graphical patterns. In: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, ACL 2000, pp. 336–343. ssociation for Computational Linguistics (2000)
Merlo, P., Stevenson, S.: Automatic verb classification based on statistical distributions of argument structure. Comput. Linguist. 27(3), 373–408 (2001)
Miller, G.A.: Wordnet: A lexical database for english. Communications of the ACM 38, 39–41 (1995)
Stevenson, S., Joanis, E.: Semi-supervised verb class discovery using noisy features. In: Proceedings of the Seventh Conference on Natural Language Learning (2003)
Tsang, V., Stevenson, S.: Calculating semantic distance between word sense probability distributions. In: Proceedings of the Eighth Conference on Computational Natural Language Learning (CoNLL 2004), pp. 81–88 (2004)
Tsang, V., Stevenson, S.: A graph-theoretic framework for semantic distance. Comput. Linguist. 36(1), 31–69 (2010)
Xie, J., Szymanski, B.K.: Labelrank: A stabilized label propagation algorithm for community detection in networks. CoRR, abs/1303.0868 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Drury, B., Rossi, R.G., de Andrade Lopes, A. (2014). Identification of Brazilian Portuguese Causative Verbs through a Weighted Graph Classification Strategy. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-09761-9_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09760-2
Online ISBN: 978-3-319-09761-9
eBook Packages: Computer ScienceComputer Science (R0)