Abstract
We describe a methodology for building event extraction systems. The approach is based on multilingual domain-specific grammars and exploits weakly supervised machine learning algorithms for lexical acquisition. We report on the process of adapting an existing event extraction system for the domain of conflicts and crises to the Portuguese language.
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zavarella, V., Tanev, H., Linge, J., Piskorski, J., Atkinson, M., Steinberger, R. (2010). Exploiting Multilingual Grammars and Machine Learning Techniques to Build an Event Extraction System for Portuguese. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds) Computational Processing of the Portuguese Language. PROPOR 2010. Lecture Notes in Computer Science, vol 6001. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12320-7_3
DOI: https://doi.org/10.1007/978-3-642-12320-7_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12319-1
Online ISBN: 978-3-642-12320-7
eBook Packages: Computer Science, Computer Science (R0)