Abstract
This paper presents the review and evaluation of DiZer – an automatic discourse analyzer for Brazilian Portuguese. Based on Rhetorical Structure Theory, DiZer is a symbolic analyzer that makes use of linguistic patterns learned from a corpus of scientific texts to identify and build the discourse structure of texts. DiZer evaluation shows satisfactory results for scientific texts. In order to test its portability, DiZer is also evaluated with news texts and presents acceptable performance.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aires, R.V.X., Aluísio, S.M., Kuhn, D.C.S., Andreeta, M.L.B., Oliveira Jr., O.N.: Combining Multiple Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese. In: The Proceedings of the Brazilian AI Symposium – SBIA, pp. 20–22 (2000)
Carlson, L., Marcu, D.: Discourse Tagging Reference Manual. ISI Technical Report ISI-TR-545 (2001)
Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis, University of California, Santa Barbara, CA, USA (1998)
Cristea, D., Ide, N., Romary, L.: Veins Theory, An Approach to Global Cohesion and Coherence. In: The Proceedings of Coling/ACL (1998)
Grosz, B., Sidner, C.: Attention, Intentions, and the Structure of Discourse. Computational Linguistics 12(3) (1986)
Jordan, M.P.: An Integrated Three-Pronged Analysis of a Fund-Raising Letter. In: Mann, W.C., Thompson, S.A. (eds.) Discourse Description: Diverse Linguistic Analyses of a Fund-Raising Text, pp. 171–226 (1992)
Kehler, A.: Coherence, Reference and the Theory of Grammar. CSLI Publications (2002)
Mann, W.C. and Thompson, S.A, Rhetorical Structure Theory: A Theory of Text Organization. Technical Report ISI/RS-87-190 (1987)
Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)
Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, Cambridge (2000)
O’Donnell, M.: Variable-Length On-Line Document Generation. In: The Proceedings of the 6th European Workshop on Natural Language Generation. Gerhard-Mercator University, Duisburg (1997)
Pardo, T.A.S.: Métodos para Análise Discursiva Automática. PhD Thesis. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, June 2005, 211p. (2005)
Pardo, T.A.S., Nunes, M.G.V.: A Construção de um Corpus de Textos Científicos em Português do Brasil e sua Marcação Retórica. Technical Report N. 212. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, September 2003, 26p. (2003)
Pardo, T.A.S., Nunes, M.G.V.: Relações Retóricas e seus Marcadores Superficiais: Análise de um Corpus de Textos Científicos em Português do Brasil. Technical Report N. 231. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, April 2004, 73p. (2004)
Pardo, T.A.S., Nunes, M.G.V., Rino, L.H.M.: DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. In: Bazzan, A.L.C., Labidi, S. (eds.) SBIA 2004. LNCS (LNAI), vol. 3171, pp. 224–234. Springer, Heidelberg (2004)
Pardo, T.A.S., Seno, E.M.R.: Rhetalho: um corpus de referência anotado retoricamente. In: Anais do V Encontro de Corpora. São Carlos-SP, November 24-25 (2005)
Pereira, F.C.N., Warren, D.H.D.: Definite Clause Grammars for Language Analysis – A Survey of the Formalism and Comparison with Augmented Transition Networks. In: Artificial Intelligence, vol. 13, pp. 231–278 (1980)
Schauer, H.: Referential Structure and Coherence Structure. In: The Proceedings of TALN. Lausanne, Switzerland (2000)
Soricut, R., Marcu, D.: Sentence Level Discourse Parsing using Syntactic and Lexical Information. In: The Proceedings of HLT/NAACL (2003)
Sumita, K., Ono, K., Chino, T., Ukita, T., Amano, S.: A discourse structure analyzer for Japonese text. In: The Proceedings of the International Conference on Fifth Generation Computer Systems, Tokyo, Japan, vol. 2, pp. 1133–1140 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pardo, T.A.S., Nunes, M.d.G.V. (2006). Review and Evaluation of DiZer – An Automatic Discourse Analyzer for Brazilian Portuguese. In: Vieira, R., Quaresma, P., Nunes, M.d.G.V., Mamede, N.J., Oliveira, C., Dias, M.C. (eds) Computational Processing of the Portuguese Language. PROPOR 2006. Lecture Notes in Computer Science(), vol 3960. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11751984_19
Download citation
DOI: https://doi.org/10.1007/11751984_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34045-4
Online ISBN: 978-3-540-34046-1
eBook Packages: Computer ScienceComputer Science (R0)