Abstract
This paper presents a joint model designed to measure local text coherence that uses Rhetorical Structure Theory (RST) and entity grids. The purpose is to learn patterns of entity distribution in texts by considering entity transition sequences and organizational/discourse information using RST relations in order to create a predictive model that is able to distinguish coherent from incoherent texts. In an evaluation with newspaper texts, the proposed model outperformed other methods in the area.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Althaus, E., Karamanis, N., Koller, A.: Computing locally coherent discourse. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, article 399, Stroudsburg, PA, USA (2004)
Barzilay, R., Lapata, M.: Modeling local coherence: An entity-based approach. Computational Linguistics 34, 1–34 (2008)
Bick, E.: The Parsing System Palavras, Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. Aarhus University Press (2000)
Bosma, W.: Query-Based Summarization using Rhetorical Structure Theory. In: Proceedings of the 15th Meetings of CLIN, LOT, Utrecht, pp. 29–44 (2004)
Burstein, J., Tetreault, J., Andreyev, S.: Using entity-based features to model coherence in student essays. In: Human Language Technologies: In Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 681–684 (2010)
Cardoso, P., Maziero, E., Jorge, M., Seno, E., di Felippo, A., Rino, L., Nunes, M., Pardo, T.: Cstnews - a discourse-annotated corpus for single and multi-document summarizationof news texts in brazilian portuguese. In: Proceedings of the 3rd RST Brazilian Meeting, pp. 88–105 (2011)
Cunha, I., Torres-Moreno, J.-M., Sierra, G.: On the Development of the RST Spanish Treebank. In: Proceedings of the 5th Linguistic Annotation Workshop, Portland-Oregon, pp. 1–10 (2011)
Dijk, T.V., Kintsch, W.: Strategics in discourse comprehension. Academic Press, New York (1983)
Filippova, K., Strube, M.: Extending the entity-grid coherence model to semantically related entities. In: Proceedings of the Eleventh European Workshop on Natural Language Generations, pp. 139–142 (2007)
Foltz, P.W., Kintsch, W., Landauer, T.K.: The Measurement of textual coherence using latent semantic analysis. Discourse Processes 25(2-3), 285–307 (1998)
Freitas, A.P., Feltrim, V.D.: Análise Automática de Coerência Usando o Modelo Grade de Entidades para o Português. In: Proceedings of the IX Brazilian Symposium in Information and Human Language Technology, Fortaleza, CE, Brazil, pp. 69–78 (2013)
Grosz, B., Aravind, K.J., Scott, W.: Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21, 203–225 (1995)
Iida, R., Tokunaga, T.: A metric for evaluating discourse coherence based on coreference resolution. In: Proceedings of the COLING 2012: Posters, Mumbai, India, pp. 483–494 (2012)
Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 133–142 (2002)
Karamanis, N., Poesio, M., Mellish, C., Oberlander, J.: Evaluating centering-based metrics of coherence for text structuring using a reliably annotated corpus. In: Proceedings of the 42nd Annual Meetings of the Association for Computational Linguistics, article 391 (2004)
Kibble, R., Power, R.: Optimising referential coherence in text generation. Computational Linguistic 30(4), 401–416 (2004)
Koch, I.V., Travaglia, L.C.: A Coerência Textual, 14th edn. Contexto, São Paulo (2002)
Lapata, M.: Probabilistic texts structuring: Experiments with sentence ordering. In: Proceeding of the 2nd Human Language Technology Conference and Annual Meeting of the North American Chapter of the Association for Computational Linguistics, pp. 545–552 (2003)
Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation to coreference resolution. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, pp. 104–111 (1997)
Lin, Z., Ng, H.T., Kan, M.Y.: Automatically evaluating text coherence using discourse relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Stroudsburg, PA, USA, vol. 1, pp. 997–1006 (2011)
Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: Toward a functional theory of text organization. Text 8(3), 243–281 (1988)
Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: A Theory of Text Organization. Technical Report from Information Sciences Institute (ISI), ISI/RS-87-190, pp. 1-91. University of Southern California, USA (1987)
Marcu, D.: The Rhetorical Parsing of Unrestricted Texts: A Surface-based Approach. Computational Linguistics 26, 396–448 (2000)
Maziero, E., Pardo, T.A.S.: Automatização de um método de avaliação de estruturas retóricas. In Proceedings of the RST Brazilian Meeting (2009)
Mckoon, G., Ratcliff, R.: Inference during reading. Psychological Review, 440-446 (1992)
Radev, D.: A common theory of information fusion from multiple text sources, step one: Cross-document structure. In: Proceedings of the 1st ACL SIGDIAL Workshop on Discourse and Dialogue, Hong Kong, pp. 74–83 (2000)
Ribeiro, G.F., Rino, L.H.M.: A Sumarização Automática com Base em Estruturas RST. Technical Reports from Interinstitutional Center for Computational Linguistics, University of São Paulo, NILC-TR-02-05. São Carlos, Brazil (2002)
Salton, G.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing and Management, 513–523 (1988)
Seno, E.R.M.: Rhesumarst: Um sumarizador automático de estruturas rst. Master Thesis. University of São Carlos. São Carlos/SP (2005)
Webber, B.: D-ltag: Extending lexicalized tag to discourse. Cognitive Science 28(5), 751–779 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
de S. Dias, M., Feltrim, V.D., Pardo, T.A.S. (2014). Using Rhetorical Structure Theory and Entity Grids to Automatically Evaluate Local Coherence in Texts. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-09761-9_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09760-2
Online ISBN: 978-3-319-09761-9
eBook Packages: Computer ScienceComputer Science (R0)