Abstract
Text summarization with the consideration of coherence can be achieved by using discourse processing with the Rhetorical Structure Theory (RST). Additional problems on relational ambiguity may arise, especially in Thai. For example, the use of cue words, i.e. “tae/ ” (meaning “but”), can be identified as a contrast relation or an elaboration relation. Therefore, we propose the reduction of the ambiguity level by reducing the relation types to two, namely Coordinating and Subordinating relation. Our framework is to concentrate on coherence structuring which requires the following 3 steps: (1) identify an attachment point for an incoming discourse unit by using our Adaptive Right-frontier algorithm; (2) extract Coordinating and Subordinating relations through the identification of linguistic coherence features in the lexical and phrasal level, using Bayesian techniques; (3) construct coherence tree structures, The accuracy is 70.45% for the first step, 77.47% and 79.89% for COR and SUBR extraction respectively in the second step and 64.94% in constructing coherent tree of the third.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Edmundson, H.P.: New Method in Automatic Extracting. ACM 16(2), 264–285 (1969)
Hovy, E., Lin, C.: Automated text summarization in summarist. In: Proceedings of the Workshop on Intelligent Scalable Text Summarization, pp. 18–24 (1977)
Marcu, D.: The rhetorical parsing of natural language texts. In: Meeting of the Association for Computational Linguistics, pp. 96–103 (1997)
Cristea, D., Postolache, O., Pistol, I.: Summarisation through discourse structure [15], pp. 632–644
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: Toward a functional theory of text organization. Text 8(3), 243–281 (1998)
Moore, J.D., Pollack, M.E.: A problem for RST: The need for multi-level discourse analysis. Computational Linguistics 18(4), 537–544 (1992)
Hovy, E., Maier, E.: Parsimonious or profligate: How many and which discourse structure relations. In: Discourse Processes, pp. 18–24 (1977)
Asher, N., Lascarides, A.: Logics of Conversation. Studies in Natural Language Processing. Cambridge University Press, Cambridge (2005)
Polanyi, L.: A formal model of the structure of discourse. Journal of Pragmatics 12, 601–638 (1988)
Sassen, C., Kühnlein, P.: The right frontier constraint as conditional [15], pp. 222–225
Grosz, B.J., Joshi, A.K., Weinstein, S.: Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21(2), 203–225 (1995)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Kongwa, A., Kawtrakul, A.: Know-what: A development of object-property extraction from thai texts and query system. In: Proceeding of the Sixth Symposium on Natural Language Processing (2005)
Wattanamethanont, M., T.S., Kawtrakul, A.: Thai discourse relations recognition by using naive bayes classifier. In: The Proceedings of the Sixth Symposium on Natural Language Processing (2005)
Gelbukh, A. (ed.): CICLing 2005. LNCS, vol. 3406. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sukvaree, T., Kawtrakul, A., Caelen, J. (2007). Thai Text Coherence Structuring with Coordinating and Subordinating Relations for Text Summarization. In: Kokinov, B., Richardson, D.C., Roth-Berghofer, T.R., Vieu, L. (eds) Modeling and Using Context. CONTEXT 2007. Lecture Notes in Computer Science(), vol 4635. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74255-5_34
Download citation
DOI: https://doi.org/10.1007/978-3-540-74255-5_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74254-8
Online ISBN: 978-3-540-74255-5
eBook Packages: Computer ScienceComputer Science (R0)