Abstract
Nowadays, in the Natural Language Processing field, with the object of research gradually shifting from the word to sentence, paragraph and higher semantic units, discourse analysis is one crucial step toward a better understanding of how these articles are structured. Compared with micro-level, this has rarely been investigated in macro Chinese discourse analysis and faces tremendous challenges. First, it is harder to grasp the topic and recognize the relationship between macro discourse units due to their longer length and looser relation between them. Second, how to mine the relationship between nuclearity and relation recognition effectively is another challenge. To address these challenges, we propose a joint model of recognizing macro Chinese discourse nuclearity and relation based on Structure and Topic Gated Semantic Network (STGSN). It makes the semantic representation of a discourse unit can change with its position and the topic by Gated Linear Unit (GLU). Moreover, we analyze the results of our models in nuclearity and relation recognition and explore the potential relationship between them. Conducted experiments show the effectiveness of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Carlson, L., Marcu, D., Okurowski, M.E.: Building a discourse-tagged corpus in the framework of rhetorical structure theory. In: van Kuppevelt, J., Smith, R.W. (eds.) Current and New Directions in Discourse and Dialogue. Text, Speech and Language Technology, vol. 22, pp. 85–112. Springer, Dordrecht (2003). https://doi.org/10.1007/978-94-010-0019-2_5
Chu, X., Jiang, F., Zhou, Y., Zhou, G., Zhu, Q.: Joint modeling of structure identification and nuclearity recognition in macro Chinese discourse treebank. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 536–546 (2018)
Chu, X., Zhu, Q., Zhou, G.: Discourse primary-secondary relationships in natural language processing. Jisuanji Xuebao/Chin. J. Comput. 40, 842–860 (2017). https://doi.org/10.11897/SP.J.1016.2017.00842
Dauphin, Y.N., Fan, A., Auli, M., Grangier, D.: Language modeling with gated convolutional networks. In: Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 933–941. JMLR. org (2017)
Feng, V.W., Hirst, G.: A linear-time bottom-up discourse parser with constraints and post-editing. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 511–521 (2014)
Hernault, H., Prendinger, H., Ishizuka, M., et al.: HILDA: a discourse parser using support vector machine classification. Dialogue Discourse 1(3), 1–33 (2010)
Jia, Y., Ye, Y., Feng, Y., Lai, Y., Yan, R., Zhao, D.: Modeling discourse cohesion for discourse parsing via memory network. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Short Papers, vol. 2, pp. 438–443 (2018)
Jiang, F., Chu, X., Xu, S., Li, P., Zhu, Q.: A macro discourse primary and secondary relation recognition method based on topic similarity. J. Chin. Inf. Process. 32(1), 43–50 (2018)
Jiang, F., Xu, S., Chu, X., Li, P., Zhu, Q., Zhou, G.: MCDTB: a macro-level Chinese discourse treebank. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 3493–3504 (2018)
Joty, S., Carenini, G., Ng, R.: A novel discriminative framework for sentence-level discourse analysis. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 904–915 (2012)
Joty, S., Carenini, G., Ng, R., Mehdad, Y.: Combining intra-and multi-sentential rhetorical parsing for document-level discourse analysis. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 486–496 (2013)
Kong, F., Zhou, G.: A cdt-styled end-to-end Chinese discourse parser. ACM Trans. Asian Low Resour. Lang. Inf. Process. (TALLIP) 16(4), 26 (2017)
Li, Y., Kong, F., Zhou, G., et al.: Building Chinese discourse corpus with connective-driven dependency tree structure. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 2105–2114 (2014)
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: a theory of text organization. University of Southern California, Information Sciences Institute (1987)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Rocktäschel, T., Grefenstette, E., Hermann, K.M., Kočiskỳ, T., Blunsom, P.: Reasoning about entailment with neural attention. arXiv preprint: arXiv:1509.06664 (2015)
Sporleder, C., Lascarides, A.: Combining hierarchical clustering and machine learning to predict high-level discourse structure. In: Proceedings of the 20th International Conference on Computational Linguistics, p. 43. Association for Computational Linguistics (2004)
Sun, C., Kong, F.: A transition-based framework for Chinese discourse structure parsing. J. Chin. Inf. Process. 32(12), 48 (2018)
Van Dijk, T.A.: Narrative macro-structures. PTL J. Descr. Poet. Theory Lit. 1, 547–568 (1976)
Wang, Y., Li, S., Wang, H.: A two-stage parsing method for text-level discourse analysis. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Short Papers, vol. 2, pp. 184–188 (2017)
Zhou, Y., Chu, X., Zhu, Q., Jiang, F., Li, P.: Macro discourse relation classification based on macro semantics representation. J. Chin. Inf. Process. 33(3), 1–7 (2019)
Acknowledgments
The authors would like to thank three anonymous reviewers for their comments on this paper. This research was supported by the National Natural Science Foundation of China under Grant Nos. 61836007, 61773276 and 61472354.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, F., Li, P., Zhu, Q. (2019). Joint Modeling of Recognizing Macro Chinese Discourse Nuclearity and Relation Based on Structure and Topic Gated Semantic Network. In: Tang, J., Kan, MY., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2019. Lecture Notes in Computer Science(), vol 11839. Springer, Cham. https://doi.org/10.1007/978-3-030-32236-6_24
Download citation
DOI: https://doi.org/10.1007/978-3-030-32236-6_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32235-9
Online ISBN: 978-3-030-32236-6
eBook Packages: Computer ScienceComputer Science (R0)