Abstract
Chinese zero pronoun (ZP) resolution plays an important role in natural language understanding. This paper focuses on improving Chinese ZP resolution from discourse perspective. In particular, various kinds of discourse information are employed in both stages of ZP resolution. During the ZP detection stage, we first propose an elementary discourse unit (EDU) based method to generate ZP candidates from discourse perspective and then exploit relevant discourse context to help better identify ZPs. During the ZP resolution stage, we employ a tree-style discourse rhetorical structure to improve the resolution. Evaluation on OntoNotes shows the significant importance of discourse information to the performance of ZP resolution. To the best of our knowledge, this is the first work to improve Chinese ZP resolution from discourse perspective.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Among them, 325 articles overlap with the “NW” section of the OntoNotes corpus. The following oracle experiments are conducted in this part.
- 2.
If the CA and the ZP are in the same sentence, the value is 0; if they are one sentence apart, the value is 1; and so on.
- 3.
References
Carlson, L., Marcu, D., Okurowski, M.E.: Building a discourse-tagged corpus in the framework of rhetorical structure theory (2001)
Chen, C., Ng, V.: Chinese zero pronoun resolution: a joint unsupervised discourse-aware model rivaling state-of-the-art resolvers. In: Proceedings of ACL 2015, pp. 320–326 (2015)
Chen, C., Ng, V.: Chinese zero pronoun resolution: a unsupervised approach combining ranking and integer linear programming. In: Proceedings of AAAI 2014, pp. 1622–1628 (2014)
Chen, C., Ng, V.: Chinese zero pronoun resolution: some recent advances. In: Proceedings of EMNLP 2013, pp. 1360–1365 (2013)
Chen, C., Ng, V.: Chinese zero pronoun resolution with deep neural networks. In: Proceedings of ACL 2016, pp. 778–788 (2016)
Converse, S.: Pronominal anaphora resolution in Chinese. Ph.D., University of Pennsylvania (2006)
Huang, H.H., Chen, H.H.: An annotation system for development of Chinese discourse corpus. In: Proceedings of COLING 2012, pp. 223–230 (2012)
Huang, H.H., Chen, H.H.: Contingency and comparison relation labeling and structure prediction in Chinese sentences. In: Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 261–269 (2012)
Ji, Y., Eisenstein, J.: Representation learning for text-level discourse parsing. In: Proceedings of ACL 2014, pp. 13–24 (2014)
Joty, S., Carenini, G., Ng, R., Mehdad, Y.: Combining intra- and multi-sentential rhetorical parsing for document-level discourse analysis. In: Proceedings of ACL 2013, pp. 486–496 (2013)
Kong, F., Ng, H.T.: Exploiting zero pronouns to improve Chinese coreference resolution. In: Proceedings of EMNLP 2013, pp. 278–288 (2013)
Kong, F., Ng, H.T., Zhou, G.: A constituent-based approach to argument labeling with joint inference in discourse parsing. In: Proceedings of EMNLP 2014, pp. 68–77 (2014)
Kong, F., Zhou, G.: A clause-level hybrid approach to Chinese empty element recovery. In: Proceedings of IJCAI 2013, pp. 2113–2119 (2013)
Kong, F., Zhou, G.: A tree kernel-based unified framework for Chinese zero anaphora resolution. In: Proceedings of EMNLP 2010, pp. 882–891 (2010)
Kong, F., Zhou, G.: A CDT-styled end-to-end Chinese discourse parser. ACM Trans. Asian Low-Resour. Lang. Inf. Process 16(4), 26:1–26:17 (2017). http://doi.acm.org/10.1145/3099557
Li, C.N., Thompson, S.A.: Third-person pronouns and zero-anaphora in Chinese discourse. Syntax Semant. 12, 311–335 (1979)
Li, W.: Topic chains in Chinese discourse. Discourse Process. 37, 25–45 (2004)
Li, Y., Feng, W., Sun, J., Kong, F., Zhou, G.: Building Chinese discourse corpus with connective-driven dependency tree structure. In: Proceedings of EMNLP 2014, pp. 2105–2114 (2014)
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., Webber, B.: The Penn Discourse TreeBank 2.0. In: Proceedings of LREC 2008, pp. 2961–2968 (2008)
Soon, W.M., Ng, H.T., Lim, D.C.Y.: A machine learning approach to coreference resolution of noun phrases. Comput. Linguist. 27(4), 521–544 (2001)
Xue, N., Xia, F., Chiou, F.D., Palmer, M.: The Penn Chinese TreeBank: phrase structure annotation of a large corpus. Nat. Lang. Eng. 11, 207–238 (2005)
Yang, Y., Xue, N.: Chinese comma disambiguation for discourse analysis. In: Proceedings of ACL 2012, pp. 786–794 (2012)
Zhao, S., Ng, H.T.: Identification and resolution of Chinese zero pronouns: a machine learning approach. In: Proceedings of EMNLP-CoNLL 2007, pp. 541–550 (2007)
Zhou, Y., Xue, N.: The Chinese Discourse TreeBank: a Chinese corpus annotated with discourse relations. Lang. Resour. Eval. 49(2), 397–431 (2015)
Acknowledgements
This work is supported by Project 61472264, 61673290 and 61502149 under the National Natural Science Foundation of China, Key Project 61333018 under the National Natural Science Foundation of China.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Cheng, S., Fang, K., Guodong, Z. (2018). Towards Better Chinese Zero Pronoun Resolution from Discourse Perspective. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-73618-1_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)