Skip to main content

A Joint Learning Approach to Explicit Discourse Parsing via Structured Perceptron

  • Conference paper
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD 2014, CCL 2014)

Abstract

Discourse parsing is a challenging task and plays a critical role in discourse analysis. In this paper, we focus on building an end-to-end PDTB-style explicit discourse parser via structured perceptron by decomposing it into two components, i.e., a connective labeler, which identifies connectives from a text and determines their senses in classifying discourse relationship, and an argument labeler, which identifies corresponding arguments for a given connective. Particularly, to reduce error propagation and incorporate the interaction between the two components, a joint learning approach via structured perceptron is proposed. Evaluation on the PDTB corpus shows that our two-components explicit discourse parser can achieve comparable performance with the state-of-the-art one. It also shows that our joint learning approach can significantly outperform the pipeline ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barzilay, R., Lapata, M.: Modeling local coherence: An entity-based approach. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), pp. 141–148. Association for Computational Linguistics, Ann Arbor (2005), http://www.aclweb.org/anthology/P05-1018

    Google Scholar 

  2. Collins, M.: Discriminative training methods for hidden markov models: Theory and experiments with perceptron algorithms. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 1–8. Association for Computational Linguistics (2002)

    Google Scholar 

  3. Collins, M., Roark, B.: Incremental parsing with the perceptron algorithm. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 111. Association for Computational Linguistics (2004)

    Google Scholar 

  4. Dines, N., Lee, A., Miltsakaki, E., Prasad, R., Joshi, A., Webber, B.: Attribution and the (non-)alignment of syntactic and discourse arguments of connectives. In: Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky, CorpusAnno 2005, pp. 29–36. Association for Computational Linguistics, Stroudsburg (2005), http://dl.acm.org/citation.cfm?id=1608829.1608834

    Chapter  Google Scholar 

  5. Elwell, R., Baldridge, J.: Discourse connective argument identification with connective specific rankers. In: 2008 IEEE International Conference on Semantic Computing, pp. 198–205 (2008)

    Google Scholar 

  6. Ghosh, S.: End-to-End Discourse Parsing with Cascaded Structured Prediction. Ph.D. thesis, University of Trento (2012)

    Google Scholar 

  7. Ghosh, S., Johansson, R., Riccardi, G., Tonelli, S.: Shallow discourse parsing with conditional random fields. In: Proceedings of 5th International Joint Conference on Natural Language Processing, pp. 1071–1079. Asian Federation of Natural Language Processing, Chiang Mai (2011), http://www.aclweb.org/anthology/I11-1120

    Google Scholar 

  8. Huang, L., Fayong, S., Guo, Y.: Structured perceptron with inexact search. In: Proceedings of NAACL 2012 (2012)

    Google Scholar 

  9. Lin, Z., Liu, C., Ng, H.T., Kan, M.Y.: Combining coherence models and machine translation evaluation metrics for summarization evaluation. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), pp. 1006–1014. Association for Computational Linguistics, Jeju Island (2012), http://www.aclweb.org/anthology/P12-1106

    Google Scholar 

  10. Lin, Z., Ng, H.T., Kan, M.Y.: A pdtb-styled end-to-end discourse parser. Technical report, School of Computing. National University of Singapore (2010)

    Google Scholar 

  11. Lin, Z., Ng, H.T., Kan, M.Y.: Automatically evaluating text coherence using discourse relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 997–1006. Association for Computational Linguistics (2011)

    Google Scholar 

  12. Lin, Z., Ng, H.T., Kan, M.Y.: A pdtb-styled end-to-end discourse parser. Natural Language Engineering FirstView, 1–34 (August 2013), http://journals.cambridge.org/article_S1351324912000307

  13. Marcus, M., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The penn treebank. Computational Linguistics 19(2), 313–330 (1993)

    Google Scholar 

  14. Meyer, T., Webber, B.: Implicitation of discourse connectives in (machine) translation. In: Proceedings of the Workshop on Discourse in Machine Translation, pp. 19–26. Association for Computational Linguistics, Sofia (2013), http://www.aclweb.org/anthology/W13-3303

    Google Scholar 

  15. Miltsakaki, E., Prasad, R., Joshi, A., Webber, B.: The penn discourse treebank, pp. 2237–2240 (2004), http://www.lrec-conf.org/proceedings/lrec2004/pdf/618

  16. Ng, J.P., Kan, M.Y., Lin, Z., Feng, W., Chen, B., Su, J., Tan, C.L.: Exploiting discourse analysis for article-wide temporal classification. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 12–23. Association for Computational Linguistics, Seattle (2013), http://www.aclweb.org/anthology/D13-1002

    Google Scholar 

  17. The PDTB Research Group: the Penn Discourse Treebank 2.0 Annotation Manual (December 2007)

    Google Scholar 

  18. Pitler, E., Nenkova, A.: Using syntax to disambiguate explicit discourse connectives in text. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 13–16. Association for Computational Linguistics, Suntec (2009), http://www.aclweb.org/anthology/P/P09/P09-2004

    Chapter  Google Scholar 

  19. Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., Webber, B.: The penn discourse treebank 2.0, pp. 2961–2968 (2008), http://www.lrec-conf.org/proceedings/lrec2008/pdf/754

  20. Prasad, R., Joshi, A., Webber, B.: Exploiting scope for shallow discourse parsing. In: Calzolari, N. (Conference Chair), Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC 2010). European Language Resources Association (ELRA), Valletta (2010)

    Google Scholar 

  21. Webber, B.: D-ltag: Extending lexicalized tag to discourse. Cognitive Science 28(5), 751–779 (2004)

    Article  Google Scholar 

  22. Wellner, B., Pustejovsky, J.: Automatically identifying the arguments of discourse connectives. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 92–101. Association for Computational Linguistics, Prague (2007), http://www.aclweb.org/anthology/D/D07/D07-1010

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Li, S., Kong, F., Zhou, G. (2014). A Joint Learning Approach to Explicit Discourse Parsing via Structured Perceptron. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12277-9_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12276-2

  • Online ISBN: 978-3-319-12277-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics