Skip to main content

A Derivational Model of Discontinuous Parsing

  • Conference paper
  • First Online:
Language and Automata Theory and Applications (LATA 2017)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10168))

  • 877 Accesses

Abstract

The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituent parsing and dependency parsing. The derivational model is accompanied by an equivalent probabilistic automaton model. By the new framework, one obtains a probability distribution over the space of all discontinuous parses. This lends itself to intrinsic evaluation in terms of perplexity, as shown in experiments.

The second author has been funded by the Academy of Finland (Academy Research Fellowship; decision no 270354).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alshawi, H.: Head automata and bilingual tiling: translation with minimal representations. In: Proceedings of the Conference on 34th Annual Meeting of the Association for Computational Linguistics, Santa Cruz, California, USA, pp. 167–176, June 1996

    Google Scholar 

  2. Boyd, A.: Discontinuity revisited: an improved conversion to context-free representations. In: Proceedings of the Linguistic Annotation Workshop, at ACL 2007, Prague, Czech Republic, pp. 41–44, June 2007

    Google Scholar 

  3. Collins, M.: Head-driven statistical models for natural language parsing. Comput. Linguis. 29(4), 589–637 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  4. Daniels, M., Meurers, W.: Improving the efficiency of parsing with discontinuous constituents. In: Proceedings of NLULP 2002: The 7th International Workshop on Natural Language Understanding and Logic Programming, Datalogiske Skrifter, vol. 92, pp. 49–68, Roskilde Universitetscenter, Copenhagen (2002)

    Google Scholar 

  5. Evang, K., Kallmeyer, L.: PLCFRS parsing of English discontinuous constituents. In: Proceedings of the 12th International Conference on Parsing Technologies, Dublin, Ireland, pp. 104–116, October 2011

    Google Scholar 

  6. Gaifman, H.: Dependency systems and phrase-structure systems. Inf. Control 8, 304–337 (1965)

    Article  MathSciNet  MATH  Google Scholar 

  7. Hays, D.: Dependency theory: a formalism and some observations. Language 40(4), 511–525 (1964)

    Article  Google Scholar 

  8. Jelinek, F., Lafferty, J., Mercer, R.: Basic methods of probabilistic context free grammars. In: Laface, P., De Mori, R. (eds.) Speech Recognition and Understanding – Recent Advances, Trends and Applications, pp. 345–360. Springer, Heidelberg (1992)

    Chapter  Google Scholar 

  9. Kahane, S., Nasr, A., Rambow, O.: Pseudo-projectivity, a polynomially parsable non-projective dependency grammar. In: 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, vol. 1, Montreal, Quebec, Canada, pp. 646–652, August 1998

    Google Scholar 

  10. Kallmeyer, K., Kuhlmann, M.: A formal model for plausible dependencies in lexicalized tree adjoining grammar. In: Eleventh International Workshop on Tree Adjoining Grammar and Related Formalisms, pp. 108–116 (2012)

    Google Scholar 

  11. Kathol, A., Pollard, C.: Extraposition via complex domain formation. In: Proceedings of the Conference on 33rd Annual Meeting of the Association for Computational Linguistics, Cambridge, Massachusetts, USA , pp. 174–180, June 1995

    Google Scholar 

  12. Kuhlmann, M.: Mildly non-projective dependency grammar. Comput. Linguis. 39(2), 355–387 (2013)

    Article  MathSciNet  Google Scholar 

  13. Maier, W.: Discontinuous incremental shift-reduce parsing. In: 53rd Annual Meeting of the Association for Computational Linguistics and 7th International Joint Conference on Natural Language Processing, vol. 1, Beijing, pp. 1202–1212, July 2015

    Google Scholar 

  14. McDonald, R., Pereira, F.: Online learning of approximate dependency parsing algorithms. In: Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, pp. 81–88 (2006)

    Google Scholar 

  15. Nederhof, M.J., McCaffery, M.: Deterministic parsing using PCFGs. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden, pp. 338–347 (2014)

    Google Scholar 

  16. Nederhof, M.J., Satta, G.: An alternative method of training probabilistic LR parsers. In: Proceedings of the Conference on 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, pp. 551–558, July 2004

    Google Scholar 

  17. Nederhof, M.J., Vogler, H.: Hybrid grammars for discontinuous parsing. In: The 25th International Conference on Computational Linguistics: Technical papers, Dublin, Ireland, pp. 1370–1381, August 2014

    Google Scholar 

  18. Nivre, J.: Non-projective dependency parsing in expected linear time. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Suntec, Singapore, pp. 351–359, August 2009

    Google Scholar 

  19. Nivre, J.: Towards a universal grammar for natural language processing. In: Gelbukh, A. (ed.) CICLing 2015. LNCS, vol. 9041, pp. 3–16. Springer, Heidelberg (2015). doi:10.1007/978-3-319-18111-0_1

    Google Scholar 

  20. Nivre, J., Nilsson, J.: Pseudo-projective dependency parsing. In: Proceedings of the Conference on 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, Michigan, pp. 99–106, June 2005

    Google Scholar 

  21. Petrov, S., Barrett, L., Thibaux, R., Klein, D.: Learning accurate, compact, and interpretable tree annotation. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 433–440, July 2006

    Google Scholar 

  22. Reape, M.: A logical treatment of semi-free word order and bounded discontinuous constituency. In: Proceedings of the Conference on Fourth Conference of the European Chapter of the Association for Computational Linguistics, Manchester, England, pp. 103–110, April 1989

    Google Scholar 

  23. Søgaard, A., Haulrich, M.: On the derivation perplexity of treebanks. In: Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories, Tartu, Estonia, pp. 223–232, December 2010

    Google Scholar 

  24. Sornlertlamvanich, V., Inui, K., Tanaka, H., Tokunaga, T., Takezawa, T.: Empirical support for new probabilistic generalized LR parsing. J. Nat. Lang. Process. 6(3), 3–22 (1999)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mark-Jan Nederhof .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Nederhof, MJ., Yli-Jyrä, A. (2017). A Derivational Model of Discontinuous Parsing. In: Drewes, F., Martín-Vide, C., Truthe, B. (eds) Language and Automata Theory and Applications. LATA 2017. Lecture Notes in Computer Science(), vol 10168. Springer, Cham. https://doi.org/10.1007/978-3-319-53733-7_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-53733-7_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53732-0

  • Online ISBN: 978-3-319-53733-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics