A Derivational Model of Discontinuous Parsing

Nederhof, Mark-Jan; Yli-Jyrä, Anssi

doi:10.1007/978-3-319-53733-7_22

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10168)

Included in the following conference series:

International Conference on Language and Automata Theory and Applications

Abstract

The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituent parsing and dependency parsing. The derivational model is accompanied by an equivalent probabilistic automaton model. The new framework yields a probability distribution over the space of all discontinuous parses, which lends itself to intrinsic evaluation in terms of perplexity, as shown in experiments.
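As a rough illustration of what intrinsic evaluation by perplexity involves, the sketch below computes perplexity from the probabilities a model assigns to held-out parses. It is not the authors' experimental setup: the function name `perplexity`, the sample probabilities, and the choice to normalize by the total number of tokens are all assumptions made for the example.

```python
import math


def perplexity(log2_probs, num_tokens):
    """Return 2 ** (-(sum of log2 probabilities) / num_tokens).

    log2_probs: base-2 log probabilities, one per held-out parse.
    num_tokens: normalizing count (assumed here: total tokens in the held-out set).
    """
    if num_tokens <= 0:
        raise ValueError("num_tokens must be positive")
    return 2.0 ** (-sum(log2_probs) / num_tokens)


# Hypothetical probabilities assigned by some model to three held-out parses.
parse_probs = [0.25, 0.125, 0.5]
log2_probs = [math.log2(p) for p in parse_probs]

# Assume the three sentences contain six tokens in total.
ppl = perplexity(log2_probs, num_tokens=6)
print(ppl)
```

Lower perplexity means the model assigns higher probability to the held-out parses. Working in log space avoids numerical underflow when the probabilities of many parses are combined.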

The second author has been funded by the Academy of Finland (Academy Research Fellowship; decision no 270354).

Author information

Authors and Affiliations

School of Computer Science, University of St Andrews, St Andrews, UK
Mark-Jan Nederhof
Department of Modern Languages, University of Helsinki, Helsinki, Finland
Anssi Yli-Jyrä

Corresponding author

Correspondence to Mark-Jan Nederhof.

Editor information

Editors and Affiliations

Umeå University, Umeå, Sweden
Frank Drewes
Rovira i Virgili University, Tarragona, Spain
Carlos Martín-Vide
University of Giessen, Giessen, Germany
Bianca Truthe

About this paper

Cite this paper

Nederhof, MJ., Yli-Jyrä, A. (2017). A Derivational Model of Discontinuous Parsing. In: Drewes, F., Martín-Vide, C., Truthe, B. (eds) Language and Automata Theory and Applications. LATA 2017. Lecture Notes in Computer Science, vol 10168. Springer, Cham. https://doi.org/10.1007/978-3-319-53733-7_22

DOI: https://doi.org/10.1007/978-3-319-53733-7_22
Published: 16 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53732-0
Online ISBN: 978-3-319-53733-7
eBook Packages: Computer Science, Computer Science (R0)
