Skip to main content

Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information

  • Chapter
  • First Online:
  • 571 Accesses

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 43))

Abstract

Dependency trees represent sentences as labeled directed graphs encoding syntactic relations between words. The labels on the arcs represent grammatical relations such as “subject”, “object”, various types of modifiers etc. Dependency trees capture grammatical structures that are easy to interpret and can be useful in several language processing tasks such as information extraction (Culotta and Sorensen, 2004), knowledge acquisition (Ciaramita et al., 2005), machine translation (Ding and Palmer, 2005) and information retrieval (Surdeanu et al., 2008). Dependency treebanks are becoming available in many languages. Several approaches to dependency parsing on multiple languages have been evaluated in the CoNLL 2006 and 2007 shared tasks (Buchholz and Marsi, 2006; Nivre et al., 2007), and in conjunction with semantic role labeling as a joint learning problem in the CoNLL 2008 shared task (Surdeanu et al., 2008).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The figure also contains entity annotations which will be explained below in Section 6.4.1.

  2. 2.

    Available from http://desr.sourceforge.net

  3. 3.

    By contrast, the version of the Penn Treebank used for the CoNLL 2007 shared task includes also non-projective representations.

  4. 4.

    BBN Pronoun Coreference and Entity Type Corpus, 2005. Linguistic Data Consortium (LDC) catalog number LDC2005T33.

  5. 5.

    BBN Corpus documentation.

  6. 6.

    The full label for “ORG” is “ORG:Corporation”, and “WOA” stands for “WorkOfArt:Other”.

  7. 7.

    The script is available from http://w3.msi.vxu.se/%7enivre/research/Penn2Malt.html

  8. 8.

    http://wordnet.princeton.edu

  9. 9.

    Tree Tagger is available from http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/

  10. 10.

    The 1st-order parser takes 7 s (user time) to process Section 23.

  11. 11.

    Available from sourceforge.net

References

  • Attardi, G. (2006). Experiments with a multilanguage non-projective dependency parser. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 166–170.

    Google Scholar 

  • Buchholz, S. and E. Marsi (2006). Introduction to CoNLL-X shared task on multilingual dependency parsing. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 149–164.

    Google Scholar 

  • Carreras, X. and L. Màrquez (2005). Introduction to the CoNLL-2005 shared task: semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 152–154.

    Google Scholar 

  • Carreras, X., M. Surdeanu, and L. Màrquez (2006). Projective dependency parsing with perceptron. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp 181–185.

    Google Scholar 

  • Chapelle, O. (2007). Training a support vector machine in the primal. Neural Computation 19, Cambridge, MA, MIT Press, 1155–1178.

    Article  Google Scholar 

  • Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. In Proceedings of the 14th National Conference on Artificial Intelligence AAAI, Providence, RI, pp. 598–603.

    Google Scholar 

  • Charniak, E. and M. Johnson (2005). Coarse-to-fine n-best parsing and maxent discriminative reranking. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbar, MI, pp. 173–180.

    Google Scholar 

  • Ciaramita, M. and Y. Altun (2006). Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 594–602.

    Google Scholar 

  • Ciaramita, M., A. Gangemi, E. Ratsch, J. Šarić, and I. Rojas (2005). Unsupervised learning of semantic relations between concepts of a molecular biology ontology. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, pp. 659–664.

    Google Scholar 

  • Ciaramita, M. and M. Johnson (2003). Supersense tagging of unknown nouns in wordnet. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, Sapporo, pp. 168–175.

    Google Scholar 

  • Collins, M. (1999). Head-driven statistical models for natural language parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.

    Google Scholar 

  • Collins, M. (2000). Discriminative reranking for natural language parsing. In Proceedings of the 17th International Conference on Machine Learning, Stanford, CA, pp. 175–182.

    Google Scholar 

  • Collins, M. (2002). Discriminative training methods for Hidden Markov Models: theory and experiments with perceptron algorithms. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, pp. 1–8.

    Google Scholar 

  • Collins, M. and T. Koo (2005). Hidden-variable models for discriminative reranking. In Proceedings of the 2005 Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 507–514.

    Google Scholar 

  • Collins, M. and B. Roark (2004). Incremental parsing with the perceptron algorithm. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, pp. 111–118.

    Google Scholar 

  • Crammer, K. and Y. Singer (2003). Ultraconservative online algorithms for multiclass problems. Journal of Machine Learning Research, Cambridge, MA, MIT Press, pp. 951–991.

    Google Scholar 

  • Culotta, A. and J. Sorensen (2004). Dependency tree kernels for relation extraction. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona pp. 728–736..

    Google Scholar 

  • Ding, Y. and M. Palmer (2005). Machine translation using probabilistic synchronous dependency insertion grammars. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, MI, pp. 541–548.

    Google Scholar 

  • Eisner, J. (2000). Bilexical grammars and their cubic-time parsing algorithms. In H. Bunt and A. Nijholt (Eds.), New Developments in Natural Language Parsing. Kluwer Academic Publishers, pp. 29–62.

    Google Scholar 

  • Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.

    Google Scholar 

  • Hall, J., J. Nivre, and J. Nilsson (2006). Discriminative classifiers for deterministic dependency parsing. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Sydney, pp. 316–323.

    Google Scholar 

  • Kalt, T. (2004). Induction of greedy controllers for deterministic treebank parsers. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language, Barcelona, pp. 17–24.

    Google Scholar 

  • Keerthi, S. and D. DeCoste (2005). A modified finite newton method for fast solution of large scale linear SVMs. Journal of Machine Learning Research 6, Cambridge, MA, MIT Press, pp. 341–361.

    Google Scholar 

  • Marcus, M., B. Santorini, and M. Marcinkiewicz (1993). Building a large annotated corpus of english: The penn treebank. Computational Linguistics 19(2), 313–330.

    Google Scholar 

  • McDonald, R. (2006). Discriminative training and spanning tree algorithms for dependency parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.

    Google Scholar 

  • McDonald, R. and F. Pereira (2006). Online learning of approximate dependency parsing algorithms. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Trente, Italy, pp. 81–88.

    Google Scholar 

  • McDonald, R., F. Pereira, K. Ribarov, and J. Hajič (2005). Non-projective dependency parsing using spanning tree algorithms. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 523–530.

    Google Scholar 

  • Minsky, M. and S. Papert (1969). Perceptrons: An Introduction to Computational Geometry. Cambridge, MA: MIT Press.

    Google Scholar 

  • Moschitti, A. (2006). Efficient convolution kernels for dependency and constituent syntactic trees. In Proceedings of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases , Philadelphia, PA, pp. 318–329.

    Google Scholar 

  • Nivre, J., J. Hall, S. Kübler, R. McDonald, J. Nilsson, D. S. Riedel, and D. Yuret (2007). The CoNLL 2007 shared task on dependency parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, New York, NY, pp. 915–932.

    Google Scholar 

  • Nivre, J. and M. Scholz (2004). Deterministic dependency parsing of english text. In Proceedings of COLING 2004, Geneva, pp. 64–70.

    Google Scholar 

  • Punyakanok, V., D. Roth, and W. Yih (2005). The necessity of syntactic parsing for semantic role labeling. In Proceedings of the International Joint Conference on Artificial Intelligence, Edinburgh, pp. 1117–1123.

    Google Scholar 

  • Rosenblatt, F. (1958). The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, pp. 386–408.

    Google Scholar 

  • Sagae, K. and A. Lavie (2006). Parser combination by reparsing. In Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, New York, NY, pp. 129–132.

    Google Scholar 

  • Sha, F. and F. Pereira (2003). Shallow parsing with conditional random fields. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Edmonton, pp. 134–141.

    Google Scholar 

  • Shawe-Taylor, J. and N. Cristianini (2004). Kernel Methods for Pattern Analysis. Cambridge, UK: Cambridge University Press.

    Book  Google Scholar 

  • Surdeanu, M., M. Ciaramita, and H. Zaragoza (2008). Learning to rank answers on large online QA collections. In Proceedings of the 46th Annual Meeting on Association for Computational Linguistics: Human Language Technologies, Columbus, OH, pp. 719–727.

    Google Scholar 

  • Surdeanu, M., R. Johansson, A. Meyers, L. Màrquez, and J. Nivre (2008). The CoNLL 2008 shared task on joint parsing of syntactic and semantic dependencies. In Proceedings of the 12th Conference on Computational Natural Language Learning, Manchester, pp. 159–177.

    Google Scholar 

  • Titov, I. and J. Henderson (2007). A latent variable model for generative dependency parsing. In Proceedings of the 10th International Conference on Parsing Technologies, Prague, Czech Republic, pp. 144–155.

    Google Scholar 

  • Wong, A. and D. Wu (1999). Learning a lightweight deterministic parser. In Proceedings of the 6th European Conference on Speech Communication and Technology, Budapest, Hungary, pp. 2047–2050.

    Google Scholar 

  • Yamada, H. and Y. Matsumoto (2003). Statistical dependency analysis with support vector machines. In Proceedings of the 8th International Workshop on Parsing Technologies, Nancy, pp. 195–206.

    Google Scholar 

  • Yi, S. and M. Palmer (2005). The integration of syntactic parsing and semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 237–240.

    Google Scholar 

  • Zhang, D. and W. Less (2003). Question classification using support vector machines. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval, Toronto, pp. 26–32.

    Google Scholar 

Download references

Acknowledgements

The first author would like to thank Thomas Hofmann for useful discussions concerning the issue of higher-order feature representations of Section 6.3.4. We would also like to thank Brian Roark and the editors for useful comments and references to related work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Massimiliano Ciaramita .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media B.V.

About this chapter

Cite this chapter

Ciaramita, M., Attardi, G. (2010). Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information. In: Bunt, H., Merlo, P., Nivre, J. (eds) Trends in Parsing Technology. Text, Speech and Language Technology, vol 43. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9352-3_6

Download citation

Publish with us

Policies and ethics