Abstract
Dependency trees represent sentences as labeled directed graphs encoding syntactic relations between words. The labels on the arcs represent grammatical relations such as “subject”, “object”, various types of modifiers etc. Dependency trees capture grammatical structures that are easy to interpret and can be useful in several language processing tasks such as information extraction (Culotta and Sorensen, 2004), knowledge acquisition (Ciaramita et al., 2005), machine translation (Ding and Palmer, 2005) and information retrieval (Surdeanu et al., 2008). Dependency treebanks are becoming available in many languages. Several approaches to dependency parsing on multiple languages have been evaluated in the CoNLL 2006 and 2007 shared tasks (Buchholz and Marsi, 2006; Nivre et al., 2007), and in conjunction with semantic role labeling as a joint learning problem in the CoNLL 2008 shared task (Surdeanu et al., 2008).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The figure also contains entity annotations which will be explained below in Section 6.4.1.
- 2.
Available from http://desr.sourceforge.net
- 3.
By contrast, the version of the Penn Treebank used for the CoNLL 2007 shared task includes also non-projective representations.
- 4.
BBN Pronoun Coreference and Entity Type Corpus, 2005. Linguistic Data Consortium (LDC) catalog number LDC2005T33.
- 5.
BBN Corpus documentation.
- 6.
The full label for “ORG” is “ORG:Corporation”, and “WOA” stands for “WorkOfArt:Other”.
- 7.
The script is available from http://w3.msi.vxu.se/%7enivre/research/Penn2Malt.html
- 8.
- 9.
Tree Tagger is available from http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/
- 10.
The 1st-order parser takes 7 s (user time) to process Section 23.
- 11.
Available from sourceforge.net
References
Attardi, G. (2006). Experiments with a multilanguage non-projective dependency parser. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 166–170.
Buchholz, S. and E. Marsi (2006). Introduction to CoNLL-X shared task on multilingual dependency parsing. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 149–164.
Carreras, X. and L. Màrquez (2005). Introduction to the CoNLL-2005 shared task: semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 152–154.
Carreras, X., M. Surdeanu, and L. Màrquez (2006). Projective dependency parsing with perceptron. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp 181–185.
Chapelle, O. (2007). Training a support vector machine in the primal. Neural Computation 19, Cambridge, MA, MIT Press, 1155–1178.
Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. In Proceedings of the 14th National Conference on Artificial Intelligence AAAI, Providence, RI, pp. 598–603.
Charniak, E. and M. Johnson (2005). Coarse-to-fine n-best parsing and maxent discriminative reranking. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbar, MI, pp. 173–180.
Ciaramita, M. and Y. Altun (2006). Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 594–602.
Ciaramita, M., A. Gangemi, E. Ratsch, J. Šarić, and I. Rojas (2005). Unsupervised learning of semantic relations between concepts of a molecular biology ontology. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, pp. 659–664.
Ciaramita, M. and M. Johnson (2003). Supersense tagging of unknown nouns in wordnet. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, Sapporo, pp. 168–175.
Collins, M. (1999). Head-driven statistical models for natural language parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.
Collins, M. (2000). Discriminative reranking for natural language parsing. In Proceedings of the 17th International Conference on Machine Learning, Stanford, CA, pp. 175–182.
Collins, M. (2002). Discriminative training methods for Hidden Markov Models: theory and experiments with perceptron algorithms. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, pp. 1–8.
Collins, M. and T. Koo (2005). Hidden-variable models for discriminative reranking. In Proceedings of the 2005 Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 507–514.
Collins, M. and B. Roark (2004). Incremental parsing with the perceptron algorithm. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, pp. 111–118.
Crammer, K. and Y. Singer (2003). Ultraconservative online algorithms for multiclass problems. Journal of Machine Learning Research, Cambridge, MA, MIT Press, pp. 951–991.
Culotta, A. and J. Sorensen (2004). Dependency tree kernels for relation extraction. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona pp. 728–736..
Ding, Y. and M. Palmer (2005). Machine translation using probabilistic synchronous dependency insertion grammars. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, MI, pp. 541–548.
Eisner, J. (2000). Bilexical grammars and their cubic-time parsing algorithms. In H. Bunt and A. Nijholt (Eds.), New Developments in Natural Language Parsing. Kluwer Academic Publishers, pp. 29–62.
Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.
Hall, J., J. Nivre, and J. Nilsson (2006). Discriminative classifiers for deterministic dependency parsing. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Sydney, pp. 316–323.
Kalt, T. (2004). Induction of greedy controllers for deterministic treebank parsers. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language, Barcelona, pp. 17–24.
Keerthi, S. and D. DeCoste (2005). A modified finite newton method for fast solution of large scale linear SVMs. Journal of Machine Learning Research 6, Cambridge, MA, MIT Press, pp. 341–361.
Marcus, M., B. Santorini, and M. Marcinkiewicz (1993). Building a large annotated corpus of english: The penn treebank. Computational Linguistics 19(2), 313–330.
McDonald, R. (2006). Discriminative training and spanning tree algorithms for dependency parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.
McDonald, R. and F. Pereira (2006). Online learning of approximate dependency parsing algorithms. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Trente, Italy, pp. 81–88.
McDonald, R., F. Pereira, K. Ribarov, and J. Hajič (2005). Non-projective dependency parsing using spanning tree algorithms. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 523–530.
Minsky, M. and S. Papert (1969). Perceptrons: An Introduction to Computational Geometry. Cambridge, MA: MIT Press.
Moschitti, A. (2006). Efficient convolution kernels for dependency and constituent syntactic trees. In Proceedings of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases , Philadelphia, PA, pp. 318–329.
Nivre, J., J. Hall, S. Kübler, R. McDonald, J. Nilsson, D. S. Riedel, and D. Yuret (2007). The CoNLL 2007 shared task on dependency parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, New York, NY, pp. 915–932.
Nivre, J. and M. Scholz (2004). Deterministic dependency parsing of english text. In Proceedings of COLING 2004, Geneva, pp. 64–70.
Punyakanok, V., D. Roth, and W. Yih (2005). The necessity of syntactic parsing for semantic role labeling. In Proceedings of the International Joint Conference on Artificial Intelligence, Edinburgh, pp. 1117–1123.
Rosenblatt, F. (1958). The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, pp. 386–408.
Sagae, K. and A. Lavie (2006). Parser combination by reparsing. In Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, New York, NY, pp. 129–132.
Sha, F. and F. Pereira (2003). Shallow parsing with conditional random fields. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Edmonton, pp. 134–141.
Shawe-Taylor, J. and N. Cristianini (2004). Kernel Methods for Pattern Analysis. Cambridge, UK: Cambridge University Press.
Surdeanu, M., M. Ciaramita, and H. Zaragoza (2008). Learning to rank answers on large online QA collections. In Proceedings of the 46th Annual Meeting on Association for Computational Linguistics: Human Language Technologies, Columbus, OH, pp. 719–727.
Surdeanu, M., R. Johansson, A. Meyers, L. Màrquez, and J. Nivre (2008). The CoNLL 2008 shared task on joint parsing of syntactic and semantic dependencies. In Proceedings of the 12th Conference on Computational Natural Language Learning, Manchester, pp. 159–177.
Titov, I. and J. Henderson (2007). A latent variable model for generative dependency parsing. In Proceedings of the 10th International Conference on Parsing Technologies, Prague, Czech Republic, pp. 144–155.
Wong, A. and D. Wu (1999). Learning a lightweight deterministic parser. In Proceedings of the 6th European Conference on Speech Communication and Technology, Budapest, Hungary, pp. 2047–2050.
Yamada, H. and Y. Matsumoto (2003). Statistical dependency analysis with support vector machines. In Proceedings of the 8th International Workshop on Parsing Technologies, Nancy, pp. 195–206.
Yi, S. and M. Palmer (2005). The integration of syntactic parsing and semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 237–240.
Zhang, D. and W. Less (2003). Question classification using support vector machines. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval, Toronto, pp. 26–32.
Acknowledgements
The first author would like to thank Thomas Hofmann for useful discussions concerning the issue of higher-order feature representations of Section 6.3.4. We would also like to thank Brian Roark and the editors for useful comments and references to related work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this chapter
Cite this chapter
Ciaramita, M., Attardi, G. (2010). Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information. In: Bunt, H., Merlo, P., Nivre, J. (eds) Trends in Parsing Technology. Text, Speech and Language Technology, vol 43. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9352-3_6
Download citation
DOI: https://doi.org/10.1007/978-90-481-9352-3_6
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9351-6
Online ISBN: 978-90-481-9352-3
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)