Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information

Ciaramita, Massimiliano; Attardi, Giuseppe

doi:10.1007/978-90-481-9352-3_6

Part of the book series: Text, Speech and Language Technology (TLTB, volume 43)

Abstract

Dependency trees represent sentences as labeled directed graphs that encode syntactic relations between words. The labels on the arcs represent grammatical relations such as “subject”, “object”, and various types of modifiers. Dependency trees capture grammatical structures that are easy to interpret and can be useful in several language processing tasks, such as information extraction (Culotta and Sorensen, 2004), knowledge acquisition (Ciaramita et al., 2005), machine translation (Ding and Palmer, 2005) and information retrieval (Surdeanu et al., 2008). Dependency treebanks are becoming available in many languages. Several approaches to dependency parsing for multiple languages have been evaluated in the CoNLL 2006 and 2007 shared tasks (Buchholz and Marsi, 2006; Nivre et al., 2007), and in conjunction with semantic role labeling as a joint learning problem in the CoNLL 2008 shared task (Surdeanu et al., 2008).
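
To make the graph view in the abstract concrete, the following is a minimal Python sketch of a dependency tree stored as a list of labeled arcs. It is not the chapter's parser or data format: the `Arc` class, the `is_tree` helper, the three-word example sentence and the convention that index 0 is an artificial root are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Arc:
    head: int       # index of the head word (0 is an assumed artificial root)
    dependent: int  # index of the dependent word (1-based)
    label: str      # grammatical relation, e.g. "subject" or "object"


def is_tree(n_words, arcs):
    """Check that the arcs form a tree over words 1..n_words rooted at 0."""
    heads = {}
    for arc in arcs:
        if arc.dependent in heads:
            return False  # a word may have only one head
        heads[arc.dependent] = arc.head
    # Every word must have a head, and no arc may involve an unknown index.
    if set(heads) != set(range(1, n_words + 1)):
        return False
    if any(h < 0 or h > n_words for h in heads.values()):
        return False
    # Following heads upward from any word must reach the root without a cycle.
    for word in range(1, n_words + 1):
        seen = set()
        node = word
        while node != 0:
            if node in seen:
                return False
            seen.add(node)
            node = heads[node]
    return True


# Hypothetical example sentence: "John saw Mary".
words = ["John", "saw", "Mary"]
arcs = [
    Arc(head=2, dependent=1, label="subject"),
    Arc(head=0, dependent=2, label="root"),
    Arc(head=2, dependent=3, label="object"),
]
print(is_tree(len(words), arcs))
```

Storing one head per word keeps the well-formedness check simple: the single-head and reachability conditions are exactly what distinguish a tree from an arbitrary labeled directed graph.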

Notes

1. The figure also contains entity annotations which will be explained below in Section 6.4.1.
2. Available from http://desr.sourceforge.net
3. By contrast, the version of the Penn Treebank used for the CoNLL 2007 shared task also includes non-projective representations.
4. BBN Pronoun Coreference and Entity Type Corpus, 2005. Linguistic Data Consortium (LDC) catalog number LDC2005T33.
5. BBN Corpus documentation.
6. The full label for “ORG” is “ORG:Corporation”, and “WOA” stands for “WorkOfArt:Other”.
7. The script is available from http://w3.msi.vxu.se/%7enivre/research/Penn2Malt.html
8. http://wordnet.princeton.edu
9. Tree Tagger is available from http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/
10. The 1st-order parser takes 7 s (user time) to process Section 23.
11. Available from sourceforge.net

References

Attardi, G. (2006). Experiments with a multilanguage non-projective dependency parser. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 166–170.
Buchholz, S. and E. Marsi (2006). Introduction to CoNLL-X shared task on multilingual dependency parsing. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 149–164.
Carreras, X. and L. Màrquez (2005). Introduction to the CoNLL-2005 shared task: semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 152–154.
Carreras, X., M. Surdeanu, and L. Màrquez (2006). Projective dependency parsing with perceptron. In Proceedings of the 10th Conference on Computational Natural Language Learning, New York, NY, pp. 181–185.
Chapelle, O. (2007). Training a support vector machine in the primal. Neural Computation 19, Cambridge, MA, MIT Press, 1155–1178.
Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. In Proceedings of the 14th National Conference on Artificial Intelligence AAAI, Providence, RI, pp. 598–603.
Charniak, E. and M. Johnson (2005). Coarse-to-fine n-best parsing and maxent discriminative reranking. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, MI, pp. 173–180.
Ciaramita, M. and Y. Altun (2006). Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 594–602.
Ciaramita, M., A. Gangemi, E. Ratsch, J. Šarić, and I. Rojas (2005). Unsupervised learning of semantic relations between concepts of a molecular biology ontology. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, pp. 659–664.
Ciaramita, M. and M. Johnson (2003). Supersense tagging of unknown nouns in WordNet. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, Sapporo, pp. 168–175.
Collins, M. (1999). Head-driven statistical models for natural language parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.
Collins, M. (2000). Discriminative reranking for natural language parsing. In Proceedings of the 17th International Conference on Machine Learning, Stanford, CA, pp. 175–182.
Collins, M. (2002). Discriminative training methods for Hidden Markov Models: theory and experiments with perceptron algorithms. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, pp. 1–8.
Collins, M. and T. Koo (2005). Hidden-variable models for discriminative reranking. In Proceedings of the 2005 Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 507–514.
Collins, M. and B. Roark (2004). Incremental parsing with the perceptron algorithm. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, pp. 111–118.
Crammer, K. and Y. Singer (2003). Ultraconservative online algorithms for multiclass problems. Journal of Machine Learning Research, Cambridge, MA, MIT Press, pp. 951–991.
Culotta, A. and J. Sorensen (2004). Dependency tree kernels for relation extraction. In Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, pp. 728–736.
Ding, Y. and M. Palmer (2005). Machine translation using probabilistic synchronous dependency insertion grammars. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Ann Arbor, MI, pp. 541–548.
Eisner, J. (2000). Bilexical grammars and their cubic-time parsing algorithms. In H. Bunt and A. Nijholt (Eds.), New Developments in Natural Language Parsing. Kluwer Academic Publishers, pp. 29–62.
Fellbaum, C. (1998). WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.
Hall, J., J. Nivre, and J. Nilsson (2006). Discriminative classifiers for deterministic dependency parsing. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, Sydney, pp. 316–323.
Kalt, T. (2004). Induction of greedy controllers for deterministic treebank parsers. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, Barcelona, pp. 17–24.
Keerthi, S. and D. DeCoste (2005). A modified finite Newton method for fast solution of large scale linear SVMs. Journal of Machine Learning Research 6, Cambridge, MA, MIT Press, pp. 341–361.
Marcus, M., B. Santorini, and M. Marcinkiewicz (1993). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330.
McDonald, R. (2006). Discriminative training and spanning tree algorithms for dependency parsing. Ph. D. thesis, University of Pennsylvania, Philadelphia, PA.
McDonald, R. and F. Pereira (2006). Online learning of approximate dependency parsing algorithms. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, pp. 81–88.
McDonald, R., F. Pereira, K. Ribarov, and J. Hajič (2005). Non-projective dependency parsing using spanning tree algorithms. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, pp. 523–530.
Minsky, M. and S. Papert (1969). Perceptrons: An Introduction to Computational Geometry. Cambridge, MA: MIT Press.
Moschitti, A. (2006). Efficient convolution kernels for dependency and constituent syntactic trees. In Proceedings of the 17th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, Philadelphia, PA, pp. 318–329.
Nivre, J., J. Hall, S. Kübler, R. McDonald, J. Nilsson, D. S. Riedel, and D. Yuret (2007). The CoNLL 2007 shared task on dependency parsing. In Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, New York, NY, pp. 915–932.
Nivre, J. and M. Scholz (2004). Deterministic dependency parsing of English text. In Proceedings of COLING 2004, Geneva, pp. 64–70.
Punyakanok, V., D. Roth, and W. Yih (2005). The necessity of syntactic parsing for semantic role labeling. In Proceedings of the International Joint Conference on Artificial Intelligence, Edinburgh, pp. 1117–1123.
Rosenblatt, F. (1958). The perceptron: a probabilistic model for information storage and organization in the brain. Psychological Review, pp. 386–408.
Sagae, K. and A. Lavie (2006). Parser combination by reparsing. In Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, New York, NY, pp. 129–132.
Sha, F. and F. Pereira (2003). Shallow parsing with conditional random fields. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Edmonton, pp. 134–141.
Shawe-Taylor, J. and N. Cristianini (2004). Kernel Methods for Pattern Analysis. Cambridge, UK: Cambridge University Press.
Surdeanu, M., M. Ciaramita, and H. Zaragoza (2008). Learning to rank answers on large online QA collections. In Proceedings of the 46th Annual Meeting on Association for Computational Linguistics: Human Language Technologies, Columbus, OH, pp. 719–727.
Surdeanu, M., R. Johansson, A. Meyers, L. Màrquez, and J. Nivre (2008). The CoNLL 2008 shared task on joint parsing of syntactic and semantic dependencies. In Proceedings of the 12th Conference on Computational Natural Language Learning, Manchester, pp. 159–177.
Titov, I. and J. Henderson (2007). A latent variable model for generative dependency parsing. In Proceedings of the 10th International Conference on Parsing Technologies, Prague, Czech Republic, pp. 144–155.
Wong, A. and D. Wu (1999). Learning a lightweight deterministic parser. In Proceedings of the 6th European Conference on Speech Communication and Technology, Budapest, Hungary, pp. 2047–2050.
Yamada, H. and Y. Matsumoto (2003). Statistical dependency analysis with support vector machines. In Proceedings of the 8th International Workshop on Parsing Technologies, Nancy, pp. 195–206.
Yi, S. and M. Palmer (2005). The integration of syntactic parsing and semantic role labeling. In Proceedings of the 9th Conference on Computational Natural Language Learning, Ann Arbor, MI, pp. 237–240.
Zhang, D. and W. Lee (2003). Question classification using support vector machines. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, pp. 26–32.

Acknowledgements

The first author would like to thank Thomas Hofmann for useful discussions concerning the issue of higher-order feature representations of Section 6.3.4. We would also like to thank Brian Roark and the editors for useful comments and references to related work.

Author information

Authors and Affiliations

Yahoo! Research, S-08018, Barcelona, Catalonia, Spain
Massimiliano Ciaramita
Università di Pisa, I-56127, Pisa, Italy
Giuseppe Attardi

Corresponding author

Correspondence to Massimiliano Ciaramita.

Editor information

Editors and Affiliations

Tilburg University, Warandelaan 2, Tilburg, 5000 LE, Netherlands
Harry Bunt
Dépt. Linguistique, Université de Genève, rue de Candolle 2, Genève, 1211, Switzerland
Paola Merlo
Pimpstensvägen 16, Uppsala, 752 67, Sweden
Joakim Nivre

About this chapter

Cite this chapter

Ciaramita, M., Attardi, G. (2010). Dependency Parsing with Second-Order Feature Maps and Annotated Semantic Information. In: Bunt, H., Merlo, P., Nivre, J. (eds) Trends in Parsing Technology. Text, Speech and Language Technology, vol 43. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9352-3_6

DOI: https://doi.org/10.1007/978-90-481-9352-3_6
Published: 29 September 2010
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-9351-6
Online ISBN: 978-90-481-9352-3
eBook Packages: Humanities, Social Sciences and Law; Social Sciences (R0)
