Performance Evaluation of Supertagging for Partial Parsing

Bangalore, Srinivas

doi:10.1007/978-94-015-9470-7_11

Srinivas Bangalore⁵

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 16))

108 Accesses
1 Citations

Abstract

In previous work we introduced the idea of supertagging as a means of improving the efficiency of a lexicalized grammar parser. In this paper, we present supertagging in conjunction with a lightweight dependency analyzer as a robust and efficient partial parser. The present work is significant for two reasons. First, we have vastly improved our results; 92% accurate for supertag disambiguation using lexical information, larger training corpus and smoothing techniques. Second, we show how supertagging can be used for partial parsing and provide detailed evaluation results for detecting noun chunks, verb chunks, preposition phrase attachment and a variety of other linguistic constructions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abney, S. (1991). Parsing by chunks. In Berwick, R., Abney, S., and Tenny, C., editors, Principle-based parsing. Dordrecht: Kluwer Academic Publishers.
Google Scholar
Bangalore, S and Joshi, A.K. (1999). SuperTagging — An Approach to Almost Parsing. Computational Linguistics, 25:237–265.
Google Scholar
Brill, E. (1993). Automatic grammar induction and parsing free text: A transformation based approach. In Proceedings of the 31 ^st Annual Meeting of the Association for Computational Linguistics, Columbus, Ohio.
Google Scholar
Brill, E. and Resnik, P. (1994). A rule-based approach to prepositional phrase attachment disambiguation. In Proceedings of the International Conference on Computational Linguistics (COLING ‘94), Kyoto, Japan.
Google Scholar
Chen, J and Bangalore, S and Vijay-Shanker, K. (1999). New Models for Improving Supertag Disambiguation. In Proceedings of 9th Conference of the European Chapter of Association for Computational Linguistics, Bergen, Norway, 1999.
Google Scholar
Church, K. W. (1988). A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text. In 2nd Applied Natural Language Processing Conference, Austin, Texas.
Google Scholar
Collins, M. and Brook, J. (1995). Prepositional phrase attachment through a backed-off model. In Proceedings of the Third Workshop on Very Large Corpora, MIT, Cambridge, Boston.
Google Scholar
Doran, C. (1996). Punctuation in Quoted Speech. In Proceedings of the SIG-PARSE 96, workshop on Punctuation, Santa Cruz, California.
Google Scholar
Doran, C, Egedi, D., Hockey, B. A., Srinivas, B., and Zaidel, M. (1994). XTAG System — A Wide Coverage Grammar for English. In Proceedings of the 17 ^th International Conference on Computational Linguistics (COLING ‘94), Kyoto, Japan.
Google Scholar
Good, I. (1953). The population frequencies of species and the estimation of population parameters. Biometrika 40 (3 and 4).
Google Scholar
Gross, M. (1984). Lexicon-Grammar and the Syntactic Analysis of French. In Proceedings of the 10^th International Conference on Computational Linguistics (COLING’84), Stanford, California.
Google Scholar
Harrison, P., Abney, S., Flickinger, D., Gdaniec, C., Grishman, R., Hindle, D., Ingria, B., Marcus, M., Santorini, B., and Strzalkowski, T. (1991). Evaluating syntax performance of parser/grammars of English. In Proceedings of the ACL Workshop on Evaluating Natural Language Processing Systems, ACL
Google Scholar
Hindle, D. and Rooth, M. (1991). Structural ambiguity and lexical relations. In 29th Meeting of the Association for Computational Linguistics, Berkeley, CA.
Google Scholar
Jelinek, F., Lafferty, J., Magerman, D. M., Mercer, R., Ratnaparkhi, A., and Roukos, S. (1994). Decision Tree Parsing using a Hidden Derivation Model. In Proceedings of the ARPA Workshop on Human Language Technology, Plainsborough, NJ.
Google Scholar
Joshi, A. K. (1985). Tree Adjoining Grammars: How much context sensitivity is required to provide a reasonable structural description. In Dowty, D., Karttunen, I., and Zwicky, A., editors, Natural Language Parsing, pp. 206–250. Cambridge University Press, Cambridge, U.K.
Google Scholar
Joshi, A. K. and Srinivas, B. (1994). Disambiguation of Super Parts of Speech (or Supertags): Almost Parsing. In Proceedings of the 17 ^th International Conference on Computational Linguistics (COLING ‘94), Kyoto, Japan.
Google Scholar
Magerman, D. M. (1995). Statistical Decision-Tree Models for Parsing. In Proceedings of the 33 ^rd Annual Meeting of the Association for Computational Linguistics.
Google Scholar
Marcus, M. M., Santorini, B., and Marcinkiewicz, M. A. (1993). Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics, 19.2:313–330.
Google Scholar
Pollard, C. and Sag, I. A. (1987). Information-Based Syntax and Semantics. Vol 1: Fundamentals. CSLI.
Google Scholar
Ramshaw, L. and Marcus, M. P. (1995). Text chunking using transformation-based learning. In Proceedings of the Third Workshop on Very Large Corpora, MIT, Cambridge, Boston.
Google Scholar
Ratnaparkhi, A., Reynar, J., and Roukos, S. (1994). A maximum entropy model for prepositional phrase attachment. In Proceedings of ARPA Workshop on Human Language Technology, Plainsboro, NJ.
Google Scholar
Sampson, G. (1994). SUSANNE: a Doomsday book of English Grammar. In Corpus-based Research into Language. Rodopi, Amsterdam.
Google Scholar
Schabes, Y, Abeillé, A., and Joshi, A. K. (1988). Parsing strategies with ‘lexicalized’ grammars: Application to Tree Adjoining Grammars. In Proceedings of the 12^th International Conference on Computational Linguistics (COLING’88), Budapest, Hungary.
Google Scholar
Schabes, Y. and Shieber, S. (1992). An Alternative Conception of Tree-Adjoining Derivation. In Proceedings of the 20^th Meeting of the Association for Computational Linguistics.
Google Scholar
Sleator, D. and Temperley, D. (1991). Parsing English with a Link Grammar. Technical report CMU-CS-91–196, Department of Computer Science, Carnegie Mellon University.
Google Scholar
Srinivas, B. (1997). Complexity of Lexical Descriptions and its Relevance to Partial Parsing. PhD Dissertation, University of Pennsylvania.
Google Scholar
Steedman, M. (1987). Combinatory Grammars and Parasitic Gaps. Natural Language and Linguistic Theory, 5:403–439.
Article Google Scholar
Weischedel, R., Schwartz, R., Palmucci, J., Meteer, M., and Ramshaw, L. (1993). Coping with ambiguity and unknown words through probabilistic models. Computational Linguistics, 19.2:359–382.
Google Scholar

Download references

Author information

Authors and Affiliations

AT&T Labs - Research, 180 Park Avenue, Florham Park, NJ, 07932, USA
Srinivas Bangalore

Authors

Srinivas Bangalore
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tilburg University, The Netherlands
Harry Bunt
University of Twente, Enschede, The Netherlands
Anton Nijholt

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Bangalore, S. (2000). Performance Evaluation of Supertagging for Partial Parsing. In: Bunt, H., Nijholt, A. (eds) Advances in Probabilistic and Other Parsing Technologies. Text, Speech and Language Technology, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9470-7_11

Download citation

DOI: https://doi.org/10.1007/978-94-015-9470-7_11
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5579-8
Online ISBN: 978-94-015-9470-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics