Abstract
In previous work we introduced the idea of supertagging as a means of improving the efficiency of a lexicalized grammar parser. In this paper, we present supertagging in conjunction with a lightweight dependency analyzer as a robust and efficient partial parser. The present work is significant for two reasons. First, we have vastly improved our results: supertag disambiguation is now 92% accurate, using lexical information, a larger training corpus, and smoothing techniques. Second, we show how supertagging can be used for partial parsing and provide detailed evaluation results for detecting noun chunks, verb chunks, prepositional phrase attachment, and a variety of other linguistic constructions.
Copyright information
© 2000 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Bangalore, S. (2000). Performance Evaluation of Supertagging for Partial Parsing. In: Bunt, H., Nijholt, A. (eds) Advances in Probabilistic and Other Parsing Technologies. Text, Speech and Language Technology, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9470-7_11
DOI: https://doi.org/10.1007/978-94-015-9470-7_11
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5579-8
Online ISBN: 978-94-015-9470-7
eBook Packages: Springer Book Archive