Performance Evaluation of Supertagging for Partial Parsing

  • Chapter
Advances in Probabilistic and Other Parsing Technologies

Part of the book series: Text, Speech and Language Technology (TLTB, volume 16)

Abstract

In previous work we introduced the idea of supertagging as a means of improving the efficiency of a lexicalized grammar parser. In this paper, we present supertagging in conjunction with a lightweight dependency analyzer as a robust and efficient partial parser. The present work is significant for two reasons. First, we have vastly improved our results: supertag disambiguation reaches 92% accuracy using lexical information, a larger training corpus, and smoothing techniques. Second, we show how supertagging can be used for partial parsing and provide detailed evaluation results for detecting noun chunks, verb chunks, prepositional phrase attachment, and a variety of other linguistic constructions.

Copyright information

© 2000 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Bangalore, S. (2000). Performance Evaluation of Supertagging for Partial Parsing. In: Bunt, H., Nijholt, A. (eds) Advances in Probabilistic and Other Parsing Technologies. Text, Speech and Language Technology, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9470-7_11

  • DOI: https://doi.org/10.1007/978-94-015-9470-7_11

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-5579-8

  • Online ISBN: 978-94-015-9470-7

  • eBook Packages: Springer Book Archive
