Abstract
This article describes a novel approach to probabilistic LR-parsing of spontaneously spoken utterances developed in Verbmobil. It extends the use of context knowledge within the probabilistic model of the parser and improves its output by applying tree transformation rules learned from corpora. The parser was developed for German, English and Japanese and achieves more than 90% Labeled Recall/Precision on parsed Verbmobil utterances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aho, A.V., Sethi, R., and Ullman, J.D. (1986) Compilers: Principle, Techniques and Tools. Reading, Mass.: Addison Wesley.
Batliner, A., Block, H.-U., Kießling, A., Kompe, R., Niemann, H., Nöth, E., Ruland, T., and Schachtl, S. (1997). Improving Parsing of Spontaneous Speech With the Help of Prosodic Boundaries. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’97). München, Germany.
Block, H. U. (1993). Compiling Trace and Unification Grammar for Parsing and Generation. In Strzalkowski, T., ed., Reversible Grammar in Natural Language Processing, 155–174, Boston, Dordrecht, London: Kluwer.
Bod, R. (1995). The Problem of Computing the Most Probable Tree in Data-Oriented Parsing and Stochastic Tree Grammars. In Proceedings of the Seventh Conference of the European Chapter of the ACL. Dublin, 104–111.
Brill, E. (1993a). A Corpus-Based Approach to Language Learning. Ph.D. Dissertation, University of Pennsylvania, Department of Computer and Information Science.
Brill, E. (1993b). Automatic Grammar Induction and Parsing Free Text: A Transformation Based Approach. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics. Columbus, Ohio.
Briscoe, T. and Carroll, J. (1993). Generalized Probabilistic LR Parsing of Natural Language (Corpora) With Unification-Based Grammars. Computational Linguistics 19 (1).
Briscoe, T. and Carroll, J. (1996) Apportioning Development Effort in a Probabilistic LR-Parsing System Through Evaluation. In Proceedings of the ACL SIGDAT Conference on Empirical Methods in Natural Language Processing. Philadelphia, PA., 92–100.
Charniak, E. (1993). Statistical Language Learning. Cambridge, Mass.: MIT Press.
Charniak, E. (1997). Statistical Techniques for Natural Language Parsing. AI Magazine.
Collins, M. (1999) Head-Driven Statistical Models for Natural Language Parsing. Ph.D. Dissertation, University of Pennsylvania, Philadelphia.
Good, I.J. (1953) The Population Frequencies of Species and the Estimation of Population Parameters. Biometrika 40(3,4). 237–263.
Hermjakob, U. (1997). Learning Parse and Translation Decisions From Examples With Rich Context. Ph.D. Dissertation, University of Texas, Austin, TX.
Hinrichs, E.W., Kübler, S., Kordoni, V., and Müller, F., (a). Robust Chunk Parsing For Spontaneous Speech. In this volume.
Hinrichs, E.W., Bartels, J., Kawata, Y., S., Kordoni, V., and Telljohann, H., (b). The Tübingen Treebanks for Spoken German, English and Japanese. In this volume.
Inui, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997a). A New Formalization of Probabilistic GLR Parsing. In Proceedings of the International Workshop on Parsing Technologies.
Inui, K., Shirai, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997b). Empirical Evaluation of Probabilistic GLR Parsing. Natural Language Pacific-Rim Symposium.
Kiefer B., Krieger, H.-U., and Nederhof, M.-J. Efficient and Robust HPSG Parsing of Word Graphs. In this volume.
Lavie, A. (1996). GLR*: A Robust Grammar-Focused Parser for Spontaneously Spoken Language. Ph.D. Dissertation, Carnegie Mellon University, Pittsburgh.
Magerman, D. (1994). Natural Language Parsing as Statistical Pattern Recognition. Ph.D. Dissertation, Stanford University, Stanford, CA.
Marcus, M. P. (1980). A Theory of Syntactic Recognition for Natural Language. Cambridge, Mass.: MIT Press.
Nagao, M. (1990). Knowledge and Inference. San Diego: Academic Press.
Ney, H. and Oerder, M. (1993). An Efficient Interface Between Continuous-Speech Recognition and Language Understanding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’93). Minneapolis, MN.
Pinkal, M., Rupp C., and Worm, K. Robust Semantic Processing of Spoken Language. In this volume.
Quinlan, J. R. (1986). Induction of Decision Trees. In: Machine Learning 1 (1). 81–106.
Ruland, T. (1995). Inkrementelles probabilistisches Parsing von Worthypothesengraphen. Diploma Thesis, University of Erlangen-Nürnberg, IMMD 8.
Rupp, C.J., Spilker, J., Klarner, M., and Worm, K. Combining Analyses From Various Parsers. In this volume.
Schiehlen, M. Semantic Construction. In this volume.
Schmid, L. (1994). Parsing Word Graphs Using a Linguistic Grammar and a Statistical Language Model. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’94). Adelaide, Australia.
Tomita, M. (1991). ed., Generalised LR Parsing. Boston: Kiuwer Academic Publishers.
Waibel, A. et al. (1996) Janus-II—Translation of Spontaneous Conversational Speech. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’96). Atlanta, GA.
Weber, H. (1994). LR-inkrementelles, probabilistisches Chartparsing von Worthypothesenmengen mit Unifikationsgrammatiken. Ph.D. Dissertation, University of Hamburg.
Wright, J. H. and Wrigley, E. N. (1991). GLR Parsing With Probability. In Tomita, M.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ruland, T. (2000). Probabilistic LR-Parsing with Symbolic Postprocessing. In: Wahlster, W. (eds) Verbmobil: Foundations of Speech-to-Speech Translation. Artificial Intelligence. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04230-4_12
Download citation
DOI: https://doi.org/10.1007/978-3-662-04230-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-08730-1
Online ISBN: 978-3-662-04230-4
eBook Packages: Springer Book Archive