Abstract
We will trace a brief history of context-free parsing algorithms and then describe some representation issues. The purpose of this paper is to share our philosophy and experience in adapting a well-known context-free parsing algorithm (Earley’s algorithm [9, 10] and variations thereof [29, 14, 27, 28]) to the parsing of a difficult and wide-ranging corpus of sentences. The sentences were gathered by Malhotra [23] in an experiment which fooled businessmen users into thinking they were interacting with a computer, when they were actually interacting with Malhotra in another room. The sentences are given in Appendix I. The MALHOTRA corpus is considerably more difficult than a second collection given in Appendix II (originally published in [16]). Section 4 compares empirical results obtained from these collections against theoretical predictions.
This research was supported (in part) by the National Institutes of Health Grant No.1 P01 LM 03374-02 from the National Library of Medicine, and by the Defense Advanced Research Projects Agency (DOD) monitored by the Office of Naval Research under Contract No. N00014-75-C-0661.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aho AV, Ullman JD (1972) The Theory of Parsing, Translation, and Compiling. Englewood Cliffs: Prentice-Hall
Bar-Hillel Y, Gaifman C, Shamir E: On Categorial and Phrase Structure Grammars. The Bulletin of the Research Council of Israel, 9F, 1–16
Bresnan J (1981) The Passive in Lexical Theory. Occasional Paper No 7, Center for Cognitive Science, 1980. Also in: Bresnan, J (ed). Cambridge: MIT Press
Burton R (1976) Semantic Grammar: An Engineering Technique for Constructing Natural Language Understanding Systems. BBN Report No 3453
Chomsky N (1980) On Binding. Linguistic Inquiry
Church K (1980) On Memory Limitations in Natural Language Processing. MIT/LCS/TR245 (also available from the Indiana University Linguistics Club)
Church K, Patil R (1983) Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table. MIT/LCS/TM-216. Also in: American Journal of Computational Linguistics
Dostert B, Thompson F (1971) How Features Resolve Syntactic Ambiguity. In: Minker J, Rosenfeld S (eds ): Proceedings of the Symposium on Information Storage and Retreival
Earley J (1968) An Efficient Context-Free Parsing Algorithm. Unpublished Ph. D. Thesis. Carnegie-Mellon University
Earley J (1970) An Efficient Context-Free Parsing Algorithm. Communications of the ACM 13 (2)
Ford M, Bresnan J, Kaplan R (1981) A Competence-Based Theory of Syntactic Closure. Paper presented at the Sloan Workshop on Parsing Long Distance Dependencies, University of Massachusetts at Amherst, 1981. Also in: Bresnan J (ed). Cambridge: MIT Press
Gazdar G (1981) Unbounded Dependencies and Coordinate Structure. Linguistic Inquiry 12 (2)
Gazdar G: Phrase Structure Grammar. In: Jacobson P, Pullum G (eds): The Nature of Syntactic Representation
Graham S, Harrison M, Ruzzo W (1980) An Improved Context-Free Recognizer. ACM Transactions on Programming Languages and Systems 2 (3), 415–462
Harris L: Experience with ROBOT in 12 Commercial Natural Language Data Base Query Applications. IJCAI 79, p 365
Hendrix G, Sacerdoti E, Sagalowicz D, Slocum J (1978) Developing a Natural Language Interface to Complex Data. ACM Transactions on Database Systems 3 (2), 105–147
Joos M (1968) The English Verb: Form and Meanings. Madison, Milwaukee, and London: The University of Wiscons in Press
Kaplan R (1972) Augmented Transition Networks as Psychological Models of Sentence Comprehension. Artificial Intelligence 3, 77–100
Kaplan R (1973) A General Syntacitc Processor. In: Rustin R (ed): Natural Language Processing. New York: Algorithmics Press
Kaplan R, Bresnan J (1981) Lexical-Functional Grammar: A Formal Systen for Grammatical Representation. Occasional Paper, Center for Cognitive Science, 1980. Also in: Bresnan J (ed). Cambridge: MIT Press
Knuth, D (1975) Fundamental Algorithms. In: The Art of Computer Programming, Vol 1. Reading: Addison-Wesley
Kuno, Susumu, Oettinger AG (1963) Multiple Path Syntactic Analyzer. In: Information Processing. Amsterdam: North-Holland
Malhotra A (1975) Design Criteria for a Knowledge-Based English Language System for Management: An Experimental Analysis. MIT/LCS/TR-146
Marcus M (1980) A Theory of Syntactic Recognition for Natural Language. Cambridge: MIT Press
Mathlab Group (1977) Macsyma Reference Manual. Laboratory for Computer Science, MIT
Milne R (1980) A Framework for Deterministic Parsing Using Syntax and Semantics. DAI Working Paper 64. Department of Artificial Intelligence, University of Edinburgh
Pratt VR (1973) A Linguistics Oriented Programming Language. IJCAI 3
Pratt V (1975) Lingol- A Progress Report. IJCAI 4
Ruzzo WL (1978) General Context-Free Language Recognition. Unpublished Ph. D. Thesis. University of California, Berkeley
Sheil B (1976) Observations on Context-Free Parsing. Statistical Methods in Linguistics, 71–109
Shipman D, Marcus M (1979) Towards Minimal Data Structures for Deterministic Parsing. IJCAI 79
Steele G (1980) The Definition and Implementation of a Computer Programming Language Based on Constraints. MIT, AI-TR-595
Valient L (1975) General Context Free Recognition in Less Than Cubic Time. J. Computer and System Sciences 10, 308–315
Woods W (1970) Transition Network Grammars for Natural Language Analysis. Communications of the ACM 13 (10), 591–606
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1987 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Martin, W.A., Church, K.W., Patil, R.S. (1987). Preliminary Analysis of a Breadth-First Parsing Algorithm: Theoretical and Experimental Results. In: Bolc, L. (eds) Natural Language Parsing Systems. Symbolic Computation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83030-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-83030-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-83032-7
Online ISBN: 978-3-642-83030-3
eBook Packages: Springer Book Archive