Skip to main content

Preliminary Analysis of a Breadth-First Parsing Algorithm: Theoretical and Experimental Results

  • Chapter

Part of the book series: Symbolic Computation ((1064))

Abstract

We will trace a brief history of context-free parsing algorithms and then describe some representation issues. The purpose of this paper is to share our philosophy and experience in adapting a well-known context-free parsing algorithm (Earley’s algorithm [9, 10] and variations thereof [29, 14, 27, 28]) to the parsing of a difficult and wide-ranging corpus of sentences. The sentences were gathered by Malhotra [23] in an experiment which fooled businessmen users into thinking they were interacting with a computer, when they were actually interacting with Malhotra in another room. The sentences are given in Appendix I. The MALHOTRA corpus is considerably more difficult than a second collection given in Appendix II (originally published in [16]). Section 4 compares empirical results obtained from these collections against theoretical predictions.

This research was supported (in part) by the National Institutes of Health Grant No.1 P01 LM 03374-02 from the National Library of Medicine, and by the Defense Advanced Research Projects Agency (DOD) monitored by the Office of Naval Research under Contract No. N00014-75-C-0661.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aho AV, Ullman JD (1972) The Theory of Parsing, Translation, and Compiling. Englewood Cliffs: Prentice-Hall

    Google Scholar 

  2. Bar-Hillel Y, Gaifman C, Shamir E: On Categorial and Phrase Structure Grammars. The Bulletin of the Research Council of Israel, 9F, 1–16

    Google Scholar 

  3. Bresnan J (1981) The Passive in Lexical Theory. Occasional Paper No 7, Center for Cognitive Science, 1980. Also in: Bresnan, J (ed). Cambridge: MIT Press

    Google Scholar 

  4. Burton R (1976) Semantic Grammar: An Engineering Technique for Constructing Natural Language Understanding Systems. BBN Report No 3453

    Google Scholar 

  5. Chomsky N (1980) On Binding. Linguistic Inquiry

    Google Scholar 

  6. Church K (1980) On Memory Limitations in Natural Language Processing. MIT/LCS/TR245 (also available from the Indiana University Linguistics Club)

    Google Scholar 

  7. Church K, Patil R (1983) Coping with Syntactic Ambiguity or How to Put the Block in the Box on the Table. MIT/LCS/TM-216. Also in: American Journal of Computational Linguistics

    Google Scholar 

  8. Dostert B, Thompson F (1971) How Features Resolve Syntactic Ambiguity. In: Minker J, Rosenfeld S (eds ): Proceedings of the Symposium on Information Storage and Retreival

    Google Scholar 

  9. Earley J (1968) An Efficient Context-Free Parsing Algorithm. Unpublished Ph. D. Thesis. Carnegie-Mellon University

    Google Scholar 

  10. Earley J (1970) An Efficient Context-Free Parsing Algorithm. Communications of the ACM 13 (2)

    Google Scholar 

  11. Ford M, Bresnan J, Kaplan R (1981) A Competence-Based Theory of Syntactic Closure. Paper presented at the Sloan Workshop on Parsing Long Distance Dependencies, University of Massachusetts at Amherst, 1981. Also in: Bresnan J (ed). Cambridge: MIT Press

    Google Scholar 

  12. Gazdar G (1981) Unbounded Dependencies and Coordinate Structure. Linguistic Inquiry 12 (2)

    Google Scholar 

  13. Gazdar G: Phrase Structure Grammar. In: Jacobson P, Pullum G (eds): The Nature of Syntactic Representation

    Google Scholar 

  14. Graham S, Harrison M, Ruzzo W (1980) An Improved Context-Free Recognizer. ACM Transactions on Programming Languages and Systems 2 (3), 415–462

    Article  MATH  Google Scholar 

  15. Harris L: Experience with ROBOT in 12 Commercial Natural Language Data Base Query Applications. IJCAI 79, p 365

    Google Scholar 

  16. Hendrix G, Sacerdoti E, Sagalowicz D, Slocum J (1978) Developing a Natural Language Interface to Complex Data. ACM Transactions on Database Systems 3 (2), 105–147

    Article  Google Scholar 

  17. Joos M (1968) The English Verb: Form and Meanings. Madison, Milwaukee, and London: The University of Wiscons in Press

    Google Scholar 

  18. Kaplan R (1972) Augmented Transition Networks as Psychological Models of Sentence Comprehension. Artificial Intelligence 3, 77–100

    Article  MATH  Google Scholar 

  19. Kaplan R (1973) A General Syntacitc Processor. In: Rustin R (ed): Natural Language Processing. New York: Algorithmics Press

    Google Scholar 

  20. Kaplan R, Bresnan J (1981) Lexical-Functional Grammar: A Formal Systen for Grammatical Representation. Occasional Paper, Center for Cognitive Science, 1980. Also in: Bresnan J (ed). Cambridge: MIT Press

    Google Scholar 

  21. Knuth, D (1975) Fundamental Algorithms. In: The Art of Computer Programming, Vol 1. Reading: Addison-Wesley

    Google Scholar 

  22. Kuno, Susumu, Oettinger AG (1963) Multiple Path Syntactic Analyzer. In: Information Processing. Amsterdam: North-Holland

    Google Scholar 

  23. Malhotra A (1975) Design Criteria for a Knowledge-Based English Language System for Management: An Experimental Analysis. MIT/LCS/TR-146

    Google Scholar 

  24. Marcus M (1980) A Theory of Syntactic Recognition for Natural Language. Cambridge: MIT Press

    MATH  Google Scholar 

  25. Mathlab Group (1977) Macsyma Reference Manual. Laboratory for Computer Science, MIT

    Google Scholar 

  26. Milne R (1980) A Framework for Deterministic Parsing Using Syntax and Semantics. DAI Working Paper 64. Department of Artificial Intelligence, University of Edinburgh

    Google Scholar 

  27. Pratt VR (1973) A Linguistics Oriented Programming Language. IJCAI 3

    Google Scholar 

  28. Pratt V (1975) Lingol- A Progress Report. IJCAI 4

    Google Scholar 

  29. Ruzzo WL (1978) General Context-Free Language Recognition. Unpublished Ph. D. Thesis. University of California, Berkeley

    Google Scholar 

  30. Sheil B (1976) Observations on Context-Free Parsing. Statistical Methods in Linguistics, 71–109

    Google Scholar 

  31. Shipman D, Marcus M (1979) Towards Minimal Data Structures for Deterministic Parsing. IJCAI 79

    Google Scholar 

  32. Steele G (1980) The Definition and Implementation of a Computer Programming Language Based on Constraints. MIT, AI-TR-595

    Google Scholar 

  33. Valient L (1975) General Context Free Recognition in Less Than Cubic Time. J. Computer and System Sciences 10, 308–315

    Article  Google Scholar 

  34. Woods W (1970) Transition Network Grammars for Natural Language Analysis. Communications of the ACM 13 (10), 591–606

    Article  MATH  Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1987 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Martin, W.A., Church, K.W., Patil, R.S. (1987). Preliminary Analysis of a Breadth-First Parsing Algorithm: Theoretical and Experimental Results. In: Bolc, L. (eds) Natural Language Parsing Systems. Symbolic Computation. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83030-3_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-83030-3_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-83032-7

  • Online ISBN: 978-3-642-83030-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics