Speech Dialogue Systems and Natural Language Processing

  • Holger Quast
  • Robert Bosch
Part of the Springer Series in Information Sciences book series (SSINF, volume 35)

Abstract

If language is the dress of thought as Johnson claims, speech is its genuine overcoat. Traditionally, speech and language have been studied separately, the former as an acoustic phenomenon, the latter as representation of meaning, symbolized by words. Slow but steady progress in the field of speech recognition together with increasing computational power made it possible to use our natural verbal communication channel as front end for speech dialogue systems, and moreover have allowed the first applications of this kind to step out of the laboratories into the real world. During the second half of the nineties, actual products started to appear that were more than mere technological gadgets: dictation programs for word processors, voice dialing for mobile phones, and dialogue engines that carry out tasks such as air travel reservation, banking, and even translation over the phone. While these devices are often still cumbersome to work with, they bear the potential of the most natural human-machine interface. Spoken language dialogue programs provide useful services where human operators would be too expensive, they are an invaluable help for physically incapacitated people, and allow to control devices where hands-free operation is safer and speech input has a higher data throughput than a tactile interface. This chapter introduces some instrumental concepts and techniques from the field of language processing, and its application, natural language speech dialogue systems.

Keywords

Entropy Mercury Beach Trench Nite 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 4.1
    Barbara F. Grimes (ed.): Ethnologue: Languages of the World, 14th ed. (SIL International, 2000)Google Scholar
  2. 4.2
    Derek Bickerton: Language and Human Behavior (University of Washington Press, 1996)Google Scholar
  3. 4.3
    James Allen: Natural Language Understanding, 2nd ed. (Benjamin/Cummings, Redwood, 1995)MATHGoogle Scholar
  4. 4.4
    Stephen Levinson: Pragmatics (Oxford University Press, 1983)Google Scholar
  5. 4.5
    Noam Chomsky: Three models for the description of language. IRE Transactions on Information Theory 2, 113–124 (1956)MATHCrossRefGoogle Scholar
  6. 4.6
    Daniel Jurafsky, James Martin: Speech and Language processing (Prentice-Hall, Upper Saddle River, 2000)Google Scholar
  7. 4.7
    Martin F. Porter: An algorithm for suffix stripping. Program 14 (3), 130–137 (1980)CrossRefGoogle Scholar
  8. 4.8
    Merriam-Webster’s Collegiate Dictionary, 11th ed. (Merriam-Webster, Springfield, 2003)Google Scholar
  9. 4.9
    Henry Kucera, W. Nelson Francis: Computational Analysis of Present Day American English (Brown University Press, Providence, 1967)Google Scholar
  10. 4.10
    Manfred R. Schroeder: Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise (W.H. Freeman, New York, 1991)MATHGoogle Scholar
  11. 4.11
    Eugene Charniak: Statistical Language Learning (MIT Press, Cambridge, 1993)Google Scholar
  12. 4.12
    Lawrence R. Rabiner: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. In: Alex Waibel, Kai-Fu Lee (eds.): Readings in Speech Recognition (Morgan Kaufmann Publishers, San Mateo, 1990)Google Scholar
  13. 4.13
    Eric Brill: A Simple Rule-Based Part Of Speech Tagger. In Proc. 3rd Conference on Applied Natural Language Processing (1992)Google Scholar
  14. 4.14
    Peter F. Brown, Vincent J. Delia Pietra, Peter V. deSouza, Jennifer C. Lai, and Robert L. Mercer: Class-Based n-gram Models of Natural Language. Computational Linguistics 18 (4), 467–479 (1992)Google Scholar
  15. 4.15
    Mitchell P. Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz: Building a large annotated corpus of English: the Penn Treebank. Computational Linguistics, 19 (2) 313–330 (1994)Google Scholar
  16. 4.16
    Christopher D. Manning, Hinrich Schütze: Foundations of Statistical Natural Language Processing (MIT Press, Cambridge, 2000)Google Scholar
  17. 4.17
    Wolfgang Wahlster (ed.): Verbmobil: Foundations of Speech-to-Speech Translation (Springer, Berlin New York Heidelberg, 2000)MATHGoogle Scholar
  18. 4.18
    Alan M. Turing: Computing Machinery and Intelligence. Mind 59, 433–460 (1950)MathSciNetCrossRefGoogle Scholar
  19. 4.19
    Joseph Weizenbaum: Eliza — A Computer Program For the Study of Natural Language Communication Between Man and Machine. Communications of the ACM 9 (1), 36–35 (1966)CrossRefGoogle Scholar
  20. 4.20
    Herbert Paul Grice: Logic and conversation. In: P. Cole, J.L. Morgan (eds.): Syntax and Semantics Vol. 3, Speech Acts (New York, Academic Press, 1975)Google Scholar
  21. 4.21
    Holger Quast, Tobias Scheideck, Petra Geutner, Andreas Korthauer: RoBo-DiMa: A Dialog-Object-Based Natural Language Speech Dialog System. In: Proc. 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE 2003)Google Scholar
  22. 4.22
    Stephanie Seneff and Joseph Polifroni: Dialogue Management in the Mercury Flight Reservation System. In: Proc. ANLP-NAACL (2000).Google Scholar
  23. 4.23
    Stephanie Seneff, Ed Hurley, Raymond Lau, Christine Pao, Philipp Schmid, Victor Zue: Galaxy II: A Reference Architecture for Conversational System Development. In: Proc. 1998 International Conference on Speech and Language Processing (1998)Google Scholar
  24. 4.24
    Frederick Jelinek: Statistical Methods for Speech Recognition (MIT Press, Cambridge, 1997)Google Scholar
  25. 4.25
    Erhard W. Hinrichs, Sandra Kübier, Valia Kordoni und Frank Henrik Müller: Robust Chunk Parsing for Spontaneous Speech. In [4.17]Google Scholar
  26. 4.26
    Aravind K. Joshi, Leon S. Levy, Masako Takahashi: Tree Adjunct Grammars. Journal of Computer and System Sciences 10 (1), 136–163 (1975)MathSciNetMATHCrossRefGoogle Scholar
  27. 4.27
    Yves Schabes, Anne Abeill, Aravind K Joshi: Parsing Strategies with ‘Lexi-calized’ Grammars: Application to Tree Adjoining Grammars. In Proc. 12th International Conference on Computational Linguistics, Budapest (1988)Google Scholar
  28. 4.28
    John R. Searle: Expression and Meaning. Studies in the Theory of Speech Acts. (Cambridge University Press, 1986)Google Scholar
  29. 4.29
    Renato De Mori (ed.): Spoken Dialogues with Computers. (Academic Press, San Diego, 1998)Google Scholar
  30. 4.30
    Egbert Ammicht, Alexandros Potamianos, Eric Fosler-Lussier: Ambiguity Representation and Resolution in Spoken Dialogue Systems. In Proc. Eurospeech (2001)Google Scholar
  31. 4.31
    Oliver Lemon, Alexander Gruenstein, Alexis Battle, Stanley Peters: Multitasking and Collaborative Activities in Dialogue Systems. In Proc. Sigdial (2002)Google Scholar
  32. 4.32
    Lori Lamel, Sophie Rosset, Jean-Luc Gauvain: Considerations in the Design and Evaluation of Spoken Language Dialog Systems. In Proc. ICSLP (2000)Google Scholar
  33. 4.33
    Peter F. Brown, John Cocke, Stephen A. Delia Pietra, Vincent J. Delia Pietra, Frederick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin.: A statistical approach to machine translation. Computational Linguistics 16 (2), 79–85 (1990)Google Scholar
  34. 4.34
    Christel Sorin and Renato De Mori: Sentence Generation. In [4.29]Google Scholar
  35. 4.35
    Terrence Sejnowski, Charles R. Rosenberg: Parallel Networks that Learn to Pronounce English Text. Complex Systems 1, 145–168 (1987)MATHGoogle Scholar
  36. 4.36
    Roger K. Moore: Speculating on the Future for Automatic Speech Recognition. A Survey of Attendees. Collected and presented at the 2003 IEEE Workshop on Automatic Speech Recognition and Understanding.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Holger Quast
    • 1
  • Robert Bosch
    • 2
  1. 1.Drittes Physikalisches InstitutUniversität GöttingenGermany
  2. 2.GmbH Research and DevelopmentGermany

Personalised recommendations