Abstract
This paper gives an overview of a research effort whose goal is to develop a system which can carry out a dialog concerning a particular task domain using continuous German speech for input and output. The main processing phases are initial segmentation and labeling, finding words, understanding the meaning and giving an answer. Specialized processing modules for handling these four phases were developed or are being developed. The processing modules communicate via a common database.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
K. R. Popper, J. C. Eccles, “Das Ich und sein Gehirn,” R. Piper, München Zürich, P2.15 und E4.31, 1982.
A. Chapanis, “Interactive Human Communication,” Scient. American, 232, No. 3, 36–42, 1975.
W. A. Lea (ed.), Trends in Speech Recognition, Prentice Hall, Englewood Cliffs, N. J., 1980.
D. H. Klatt, “Review of the ARPA Speech Understanding Project,” J. Acoustical Soc. of America, 62, 1345–1366, 1977.
R. De Mori, “Recent Advances in Automatic Speech Recognition,” Proc. 4, Int. Joint Conf. Pattern Recognition, Kyoto, Japan, 106–124, 1978.
R. De Mori (ed.), Special Issue on Speech Understanding, Information Sciences.
L. Bahl, “Recognition of Isolated Word Sentences from a 5000-Word Vocabulary Office Correspondence Task,” Proc. ICASSP 83, Boston Mass., 1065, 1983.
T. Winograd, Language as a Cognitive Process, Vol. 1, Syntax, Addison Wesley, Rading Mass., 1983.
H.-W. Hein, Das Erlanger Spracherkennungssystem, Dissertation Universität Erlangen-Nürnberg, Arbeitsberichte des IMMD Band 15, Nr. 15, Erlangen 1982.
H. Niemann, “The Erlangen System for Recognition and Understanding of Continuous German Speech,” in J. Nehmer (ed.), GI — 12. Jahrestagung, Informatik Fachberichte 57, Springer Berlin, Heidelberg, New York, 330–348, 1982.
P. Regel, “A Module for Acoustic-Phonetic Transcription of Fluently Spoken German Speech,” IEEE Trans. ASSP-30, 440–450, 1982.
P. Regel, Akustisch phonetische Transkription für die automatische Spracherkennung, Dissertation, in preparation.
S. B. Davis, P. Mermelstein, “Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences,” IEEE Trans. ASSP-28, 357–366, 1980.
J. Kittler, K. S. Fu, L. F. Pau, Pattern Recognition Theory and Applications, D. Reidel Publ. Comp., Dordrecht Boston London, 569, 1982.
J. John, R. Mühlfeld, P. Regel, G. Siller, “Vergleich von Klassifikatoren für die Lauterkennung,” to appear in Proc. DAGM-84.
F. W. Kaeding (ed.), Häufigkeitswörterbuch der deutschen Sprache, Steglitz bei Berlin 1897–1898. We are grateful to Dr. Ruske who made available to us a tape with 8000 words.
D. W. Shipman, V. W. Zue, “Summary of Research in Speech Recognition,” Res. Lab. of Electronics, M.I.T., Cambridge Mass., 1982.
L. R. Bahl, F. Jelinek, “Decoding for Channels with Insertions, Deletions, and Substitutions with Applications to Speech Recognition,” IEEE Trans. IT-21, 404–411, 1975.
L. R. Bahl, F. Jelinek, R. L. Mercer, “A Maximum Likelihood Approach to Continuous Speech Recognition,” IEEE Trans. PAMI-5, 179–190, 1983.
A. R. Smith, Word Hypothesization in a Large-Vocabulary Speech Understanding System, Ph.D. Dissertation Dept. Computer Science, Carnegie Mellon University, Pittsburgh, PA, 1977.
H. Niemann, Pattern Analysis, Springer Series in Information Sciences 4, Springer, Berlin Heidelberg New York, 1981.
R. J. Brachman, A Structural Paradigm for Representing Knowledge,
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1985 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Niemann, H., Brietzmann, A., Mühlfeld, R., Regel, P., Schukat, G. (1985). The Speech Understanding and Dialog System Evar. In: De Mori, R., Suen, C.Y. (eds) New Systems and Architectures for Automatic Speech Recognition and Synthesis. NATO ASI Series, vol 16. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-82447-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-82447-0_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-82449-4
Online ISBN: 978-3-642-82447-0
eBook Packages: Springer Book Archive