Summary
We present two concepts for systems with language identification in the context of multilingual information retrieval dialogs. The first one has an explicit module for language identification. It is based on training a common codebook for all the languages and integrating over the output probabilities of language specific n-gram models trained over the codebook sequences. The system can decide for one language either after a predefined time interval or if the difference between the probabilities of the languages succeeds a certain threshold. This approach allows to recognize languages that the system can not process and give out a prerecorded message in that language. In the second approach, the trained recognizers of the languages to be recognized, the lexicons, and the language models are combined to one multilingual recognizer. Only allowing transitions between the words from one language, each hypothesized word chain contains words from just one language and language identification is an implicit by-product of the speech recognizer. First results for both language identification approaches are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
F. Andry, N. Fraser, S., S. Thornton, and N. Youd. Making DATR Work for Speech: Lexicon Compilation in SUNDIAL. Computational Linguistics, 18(3):245–267, Sept. 1992.
W. Eckert. Gesprochener Mensch-Maschine-Dialog. Berichte aus der Informatik. Shaker Verlag, Aachen, 1996.
W. Eckert, T. Kuhn, H. Niemann, S. Rieck, A. Scheuer, and E. G. Schukat-Talamazzini. A Spoken Dialogue System for German Intercity Train Timetable Inquiries. In Proc. European Conf. on Speech Communication and Technology, pages 1871–1874, Berlin, 1993.
R. Evans and G. G. (eds.). The DATR Papers: February 1990. Technical report, Cognitive Science Research Paper CSRP 139, University of Sussex, Brighton, 1990.
C. H. J. Godfrey and G. Doddington. The ATIS Spoken Language Systems Pilot Corpus. In Speech and Natural Language Workshop, pages 96–101. Morgan Kaufmann, Hidden Valley, Pennsylvania, 1990.
G. Hanrieder. Inkrementelles Parsing gesprochener Sprache mit einer linksassoziativen Unifikationsgrammatik. PhD thesis, Universität Erlangen-Nörnberg, 1996.
T. Kuhn. Die Erkennungsphase in einem Dialogsystem volume 80 of Dissertationen zur Könstlichen Intelligenz. Infix, St. Augustin, 1995.
B. Lowerre and D. Reddy. The Harpy Speech Understanding System. In W. Lea, editor, Trends in Speech Recognition, pages 340–360. Prentice-Hall Inc., Englewood Cliffs, New Jersey, 1980.
Y. K. Muthusamy, E. Barnard, and R. A. Cole. Reviewing automatic language identification. IEEE SIGNAL PROCESSING MAGAZINE, pages 33–41, Oktober 1994.
E. Schukat-Talamazzini, T. Kuhn, and H. Niemann. Speech Recognition for Spoken Dialogue Systems. In H. Niemann, R. De Mori, and G. Hanrieder, editors, Progress and Prospects of Speech Research and Technology: Proc. of the CRIM/FORWISS Workshop, PAI1, pages 110–120, Sankt Augustin, 1994. Infix.
E. G. Schukat-Talamazzini. Automatische Spracherkennung-Grundlagen, statistische Modelle und effiziente Algorithmen. Vieweg, Braunschweig, 1995.
V. Warnke. Landessprachenklassifikation. Studienarbeit, Lehrstuhl für Mustererkennung (Informatik 5), Universität Erlangen-Nürnberg, 1995.
M. Zissman. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. on Acoustics, Speech and Signal Processing, 4: 31–44, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Nöth, E., Harbeck, S., Niemann, H. (1999). Multilingual Speech Recognition. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-60087-6_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive