Multilingual Speech Recognition

Nöth, E.; Harbeck, S.; Niemann, H.

doi:10.1007/978-3-642-60087-6_31

E. Nöth²,
S. Harbeck² &
H. Niemann²

Part of the book series: NATO ASI Series ((NATO ASI F,volume 169))

232 Accesses
2 Citations

Summary

We present two concepts for systems with language identification in the context of multilingual information retrieval dialogs. The first one has an explicit module for language identification. It is based on training a common codebook for all the languages and integrating over the output probabilities of language specific n-gram models trained over the codebook sequences. The system can decide for one language either after a predefined time interval or if the difference between the probabilities of the languages succeeds a certain threshold. This approach allows to recognize languages that the system can not process and give out a prerecorded message in that language. In the second approach, the trained recognizers of the languages to be recognized, the lexicons, and the language models are combined to one multilingual recognizer. Only allowing transitions between the words from one language, each hypothesized word chain contains words from just one language and language identification is an implicit by-product of the speech recognizer. First results for both language identification approaches are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

F. Andry, N. Fraser, S., S. Thornton, and N. Youd. Making DATR Work for Speech: Lexicon Compilation in SUNDIAL. Computational Linguistics, 18(3):245–267, Sept. 1992.
Google Scholar
W. Eckert. Gesprochener Mensch-Maschine-Dialog. Berichte aus der Informatik. Shaker Verlag, Aachen, 1996.
Google Scholar
W. Eckert, T. Kuhn, H. Niemann, S. Rieck, A. Scheuer, and E. G. Schukat-Talamazzini. A Spoken Dialogue System for German Intercity Train Timetable Inquiries. In Proc. European Conf. on Speech Communication and Technology, pages 1871–1874, Berlin, 1993.
Google Scholar
R. Evans and G. G. (eds.). The DATR Papers: February 1990. Technical report, Cognitive Science Research Paper CSRP 139, University of Sussex, Brighton, 1990.
Google Scholar
C. H. J. Godfrey and G. Doddington. The ATIS Spoken Language Systems Pilot Corpus. In Speech and Natural Language Workshop, pages 96–101. Morgan Kaufmann, Hidden Valley, Pennsylvania, 1990.
Google Scholar
G. Hanrieder. Inkrementelles Parsing gesprochener Sprache mit einer linksassoziativen Unifikationsgrammatik. PhD thesis, Universität Erlangen-Nörnberg, 1996.
Google Scholar
T. Kuhn. Die Erkennungsphase in einem Dialogsystem volume 80 of Dissertationen zur Könstlichen Intelligenz. Infix, St. Augustin, 1995.
Google Scholar
B. Lowerre and D. Reddy. The Harpy Speech Understanding System. In W. Lea, editor, Trends in Speech Recognition, pages 340–360. Prentice-Hall Inc., Englewood Cliffs, New Jersey, 1980.
Google Scholar
Y. K. Muthusamy, E. Barnard, and R. A. Cole. Reviewing automatic language identification. IEEE SIGNAL PROCESSING MAGAZINE, pages 33–41, Oktober 1994.
Google Scholar
E. Schukat-Talamazzini, T. Kuhn, and H. Niemann. Speech Recognition for Spoken Dialogue Systems. In H. Niemann, R. De Mori, and G. Hanrieder, editors, Progress and Prospects of Speech Research and Technology: Proc. of the CRIM/FORWISS Workshop, PAI1, pages 110–120, Sankt Augustin, 1994. Infix.
Google Scholar
E. G. Schukat-Talamazzini. Automatische Spracherkennung-Grundlagen, statistische Modelle und effiziente Algorithmen. Vieweg, Braunschweig, 1995.
Google Scholar
V. Warnke. Landessprachenklassifikation. Studienarbeit, Lehrstuhl für Mustererkennung (Informatik 5), Universität Erlangen-Nürnberg, 1995.
Google Scholar
M. Zissman. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. on Acoustics, Speech and Signal Processing, 4: 31–44, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Lehrstuhl für Mustererkennung (Informatik 5), Universität Erlangen—Nürnberg, Martensstr. 3, 91058, Erlangen, Germany
E. Nöth, S. Harbeck & H. Niemann

Authors

E. Nöth
View author publications
You can also search for this author in PubMed Google Scholar
S. Harbeck
View author publications
You can also search for this author in PubMed Google Scholar
H. Niemann
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Speech Research Unit, DERA Malvern, St. Andrew’s Road, WR14 4DT, Great Malvern, Worcs, UK
Keith Ponting

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nöth, E., Harbeck, S., Niemann, H. (1999). Multilingual Speech Recognition. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-60087-6_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics