Part of the book series: NATO ASI Series (NATO ASI F, volume 169)

Summary

We present two concepts for systems with language identification in the context of multilingual information retrieval dialogs. The first has an explicit module for language identification. It trains a common codebook for all languages and accumulates the output probabilities of language-specific n-gram models trained on the codebook sequences. The system commits to one language either after a predefined time interval or as soon as the difference between the language probabilities exceeds a certain threshold. This approach also allows the system to recognize languages that it cannot process and to play a prerecorded message in that language. In the second approach, the trained recognizers, lexicons, and language models of the languages to be recognized are combined into one multilingual recognizer. Because transitions are allowed only between words of the same language, each hypothesized word chain contains words from just one language, and language identification becomes an implicit by-product of speech recognition. First results for both language identification approaches are presented.
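The decision rule of the first approach can be illustrated with a minimal sketch. It is not the authors' implementation: the bigram order, the add-alpha smoothing, and the names `train_bigram`, `identify_language`, `max_frames`, and `threshold` are all assumptions chosen for illustration. The sketch accumulates per-language log-probabilities over a sequence of codebook indices and stops either when the lead of the best language over the runner-up exceeds the threshold or when the frame budget (standing in for the predefined time interval) is used up.

```python
import math


def train_bigram(sequences, codebook_size, alpha=1.0):
    """Return add-alpha smoothed bigram log-probabilities over codebook indices.

    Assumption: a bigram model with add-alpha smoothing; the summary only
    says "n-gram models" and does not fix the order or the smoothing.
    """
    counts = [[alpha] * codebook_size for _ in range(codebook_size)]
    for seq in sequences:
        for prev, cur in zip(seq, seq[1:]):
            counts[prev][cur] += 1.0
    logp = []
    for row in counts:
        total = sum(row)
        logp.append([math.log(c / total) for c in row])
    return logp


def identify_language(symbols, models, max_frames, threshold):
    """Accumulate log-probabilities per language and decide early if possible.

    symbols:    codebook indices, one per frame (hypothetical input format)
    models:     dict mapping language name to a bigram log-probability table
    max_frames: frame budget standing in for the predefined time interval
    threshold:  required log-probability lead of the best language
    Returns (language, frames_used).
    """
    scores = {lang: 0.0 for lang in models}
    frames_used = 0
    for t in range(1, min(len(symbols), max_frames)):
        prev, cur = symbols[t - 1], symbols[t]
        for lang, logp in models.items():
            scores[lang] += logp[prev][cur]
        frames_used = t + 1
        ranked = sorted(scores.values(), reverse=True)
        # Early decision: the best language leads the runner-up clearly enough.
        if len(ranked) > 1 and ranked[0] - ranked[1] > threshold:
            break
    best = max(scores, key=scores.get)
    return best, frames_used
```

Because the models only need codebook sequences, `models` may also contain languages for which no recognizer exists; a caller could then map such a result to a prerecorded message instead of starting a dialog.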

© 1999 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Nöth, E., Harbeck, S., Niemann, H. (1999). Multilingual Speech Recognition. In: Ponting, K. (ed.) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_31

  • DOI: https://doi.org/10.1007/978-3-642-60087-6_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-64250-0

  • Online ISBN: 978-3-642-60087-6

  • eBook Packages: Springer Book Archive
