A Study on Speech Coders for Automatic Speech Recognition in Adverse Communication Environments

Ho Choi, Seung

doi:10.1007/978-3-642-25453-6_7

Seung Ho Choi³

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 252))

Included in the following conference series:

International Conference on Informatics Engineering and Information Science

1466 Accesses

Abstract

In this research work, we present the effects of several standard speech coders on automatic speech recognition in adverse communication environments such as tandem, frame erasure, and noisy conditions. The adverse conditions were chosen to simulate the operations of mobile communication environments. The comparative results can provide a guideline for selecting a speech coder when a speech recognition service is needed in digital communication networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Tan, Z.H., Lindberg, B.: Automatic speech recognition on mobile devices and over communication networks. In: Tan, Z.H., Lindberg, B. (eds.). Springer, Heidelberg (2008)
Google Scholar
Lilly, B.T., Paliwal, K.K.: Effect of speech coders on speech recognition performance. In: Proc. ICSLP, Philadelphia, PA, pp. 2344–2347 (1996)
Google Scholar
Mokbel, C., Mauuary, L., Karray, L., Jouvet, D., Monne, J., Simonin, J., Bartkova, K.: Towards improving ASR robustness for PSN and GSM telephone applications. Speech Communication 23(1-2), 141–159 (1997)
Article Google Scholar
Choi, S.H., Kim, H.K., Lee, H.S., Gray, R.M.: Speech recognition method using quantised LSP parameters in CELP-type coders. Electron. Lett. 34(2), 156–157 (1998)
Article Google Scholar
Huerta, J.M., Stern, R.M.: Speech recognition from GSM codec parameters. In: Proc. ICSLP, Sydney, Australia, pp. 1463–1466 (1998)
Google Scholar
Turunen, J., Vlag, D.: A Study of speech coding parameters in speech recognition. In: Proc. EUROSPEECH, Scandinavia, pp. 2363–2366 (2001)
Google Scholar
Carmen, P.M., Ascension, G.A., Diego, F.G.C., Fernando, D.M.: A comparison of front-ends for bitstream-based ASR over IP. Signal Processing 86(7), 1502–1508 (2006)
Article MATH Google Scholar
Kleijn, W.B., Paliwal, K.K.: Speech coding and synthesis. In: Kleijn, W.B., Paliwal, K.K. (eds.). Elsevier Science, Amsterdam (1995)
Google Scholar
TIA/EIA IS96A: Speech service option standard for wideband spread spectrum digital cellular system (1994)
Google Scholar
Qualcomm: High rate speech service option for wideband spread spectrum communication systems (1996)
Google Scholar
TIA/EIA IS127: Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems (1995)
Google Scholar
ITU-T G.729: Coding for speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP) (1996)
Google Scholar
Jarvinen, K., Vainio, J., Kapanen, P., Honkanen, T., Haavisto, P., Salami, R., Laflamme, C., Adoul, J.-P.: GSM enhanced full rate speech codec. In: Proc. ICASSP, Munich, Germany, pp. 771–774 (1997)
Google Scholar
Kondoz, A.M.: Digital speech: Coding for low bit rate communications systems. In: Kondoz, A.M. (ed.). John Wiley (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronic and Information Engineering, Seoul National University of Science and Technology, Seoul, 139-743, Korea
Seung Ho Choi

Authors

Seung Ho Choi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Advanced Informatics School (UTM AIS), UTM International Campus, 54100, Kuala Lumpur, Malaysia
Azizah Abd Manaf , Akram Zeki , Mazdak Zamani & Suriayati Chuprat , , &
Information Systems Department, King Saud University, Riyadh, Saudi Arabia
Eyas El-Qawasmeh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ho Choi, S. (2011). A Study on Speech Coders for Automatic Speech Recognition in Adverse Communication Environments. In: Abd Manaf, A., Zeki, A., Zamani, M., Chuprat, S., El-Qawasmeh, E. (eds) Informatics Engineering and Information Science. ICIEIS 2011. Communications in Computer and Information Science, vol 252. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25453-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-25453-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25452-9
Online ISBN: 978-3-642-25453-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics