Abstract
In this research work, we present the effects of several standard speech coders on automatic speech recognition in adverse communication environments such as tandem, frame erasure, and noisy conditions. The adverse conditions were chosen to simulate the operations of mobile communication environments. The comparative results can provide a guideline for selecting a speech coder when a speech recognition service is needed in digital communication networks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Tan, Z.H., Lindberg, B.: Automatic speech recognition on mobile devices and over communication networks. In: Tan, Z.H., Lindberg, B. (eds.). Springer, Heidelberg (2008)
Lilly, B.T., Paliwal, K.K.: Effect of speech coders on speech recognition performance. In: Proc. ICSLP, Philadelphia, PA, pp. 2344–2347 (1996)
Mokbel, C., Mauuary, L., Karray, L., Jouvet, D., Monne, J., Simonin, J., Bartkova, K.: Towards improving ASR robustness for PSN and GSM telephone applications. Speech Communication 23(1-2), 141–159 (1997)
Choi, S.H., Kim, H.K., Lee, H.S., Gray, R.M.: Speech recognition method using quantised LSP parameters in CELP-type coders. Electron. Lett. 34(2), 156–157 (1998)
Huerta, J.M., Stern, R.M.: Speech recognition from GSM codec parameters. In: Proc. ICSLP, Sydney, Australia, pp. 1463–1466 (1998)
Turunen, J., Vlag, D.: A Study of speech coding parameters in speech recognition. In: Proc. EUROSPEECH, Scandinavia, pp. 2363–2366 (2001)
Carmen, P.M., Ascension, G.A., Diego, F.G.C., Fernando, D.M.: A comparison of front-ends for bitstream-based ASR over IP. Signal Processing 86(7), 1502–1508 (2006)
Kleijn, W.B., Paliwal, K.K.: Speech coding and synthesis. In: Kleijn, W.B., Paliwal, K.K. (eds.). Elsevier Science, Amsterdam (1995)
TIA/EIA IS96A: Speech service option standard for wideband spread spectrum digital cellular system (1994)
Qualcomm: High rate speech service option for wideband spread spectrum communication systems (1996)
TIA/EIA IS127: Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems (1995)
ITU-T G.729: Coding for speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear-prediction (CS-ACELP) (1996)
Jarvinen, K., Vainio, J., Kapanen, P., Honkanen, T., Haavisto, P., Salami, R., Laflamme, C., Adoul, J.-P.: GSM enhanced full rate speech codec. In: Proc. ICASSP, Munich, Germany, pp. 771–774 (1997)
Kondoz, A.M.: Digital speech: Coding for low bit rate communications systems. In: Kondoz, A.M. (ed.). John Wiley (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ho Choi, S. (2011). A Study on Speech Coders for Automatic Speech Recognition in Adverse Communication Environments. In: Abd Manaf, A., Zeki, A., Zamani, M., Chuprat, S., El-Qawasmeh, E. (eds) Informatics Engineering and Information Science. ICIEIS 2011. Communications in Computer and Information Science, vol 252. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25453-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-25453-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25452-9
Online ISBN: 978-3-642-25453-6
eBook Packages: Computer ScienceComputer Science (R0)