Distributed Speech Recognition (DSR) systems rely on efficient transmission of speech information from distributed clients to a centralized server. Wireless or network communication channels within DSR systems are typically noisy and bursty. Thus, DSR systems must utilize efficient Error Recovery (ER) schemes during transmission of speech information. Some ER strategies, referred to as forward error control (FEC), aim to create redundancy in the source coded bitstream to overcome the effect of channel errors, while others are designed to create spread or delay in the feature stream in order to overcome the effect of bursty channel errors. Furthermore, ER strategies may be designed as a combination of the previously described techniques. This chapter presents an array of error recovery techniques for remote speech recognition applications.
This chapter is organized as follows. First, channel characterization and modeling are discussed. Next, media-specific FEC is presented for packet erasure applications, followed by a discussion on media-independent FEC techniques for bit error applications, including general linear block codes, cyclic codes, and convolutional codes. The application of unequal error protection (UEP) strategies utilizing combinations of the aforementioned FEC methods is also presented. Finally, frame-based interleaving is discussed as an alternative to overcoming the effect of bursty channel erasures. The chapter concludes with examples of modern standards for channel coding strategies for distributed speech recognition (DSR).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bai, H., and Atiquzzaman, M. (2003). Error Modeling Schemes for Fading Channels in Wire-less Communications: A Survey. IEEE Communications Surveys and Tutorials, Fourth Quarter, vol. 5, no. 2, pp. 2-9.
Bernard, A., and Alwan, A. (2001). Joint Channel Decoding- Viterbi Recognition for Wireless Applications. Proceedings of Europspeech, vol. 3, pp. 2703-2706.
Bernard, A., and Alwan, A. (2002a). Low-Bitrate Distributed Speech Recognition for Packet-Based and Wireless Communication. IEEE Transactions on Speech and Audio Processing, vol. 10, no. 8, pp. 570-579.
Bernard, A., Liu, X., Wesel, R., and Alwan, A. (2002b). Speech Transmission Using Rate-Compatible Trellis Codes and Embedded Source Coding. IEEE Transactions on Commu-nications, vol. 50, no. 2, pp. 309-320.
Bernard, A. (2002). Source and Channel Coding for Speech Transmission and Remote Speech Recognition, PhD Thesis, University of California, Los Angeles.
Blahut, R.E. (2004). Algebraic Codes for Data Transmission. Cambridge University Press, Cambridge, UK.
Bolot, J.-C. (2003). End-to-End Packet Delay and Loss Behavior in the Internet. ACM Sig-comm, pp. 289-298.
Boulis, C., Ostendorf, M., Riskin, E. A., and Otterson, S. (2002). Graceful Degradation of Speech Recognition Performance Over Packet-Erasure Networks. IEEE Transactions on Speech and Audio Processing. vol. 10, no. 8, pp. 580-590.
Elliot, E. (1963). Estimates of Error Rates for Codes on Bursty Noise Channels. Bell Systems Technical Journal, vol. 42, no. 9.
ETSI EN 300 909 v7.3.1 (1998). Digital Cellular Telecommunications System(Phase 2+)(GSM); Channel Coding.
ETSI ES 201 108 v1.1.2 (2000). Distributed Speech Recognition; Front-end Feature Extrac-tion Algorithm; Compression Algorithms.
Han, K.J., Srinivasamurthy, N., and Narayanan, S. (2004). Robust Speech Recognition over Packet Networks: An Overview. Proceedings of ICSLP, vol. 3, pp. 1791-1794.
Hirsch, H.G., and Pearce, D. (2000). The AURORA Experimental Framework for the Per-formance Evaluation of Speech Recognition Systems under Noisy Condition. In Proceed-ings of ISCA ITRW ASR 2000.
James, A.B., and Milner, B.P. (2004). An Analysis of Interleavers for Robust Speech Recogni-tion in Burst-Like Packet Loss. In Proceedings of ICASSP, vol. 1, pp. 853-856.
Jiao, C., Schwiebert, L., and Xu, B. (2002). On Modeling the Packet Error Statistics in Bursty Channels. IEEE LCN, pp. 534-538.
Leon-Garcia, A. (2007). Probability and Random Processes for Electrical Engineers. Prentice-Hall.
Peinado, A.M., Gomez, A.M., Sanchez, V., Perez-Cordoba, J.L., and Rubio, A.J. (2005a). Packet Loss Concealment Based on VQ Replicas and MMSE Estimation Applied to Dis-tributed Speech Recognition. Proceedings of ICASSP. vol. 1, pp. 329-330.
Peindo, A.M., Sanchez, V., Perez-Cordoba, J.L., and Rubio, A.J. (2005b). Efficient MMSE-Based Channel Error Mitigation Techniques. Application to Distributed Speech Recogni-tion Over Wireless Channels. IEEE Transactions on Wireless Communications, vol. 4, no. 1, pp.14-19.
Peindo, A.M., and Segura, J.C. (2006). Speech Recognition Over Digital Channels. Wiley, New York, West Sussex, England.
Sklar, B. (1997). Rayleigh Fading Channels in Mobile Digital Communication Systems Part 1: Characterization. IEEE Communications Magazine, pp. 90-100.
Tan, Z.-H., Dalsgaard, P., and Lindberg, B. (2005). Automatic Speech Recognition over Error-Prone Wireless Networks. Speech Communication, vol. 47, pp. 220-242.
Weerackody, V., Reichl, W., and Potamianos, A. (2002). An Error-Protected Speech Recogni-tion System for Wireless Communications. IEEE Transactions on Wireless Communica-tions, vol. 1, no. 2, pp. 282-291.
Viterbi, A. (1971). Convolutional Codes and Their Performance in Communication Systems. IEEE Transactions on Communications, vol. 19, no. 5, part 1, pp. 751-772.
Wesel, R.D. (2003). Convolutional Codes, from Encyclopedia of Telecommunications. Wiley, New York.
Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., and Woodland, P. (2000). The HTK Book. Microsoft Corporation.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag London Limited
About this chapter
Cite this chapter
Borgström, B.J., Bernard, A., Alwan, A. (2008). Error Recovery: Channel Coding and Packetization. In: Automatic Speech Recognition on Mobile Devices and over Communication Networks. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84800-143-5_8
Download citation
DOI: https://doi.org/10.1007/978-1-84800-143-5_8
Publisher Name: Springer, London
Print ISBN: 978-1-84800-142-8
Online ISBN: 978-1-84800-143-5
eBook Packages: Computer ScienceComputer Science (R0)