Advertisement

A Smart Error Protection Scheme Based on Estimation of Perceived Speech Quality for Portable Digital Speech Streaming Systems

  • Jin Ah Kang
  • Hong Kook Kim
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 151)

Abstract

In this paper, a smart error protection (SEP) scheme is proposed to improve speech quality of a portable digital speech streaming (PDSS) system via a lossy transmission channel. To this end, the proposed SEP scheme estimates the perceived speech quality (PSQ) for received speech data, and then transmits redundant speech data (RSD) in order to assist speech decoder to reconstruct lost speech signals for high packet loss rates. According to the estimated PSQ, the proposed SEP scheme controls the RSD transmission, and then optimizes a bitrate of speech coding to encode the current speech data (CSD) against the amount of RSD without increasing transmission bandwidth. The effectiveness of the proposed SEP scheme is finally demonstrated using adaptive multirate-narrowband (AMR-NB) and ITU-T Recommendation P.563 as a scalable speech codec and a PSQ estimator, respectively. It is shown from experiments that a PDSS system employing the proposed SEP scheme significantly improves speech quality under packet loss conditions.

Keywords

Portable digital speech streaming systems packet loss error protection perceived speech quality redundant speech transmission 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Wu, C.-F., Lee, C.-L., Chang, W.-W.: Perceptual-based playout mechanisms for multi-stream voice over IP networks. In: Proceedings of Interspeech, Antwerp, Belgium, pp. 1673–1676 (September 2007)Google Scholar
  2. 2.
    Zhang, Q., Wang, G., Xiong, Z., Zhou, J., Zhu, W.: Error robust scalable audio streaming over wireless IP networks. IEEE Transactions on Multimedia 6(6), 897–909 (2004)CrossRefGoogle Scholar
  3. 3.
    Bolot, J.-C., Fosse-Parisis, S., Towsley, D.: Adaptive FEC-based error control for Internet telephony. In: Proceedings of IEEE International Conference on Computer Communications (INFOCOM), New York, NY, pp. 1453–1460 (March 1999)Google Scholar
  4. 4.
    Jiang, W., Schulzrinne, H.: Comparison and optimization of packet loss repair methods on VoIP perceived quality under bursty loss. In: Proceedings of 12th International Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV), Miami, FL, pp. 73–81 (May 2002)Google Scholar
  5. 5.
    Yung, C., Fu, H., Tsui, C., Cheng, R.S., George, D.: Unequal error protection for wireless transmission of MPEG audio. In: Proceedings of IEEE International Symposium on Circuits and Systems (ISCAS), Orlando, FL, pp. 342–345 (May 1999)Google Scholar
  6. 6.
    Hagenauer, J., Stockhammer, T.: Channel coding and transmission aspects for wireless multimedia. Proceedings of the IEEE 87, 1764–1777 (1999)CrossRefGoogle Scholar
  7. 7.
    Ito, A., Konno, K., Makino, S.: Packet loss concealment for MDCT-based audio codec using correlation-based side information. International Journal of Innovative Computing, Information and Control 6, 3(B), 1347–1361 (2010)Google Scholar
  8. 8.
    ETSI 3GPP TS 26.101: Adaptive Multi-Rate (AMR) Speech Codec Frame Structure (January 2010)Google Scholar
  9. 9.
    ITU-T Recommendation P.563: Single-Ended Method for Objective Audio Quality Assessment in Narrow-Band Telephony Applications (May 2004)Google Scholar
  10. 10.
    IETF RFC 3267: Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for the Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs (June 2002)Google Scholar
  11. 11.
    IETF RFC 1889: RTP: A Transport Protocol for Real-Time Applications (January 1996)Google Scholar
  12. 12.
    NTT-AT: Multi-Lingual Speech Database for Telephonometry (1994)Google Scholar
  13. 13.
    ITU-T Recommendation G.191: Software Tools for Speech and Audio Coding Standardization (November 1996)Google Scholar
  14. 14.
    ITU-T Recommendation P.862: Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs (February 2001)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Jin Ah Kang
    • 1
  • Hong Kook Kim
    • 1
  1. 1.School of Information and CommunicationsGwangju Institute of Science and Technology (GIST)GwangjuKorea

Personalised recommendations