An Embedded Variable Bit-Rate Audio Coder for Ubiquitous Speech Communications

  • Conference paper
Ubiquitous Convergence Technology (ICUCT 2006)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4412)


Abstract

In this paper, we propose an embedded variable bit-rate (VBR) audio coder that provides the best-fitting quality of service (QoS) and better service connectivity for ubiquitous speech communications. It offers scalable bandwidth, from narrowband to wideband speech signals, and an embedded bit-rate of 8–32 kbit/s that adapts to network conditions and terminal capacity. In the design of the embedded VBR coder, narrowband signals are compressed by an existing standard speech coding method for compatibility with the G.729 coder, and the remaining signals are then compressed hierarchically using CELP enhancement and transform coding with temporal noise shaping (TNS). Objective and subjective quality tests show that the proposed embedded VBR audio coder provides reasonable quality compared with existing audio coders such as G.722 and G.722.2, in terms of mean opinion score (MOS) and wideband perceptual evaluation of speech quality (PESQ-WB).
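The embedded property described in the abstract means that a lower-rate stream can be obtained by simply dropping the trailing enhancement layers of each frame, without re-encoding. The following Python sketch illustrates that idea only. The 20 ms frame length, the layer names, and the intermediate rates (12, 16, 24 kbit/s) are assumptions made for illustration; the abstract specifies only the 8 kbit/s G.729-compatible core and the 32 kbit/s upper bound.

```python
from dataclasses import dataclass

FRAME_MS = 20  # assumed frame duration; not stated in the abstract


@dataclass(frozen=True)
class Layer:
    name: str
    cumulative_kbps: int  # total bit-rate once this layer is included


# Hypothetical layer layout spanning the 8-32 kbit/s range.
LAYERS = [
    Layer("core (G.729-compatible narrowband)", 8),
    Layer("CELP enhancement", 12),
    Layer("transform coding 1", 16),
    Layer("transform coding 2", 24),
    Layer("transform coding 3", 32),
]


def bytes_per_frame(kbps: int, frame_ms: int = FRAME_MS) -> int:
    """Frame size in bytes: kbit/s multiplied by ms gives bits."""
    return kbps * frame_ms // 8


def select_layers(available_kbps: int) -> list:
    """Return every layer whose cumulative rate fits the available rate."""
    if available_kbps < LAYERS[0].cumulative_kbps:
        raise ValueError("available rate is below the core layer rate")
    return [layer for layer in LAYERS if layer.cumulative_kbps <= available_kbps]


def truncate_frame(frame: bytes, available_kbps: int) -> bytes:
    """Keep only the leading bytes that belong to the selected layers."""
    kept = select_layers(available_kbps)
    return frame[: bytes_per_frame(kept[-1].cumulative_kbps)]


if __name__ == "__main__":
    full_frame = bytes(bytes_per_frame(32))  # placeholder 32 kbit/s frame
    for rate in (8, 20, 32):
        out = truncate_frame(full_frame, rate)
        print(rate, "kbit/s available:", len(out), "bytes kept")
```

Under these assumptions, a network node or terminal that can only sustain 20 kbit/s would keep the first three layers (16 kbit/s cumulative) of each frame and discard the rest, while a G.729-only receiver would keep just the core layer.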




Editor information

Frank Stajano, Hyoung Joong Kim, Jong-Suk Chae, Seong-Dong Kim


Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Kim, D.Y., Park, J.W. (2007). An Embedded Variable Bit-Rate Audio Coder for Ubiquitous Speech Communications. In: Stajano, F., Kim, H.J., Chae, JS., Kim, SD. (eds) Ubiquitous Convergence Technology. ICUCT 2006. Lecture Notes in Computer Science, vol 4412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71789-8_4


  • DOI: https://doi.org/10.1007/978-3-540-71789-8_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71788-1

  • Online ISBN: 978-3-540-71789-8

  • eBook Packages: Computer Science, Computer Science (R0)
