Skip to main content

Introduction

  • Chapter
  • First Online:
Book cover Speech Coding

Part of the book series: Signals and Communication Technology ((SCT))

  • 1060 Accesses

Abstract

The objective of speech coding technologies is primarily to enable spoken communication between geographically separated people and also, to allow storage of speech signals. The performance of such technologies can be measured by both the perceived quality of the communication experience as well as the amount of resources required. For efficient performance, speech codecs are based on two types of modelling techniques applied in parallel: (1) they model the signal source by a model of speech production and (2) for optimisation of quality, they apply a perceptual model. These models include also entropy coding to remove statistical redundancy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Most authors do not define phonations so specifically, but let the term phonation refer to all physiological processes. Here it is, however, useful to use a similar definition for all three terms; phoneme, phonation and phone.

References

  1. 3GPP. TS 26.445, EVS Codec Detailed Algorithmic Description; 3GPP Technical Specification (Release 12) (2014)

    Google Scholar 

  2. ANSI. S3. 5-1997, Methods for the calculation of the speech intelligibility index (1997)

    Google Scholar 

  3. Bosi, M., Goldberg, R.E.: Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, Dordrecht (2003)

    Book  Google Scholar 

  4. Fastl, H., Zwicker, E.: Psychoacoustics: Facts and Models, vol. 22. Springer, Heidelberg (2006)

    Google Scholar 

  5. ISO/IEC 23003–3:2012. MPEG-D (MPEG audio technologies), Part 3: Unified speech and audio coding (2012)

    Google Scholar 

  6. Kates, J.M., Arehart, K.H.: Coherence and the speech intelligibility index. J. Acoust. Soc. Am. 117(4), 2224–2237 (2005)

    Article  Google Scholar 

  7. Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. Proc. ICASSP 2, 1109–1112 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tom Bäckström .

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Bäckström, T. (2017). Introduction. In: Speech Coding. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-50204-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50204-5_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50202-1

  • Online ISBN: 978-3-319-50204-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics