The objective of speech coding technologies is primarily to enable spoken communication between geographically separated people and also, to allow storage of speech signals. The performance of such technologies can be measured by both the perceived quality of the communication experience as well as the amount of resources required. For efficient performance, speech codecs are based on two types of modelling techniques applied in parallel: (1) they model the signal source by a model of speech production and (2) for optimisation of quality, they apply a perceptual model. These models include also entropy coding to remove statistical redundancy.
KeywordsSpeech Signal Source Model Minimum Mean Square Error Audio Signal Speech Sound
- 1.3GPP. TS 26.445, EVS Codec Detailed Algorithmic Description; 3GPP Technical Specification (Release 12) (2014)Google Scholar
- 2.ANSI. S3. 5-1997, Methods for the calculation of the speech intelligibility index (1997)Google Scholar
- 4.Fastl, H., Zwicker, E.: Psychoacoustics: Facts and Models, vol. 22. Springer, Heidelberg (2006)Google Scholar
- 5.ISO/IEC 23003–3:2012. MPEG-D (MPEG audio technologies), Part 3: Unified speech and audio coding (2012)Google Scholar
- 7.Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. Proc. ICASSP 2, 1109–1112 (2005)Google Scholar