Abstract
The aim of this research is to improve the performance of the E-model, one of the most successful non-intrusive speech quality prediction models for voice communication over packet-based networks. The E-model still has limitations: its calculation method is restricted to a set of ITU-T voice codecs. This paper proposes a method to estimate, for non-ITU-T codecs, the two codec-related parameters used in the E-model calculation: the equipment impairment factor \( I_{e} \) and the packet loss robustness factor \( Bpl \). Both parameters are estimated by curve fitting to \( I_{e} \) values derived from PESQ results under various levels of network packet loss. The resulting \( I_{e} \) and \( Bpl \) values for eight narrowband codecs (G.711, G.729, GSM, AMR, iLBC, Speex, Silk, and Opus) are presented. Statistical analysis was also performed for model validation. The results show that the E-model with the proposed \( I_{e} \) and \( Bpl \) parameters achieves good accuracy and corresponds well with PESQ MOS across the eight codecs.
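The estimation step summarized above can be illustrated with a minimal sketch. It is not the authors' implementation, since the abstract does not give their exact fitting procedure. The sketch assumes the ITU-T G.107 relations: the R-to-MOS mapping, the effective equipment impairment \( I_{e,eff} = I_{e} + (95 - I_{e}) \cdot Ppl / (Ppl / BurstR + Bpl) \) with random loss (\( BurstR = 1 \)), and \( R = 93.2 - I_{e,eff} \) with all other E-model parameters at their default values. The function names and the input data are hypothetical; the MOS values are generated synthetically from the model itself, not taken from the paper.

```python
# Sketch: estimate Ie and Bpl by least-squares fitting of the G.107
# packet-loss model to MOS values measured at several loss rates.
# Assumptions (not from the paper): R0 = 93.2, random loss (BurstR = 1),
# hypothetical function names, synthetic input data.

R_DEFAULT = 93.2  # R with all G.107 parameters at their default values


def r_to_mos(r):
    """G.107 mapping from transmission rating R to MOS."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1 + 0.035 * r + r * (r - 60) * (100 - r) * 7e-6


def mos_to_r(mos, lo=6.5, hi=100.0):
    """Invert r_to_mos by bisection; the mapping is increasing on [6.5, 100]."""
    for _ in range(60):
        mid = (lo + hi) / 2
        if r_to_mos(mid) < mos:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def ie_eff_model(ppl, ie, bpl):
    """Effective equipment impairment for packet loss ppl (percent)."""
    return ie + (95 - ie) * ppl / (ppl + bpl)


def fit_ie_bpl(loss_pct, mos_values):
    """Return (Ie, Bpl) minimizing the squared error in Ie,eff.

    For a fixed Bpl the model is linear in Ie, so Ie has a closed-form
    least-squares solution; Bpl is found by a grid search in steps of 0.01.
    """
    measured = [R_DEFAULT - mos_to_r(m) for m in mos_values]
    best = None
    for k in range(10, 6001):  # Bpl from 0.10 to 60.00
        bpl = k / 100
        x = [p / (p + bpl) for p in loss_pct]
        num = sum((y - 95 * xi) * (1 - xi) for y, xi in zip(measured, x))
        den = sum((1 - xi) ** 2 for xi in x)
        ie = num / den
        sse = sum((ie_eff_model(p, ie, bpl) - y) ** 2
                  for p, y in zip(loss_pct, measured))
        if best is None or sse < best[0]:
            best = (sse, ie, bpl)
    return best[1], best[2]


# Synthetic example: generate MOS from known illustrative parameters,
# then check that the fit recovers them.
loss = [0, 1, 3, 5, 10, 15, 20]
mos = [r_to_mos(R_DEFAULT - ie_eff_model(p, 11.0, 19.0)) for p in loss]
ie_hat, bpl_hat = fit_ie_bpl(loss, mos)
print(f"Ie = {ie_hat:.2f}, Bpl = {bpl_hat:.2f}")
```

In practice the `mos` list would hold PESQ scores measured for one codec at each packet loss rate, and measurement noise would make the fit approximate rather than exact. The grid search is chosen here only to keep the sketch dependency-free; a nonlinear least-squares routine would serve the same purpose.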
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Triyason, T., Kanthamanon, P. (2016). E-Model Parameters Estimation for VoIP with Non-ITU Codec Speech Quality Prediction. In: Meesad, P., Boonkrong, S., Unger, H. (eds) Recent Advances in Information and Communication Technology 2016. Advances in Intelligent Systems and Computing, vol 463. Springer, Cham. https://doi.org/10.1007/978-3-319-40415-8_30
DOI: https://doi.org/10.1007/978-3-319-40415-8_30
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40414-1
Online ISBN: 978-3-319-40415-8
eBook Packages: Engineering, Engineering (R0)