E-Model Parameters Estimation for VoIP with Non-ITU Codec Speech Quality Prediction

Triyason, Tuul; Kanthamanon, Prasert

doi:10.1007/978-3-319-40415-8_30

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 463)

Abstract

The aim of this research is to improve the performance of the E-model, one of the most successful non-intrusive speech quality prediction models for voice communication over packet-based networks. The E-model still has a limitation: its calculation method is restricted to a set of voice codecs from ITU-T. This paper proposes a method to estimate the two codec-related parameters used in the E-model calculation, the equipment impairment factor \( I_{e} \) and the packet loss robustness factor \( Bpl \), for non-ITU-T codecs. The estimation uses a curve fitting method to calculate \( I_{e} \) values from PESQ results under various levels of network packet loss. The sets of \( I_{e} \) and \( Bpl \) values for eight narrowband codecs (G.711, G.729, GSM, AMR, iLBC, Speex, Silk, and Opus) are presented. Statistical analysis was also performed for model validation. The results show that the E-model with the proposed \( I_{e} \) and \( Bpl \) parameters achieved good accuracy and good correspondence with PESQ MOS across the eight codecs.
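As a rough illustration of what fitting \( I_{e} \) and \( Bpl \) to quality scores under packet loss can look like, the Python sketch below uses the effective impairment formula and the R-to-MOS mapping from ITU-T G.107. It is not the authors' procedure, which the abstract does not detail. Several things are assumptions made here for illustration: the PESQ scores are taken to be already expressed as MOS, all other E-model parameters are at their defaults (so R = 93.2 - Ie_eff), loss is random (BurstR = 1), \( I_{e} \) is read from the zero-loss point, and \( Bpl \) is found by a simple grid search. The function names and the sample values (Ie = 11, Bpl = 19) are hypothetical, and the data points are synthetic, not measurements.

```python
"""Sketch: estimate Ie and Bpl from MOS-vs-packet-loss points (assumptions inline)."""

R_DEFAULT = 93.2  # assumed rating when every other E-model parameter is at its default


def ie_eff(ppl, ie, bpl, burst_r=1.0):
    """Effective equipment impairment factor in the G.107 form.

    ppl is the packet loss percentage; burst_r = 1 means random loss.
    """
    return ie + (95.0 - ie) * ppl / (ppl / burst_r + bpl)


def r_to_mos(r):
    """G.107 mapping from rating factor R to estimated MOS."""
    if r <= 0.0:
        return 1.0
    if r >= 100.0:
        return 4.5
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6


def mos_to_r(mos, lo=6.5, hi=100.0):
    """Invert r_to_mos by bisection; the mapping is increasing on [6.5, 100]."""
    for _ in range(60):
        mid = (lo + hi) / 2.0
        if r_to_mos(mid) < mos:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def fit_ie_bpl(loss_pct, mos):
    """Return (ie, bpl) fitted to the points; a zero-loss point is required.

    Assumption: Ie is taken from the zero-loss measurement, then Bpl is chosen
    by grid search over 0.1 .. 100.0 (step 0.1) to minimise squared error.
    """
    measured = [R_DEFAULT - mos_to_r(m) for m in mos]
    ie = measured[loss_pct.index(0)]  # raises ValueError without a zero-loss point

    def sse(bpl):
        return sum((ie_eff(p, ie, bpl) - y) ** 2 for p, y in zip(loss_pct, measured))

    candidates = [i / 10.0 for i in range(1, 1001)]
    return ie, min(candidates, key=sse)


if __name__ == "__main__":
    # Synthetic MOS values generated from hypothetical Ie = 11, Bpl = 19.
    losses = [0, 1, 2, 3, 5, 10, 15, 20]
    synthetic_mos = [r_to_mos(R_DEFAULT - ie_eff(p, 11.0, 19.0)) for p in losses]
    ie, bpl = fit_ie_bpl(losses, synthetic_mos)
    print(f"Ie = {ie:.1f}, Bpl = {bpl:.1f}")
```

With real measurements, a nonlinear least-squares routine that fits both parameters jointly would likely be preferable to the grid search, which is used here only to keep the sketch free of external dependencies.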

Author information

Authors and Affiliations

School of Information Technology, King Mongkut’s University of Technology Thonburi, Pracha-utid Road, Bangmod, Toongkru, Bangkok, Thailand
Tuul Triyason & Prasert Kanthamanon

Corresponding author

Correspondence to Tuul Triyason.

Editor information

Editors and Affiliations

Faculty of Information Technology, King Mongkut's University of Technology North Bangkok, Bangkok, Thailand
Phayung Meesad
Faculty of Information Technology, King Mongkut's University of Technology North Bangkok, Bangkok, Thailand
Sirapat Boonkrong
Lehrgebiet Kommunikationsnetze, FernUniversität in Hagen, Hagen, Germany
Herwig Unger

About this paper

Cite this paper

Triyason, T., Kanthamanon, P. (2016). E-Model Parameters Estimation for VoIP with Non-ITU Codec Speech Quality Prediction. In: Meesad, P., Boonkrong, S., Unger, H. (eds) Recent Advances in Information and Communication Technology 2016. Advances in Intelligent Systems and Computing, vol 463. Springer, Cham. https://doi.org/10.1007/978-3-319-40415-8_30

DOI: https://doi.org/10.1007/978-3-319-40415-8_30
Published: 12 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40414-1
Online ISBN: 978-3-319-40415-8
eBook Packages: Engineering, Engineering (R0)