Abstract
In this work, we analyze several quality measures for speaker verification from the point of view of their utility, i.e., their ability to predict performance in an authentication task. We select several quality measures derived from classic indicators of speech degradation, namely ITU P.563 estimator of subjective quality, signal to noise ratio and kurtosis of linear predictive coefficients. Moreover, we propose a novel quality measure derived from what we have called Universal Background Model Likelihood (UBML), which indicates the degradation of a speech utterance in terms of its divergence with respect to a given universal model. Utility of quality measures is evaluated following the protocols and databases of NIST Speaker Recognition Evaluation (SRE) 2006 and 2008 (telephone-only subset), and ultimately by means of error-vs.-rejection plots as recommended by NIST. Results presented in this study show significant utility for all the quality measures analyzed, and also a moderate decorrelation among them.
Keywords
Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Inter-Speaker Variability in Speaker Verification. IEEE Transactions on Audio, Speech and Language Processing 16(5), 980–988 (2008)
Brümmer, N., Burget, L., Černocký, J., Glembek, O., Grézl, F., Karafiát, M., van Leeuwen, D., Matějka, P., Schwarz, P., Strasheim, A.: Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech and Language Processing 15(7), 2072–2084 (2007)
Przybocki, M.A., Martin, A.F., Le, A.N.: NIST Speaker Recognition Evaluations Utilizing the Mixer Corpora—2004, 2005, 2006. IEEE Transactions on Audio, Speech and Language Processing 15(7), 1951–1959 (2007)
Ramos, D., Gonzalez-Rodriguez, J., Gonzalez-Dominguez, J., Lucena-Molina, J.J.: Addressing database mismatch in forensic speaker recognition with Ahumada III: a public real-casework database in Spanish. In: Proc. Interspeech 2008, vol. 1, pp. 1493–1496 (2008)
Grother, P., Tabassi, E.: Performance of Biometric Quality Measures. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(4), 531–543 (2007)
Malfait, L., Berger, J., Kastner, M.: P.563-The ITU-T Standard for Single-Ended Speech Quality Assessment. IEEE Trans. On audio, speech and language processing 14(6)
Garcia-Romero, D., Fierrez-Aguilar, J., Gonzalez-Rodriguez, J., Ortega-Garcia, J.: Using Quality Measures for Multilevel Speaker Recognition. Computer Speech and Language 20(2,3), 192–209 (2006)
Alonso-Fernandez, F., Fierrez, J., Ortega-Garcia, J., Gonzalez-Rodriguez, J., Fronthaler, H., Kollreider, K., Bigun, J.: A comparative study of fingerprint image-quality estimation methods. IEEE Trans. on Information Forensics and Security 2(4), 734–743 (2007)
Grancharov, V., Kleijn, W.B.: Speech Quality Assessment. Springer Handbook of Speech Processing. Springer, Heidelberg (2008)
Richiardi, J., Drygajlo, A.: Evaluation of speech quality measures for the purpose of speaker verification. In: Proc. of Odyssey 2008, the ISCA Speaker and Language Recognition Workshop, Stellenbosch, South Africa (2008)
Mean opinion score (MOS) terminology, ITU-T Rec. P.800.1 (2003)
Reynolds, D.A.: Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing 10, 19–41 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Harriero, A., Ramos, D., Gonzalez-Rodriguez, J., Fierrez, J. (2009). Analysis of the Utility of Classical and Novel Speech Quality Measures for Speaker Verification. In: Tistarelli, M., Nixon, M.S. (eds) Advances in Biometrics. ICB 2009. Lecture Notes in Computer Science, vol 5558. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01793-3_45
Download citation
DOI: https://doi.org/10.1007/978-3-642-01793-3_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01792-6
Online ISBN: 978-3-642-01793-3
eBook Packages: Computer ScienceComputer Science (R0)