Abstract
In this paper we propose the use of Support Vector Machine Regression (SVR) for robust speaker verification in two scenarios: i) strong mismatch in speech conditions and ii) forensic environment. The proposed approach seeks robustness to situations where a proper background database is reduced or not present, a situation typical in forensic cases which has been called database mismatch. For the mismatching condition scenario, we use the NIST SRE 2008 core task as a highly variable environment, but with a mostly representative background set coming from past NIST evaluations. For the forensic scenario, we use the Ahumada III database, a public corpus in Spanish coming from real authored forensic cases collected by Spanish Guardia Civil. We show experiments illustrating the robustness of a SVR scheme using a GLDS kernel under strong session variability, even when no session variability is applied, and especially in the forensic scenario, under database mismatch.
Chapter PDF
Similar content being viewed by others
Keywords
References
National Institute of Standards and Technology (NIST), 2008 speaker recognition evaluation plan (2008), http://www.nist.gov/speech/tests/sre/2008/index.html
Reynolds, D.A.: Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing 10, 19–41 (2000)
Campbell, W.M., Quatieri, T.F., Dunn, R.B.: Support Vector Machines for Speaker and language Recognition. Computer Speech and Language 20, 210–229 (2006)
Solomonoff, A., Campbell, W.M., Boardman, I.: Advances in Channel Compensation for SVM Speaker Recognition. In: Proc. Of ICASSP, pp. 629–632 (2005)
Kenny, P., Oullet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Inter-Speaker Variability in Speaker Verification. IEEE Transactions on Audio, Speech and Language Processing 16(5), 980–988 (2008)
Campbell, W.M., Campbell, J.P., Reynolds, D.A., Singer, E., Torres-Carrasquillo, P.A.: Support Vector Machines using GMM Supervectors for Speaker Verification. Signal Processing Letters 13(5), 308–311 (2006)
Brümmer, N., et al.: Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006. IEEE Transactions on Audio, Speech and Language Processing 15(7), 2072–2084 (2007)
Ramos, D., Gonzalez-Rodriguez, J., Gonzalez-Dominguez, J., Lucena-Molina, J.J.: Addressing Database Mismatch in Forensic Speaker Recognition with Ahumada III: a Public Real-Casework Database in Spanish. In: Proc. Of Interspeech, pp. 1493–1496 (2008)
Lopez-Moreno, I., Mateos-Garcia, I., Ramos, D., Gonzalez-Rodriguez, J.: Support Vector Regression for Speaker Verification. In: Proc. Of Interspeech, pp. 306–309 (2007)
Smola, A.J., Schoelkopf, B.: A Tutorial on Support Vector Regression. Tech. Rep. NeuroCOLT2 Technical Report NC2-TR-1998-030, Royal Holloway College (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mateos-Garcia, I., Ramos, D., Lopez-Moreno, I., Gonzalez-Rodriguez, J. (2009). Support Vector Machine Regression for Robust Speaker Verification in Mismatching and Forensic Conditions. In: Tistarelli, M., Nixon, M.S. (eds) Advances in Biometrics. ICB 2009. Lecture Notes in Computer Science, vol 5558. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01793-3_50
Download citation
DOI: https://doi.org/10.1007/978-3-642-01793-3_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01792-6
Online ISBN: 978-3-642-01793-3
eBook Packages: Computer ScienceComputer Science (R0)