Abstract
This paper proposes an enhanced speaker verification method built on a novel modified adaptive Gaussian mixture model (GMM) training scheme. Based on a weight factor of each observation, called the observation reliability, we apply a modified expectation-maximization (EM) algorithm combined with a modified maximum a posteriori (MAP) estimation to train the modified adaptive GMM. Using the proposed model, we generate GMM-supervectors, which are combined with a support vector machine (SVM) for the verification task. We evaluate the performance of a speaker verification system based on the proposed approaches on utterances from a Korean movie database (“You came from the stars”). Experimental results demonstrate that the proposed approaches can outperform the standard GMM-UBM and GMM-supervector approaches in noisy conditions.
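The abstract does not state the modified update equations, so the following is only an illustrative sketch of how an observation reliability could enter MAP mean adaptation and supervector construction. It assumes a diagonal-covariance UBM, a reliability value in [0, 1] per frame that simply scales that frame's posterior statistics, and the standard relevance-factor mean adaptation of the GMM-UBM framework. The function name `weighted_map_supervector` and the parameter names are hypothetical and are not taken from the paper.

```python
import numpy as np

def weighted_map_supervector(X, r, weights, means, variances, relevance=16.0):
    """Reliability-weighted MAP mean adaptation (illustrative assumption).

    X: (T, D) feature frames; r: (T,) assumed reliabilities in [0, 1].
    weights: (K,), means: (K, D), variances: (K, D) of a diagonal UBM.
    Returns the adapted means stacked into a (K * D,) supervector.
    """
    # Log-density of every frame under every diagonal Gaussian -> (T, K)
    diff = X[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    # Component posteriors per frame, normalised stably in the log domain
    log_post = np.log(weights) + log_gauss
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)
    # Assumption: each frame's statistics are scaled by its reliability
    g = post * r[:, None]
    n = g.sum(axis=0)                                   # soft counts, (K,)
    first = (g.T @ X) / np.maximum(n, 1e-10)[:, None]   # weighted means, (K, D)
    # Relevance-factor interpolation between the data and the UBM means
    alpha = (n / (n + relevance))[:, None]
    adapted = alpha * first + (1.0 - alpha) * means
    return adapted.reshape(-1)
```

Under these assumptions, a frame with reliability 0 contributes nothing to the adapted means, and if every frame has reliability 0 the supervector falls back to the UBM means. The resulting vector would then serve as the input feature to the SVM.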
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trinh, T.D., Park, M.K., Kim, J.Y., Lee, K.R., Cho, K. (2015). Enhanced Speaker Verification Using GMM-Supervector Based Modified Adaptive GMM Training. In: Kim, K., Wattanapongsakorn, N. (eds) Mobile and Wireless Technology 2015. Lecture Notes in Electrical Engineering, vol 310. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47669-7_17
DOI: https://doi.org/10.1007/978-3-662-47669-7_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47668-0
Online ISBN: 978-3-662-47669-7
eBook Packages: Engineering, Engineering (R0)