Enhanced Speaker Verification Using GMM-Supervector Based Modified Adaptive GMM Training

Trinh, Tan Dat; Park, Min Kyung; Kim, Jin Young; Lee, Kyong Rok; Cho, Keeseong

doi:10.1007/978-3-662-47669-7_17

Tan Dat Trinh³,
Min Kyung Park³,
Jin Young Kim³,
Kyong Rok Lee⁴ &
…
Keeseong Cho⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 310))

1295 Accesses
1 Citations

Abstract

In this paper, an enhanced speaker verification is proposed by exploring a novel modified adaptive Gaussian mixture model (GMM) training. Based weight factor of observation called the observation reliability; we propose to apply a modified Expectation maximization (EM) algorithm, combined with a modified Maximum a posteriori (MAP) estimation to train the modified adaptive GMM model. Using this proposed model, we generate GMM-supervectors which are combined with SVM for verification task. We evaluate performance of speaker verification system based the proposed approaches on utterances from Korean movie database (“You came from the stars”). Experimental results demonstrate that our proposed approaches can outperform the standard GMM-UBM and GMM-supervector approaches in noise conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint Factor Analysis versus Eigenchannels in Speaker Recognition. IEEE Trans. Audio, Speech, Lang. Process. 15(4), 1435–1447 (2007)
Article Google Scholar
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker Verification using Adapted Gaussian Mixture Models. Digital Signal Processing (10), 19–41 (2000)
Google Scholar
Campbell, W.M., Sturim, D.E., Reynolds, D.A., Solomonoff, A.: SVM based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation. In: Proc. IEEE ICASSP, vol. 1, pp. 97–100 (2006)
Google Scholar
Dehak, N., Kenny, P., Dehak, R., Glembek, O., Dumouchel, P., Burget, L., Hubeika, V., Castaldo, F.: Support Vector Machines and Joint Factor Analysis for Speaker Verification. In: Proc. IEEE ICASSP, pp. 4237–4240 (2009)
Google Scholar
Kim, J.Y., Min, S.H., Na, S.Y., Choi, H.S., Choi, S.H.: Modified GMM Training for Inexact Observation and Its Application to Speaker Identification. Speech Sciences 14, 163–175 (2007)
Google Scholar
May, T., Par, S.V.D., Kohlrausch, A.: Noise-Robust Speaker Recognition Combining Missing Data Techniques and Universal Background Modeling. IEEE Trans. Audio, Speech, Lang. Process. 20(1), 108–121 (2012)
Article Google Scholar
Brookes, M.: Voicebox: Speech Processing Toolbox for Matlab (2007), http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html
Bui, N.N., Kim, J.Y., Trinh, T.D.: A Non-Linear GMM KL and GUMI Kernel for SVM Using GMM-UBM Supervector in Home Acoustic Event Classification. IEICE Trans. Fundamentals E97-A(8), 1791–1794 (2014)
Article Google Scholar
Senturk, A., Gurgen, F.S.: Feature Selection by Independent Component Analysis for Robust Speaker Verification. International Journal of Computer Science and Network Security 6(3B), 229–239 (2006)
Google Scholar
Hermansky, H., Morgan, N.: RASTA Processing of Speech. IEEE Trans. on Speech and Audio Proc. 2(4), 578–589 (1994)
Article Google Scholar
Pelecanos, J., Sridharan, S.: Feature Warping for Robust Speaker Verification. In: Proc. Speaker Odyssey, Crete, Greece, pp. 213–218 (2001)
Google Scholar
Hsu, C.W., Chang, C.C., Lin, C.J.: A Practical Guide to Support Vector Classification (2010), http://www.csie.ntu.edu.tw/~cjlin/libsvm

Download references

Author information

Authors and Affiliations

Chonnam National University, Gwangju, Rep. of Korea
Tan Dat Trinh, Min Kyung Park & Jin Young Kim
Nambu University, Gwangju, Rep. of Korea
Kyong Rok Lee
ETRI, Daejeon, Rep. of Korea
Keeseong Cho

Authors

Tan Dat Trinh
View author publications
You can also search for this author in PubMed Google Scholar
Min Kyung Park
View author publications
You can also search for this author in PubMed Google Scholar
Jin Young Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kyong Rok Lee
View author publications
You can also search for this author in PubMed Google Scholar
Keeseong Cho
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Korea Industry Security Forum, Kyoung-gi, Korea, Korea, Republic of (South Korea)
Kuinam J. Kim
Computer Engineering Department, King Mongkut’s University of Technology Thonburi, Thung Khru, Bangkok, Thailand
Naruemon Wattanapongsakorn

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Trinh, T.D., Park, M.K., Kim, J.Y., Lee, K.R., Cho, K. (2015). Enhanced Speaker Verification Using GMM-Supervector Based Modified Adaptive GMM Training. In: Kim, K., Wattanapongsakorn, N. (eds) Mobile and Wireless Technology 2015. Lecture Notes in Electrical Engineering, vol 310. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47669-7_17

Download citation

DOI: https://doi.org/10.1007/978-3-662-47669-7_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47668-0
Online ISBN: 978-3-662-47669-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics