Abstract
Speaker recognition systems frequently use GMM - MAP method for modeling speakers. This method represents a speaker using a Gaussian mixture. However in this mixture not all the Gaussian components are truly representative of the speaker. In order to remove the model redundancy, this work proposes a Gaussian selection method to achieve a new GMM model only with the more representative Gaussian components. Speaker verification experiments applying the proposal show a similar performance to baseline; however the speaker models have a reduction of 80 % regarding the speaker model used for baseline. The application of this Gaussian selection method in real or embedded speaker verification systems could be very useful for reducing computational and memory cost.
Chapter PDF
References
Reynolds, D.A.: Speaker identification and verification using Gaussian mixture speaker models. Speech Communication 17(1), 91–108 (1995)
Reynolds, D.A., Quatieri, T., Dunn, R.: Speaker verification using adapted gaussian mixture models. Digital Signal Processing 10(1), 19–41 (2000)
Saeidi, R., Sadegh Mohammadi, H.R., Ganchev, T., Rodman, R.D.: Particle Swarm Optimization for Sorted Adapted Gaussian Mixture Models. IEEE Trans. on Audio, Speech, and Language Processing 17(2), 344–353 (2009)
Auckenthaler, R., Mason, J.: Gaussian selection applied to text independent speaker verification. In: Proceedings of Speaker Odyssey: the Speaker Recognition Workshop, Crete, Greece, pp. 83–88 (2001)
Xiang, B., Berger, T.: Efficient text-independent speaker verification with structural Gaussian mixture models and neural network. IEEE Transactions on Speech and Audio Processing, 447–456 (2003)
Kinnunen, T., Karpov, E., Franti, P.: Real-time speaker identification and verification. IEEE Transaction on Audio, Speech and Language Processing 14(1), 277–288 (2006)
Mohammadi, H.R.S., Saeidi, R.: Efficient implementation of GMM based speaker verification using sorted Gaussian mixture model. In: Proc. EUSIPCO 2006, Florence, Italy (2006)
Saeidi, R., Kinnunen, T., Mohammadi, H.R.S., Rodman, R., Fränti, P.: Joint frame and gaussian selection for text independent speaker verification. In: IEEE Trans. ICASSP 2010, pp. 4530–4533 (2010)
Liu, Q., Huang, W., Xu, D., Cai, H., Dai, B.: A fast implementation of factor analysis for speaker verification. In: Interspeech 2010, pp. 1077–1080 (2010)
Anguera, X., Bonastre, J.F.: A Novel Speaker Binary Key Derived from Anchor Models. In: Proceedings of Interspeech (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Reyes Díaz, F.J., Calvo de Lara, J.R., Hernández Sierra, G. (2012). Gaussian Selection for Speaker Recognition Using Cumulative Vectors. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2012. Lecture Notes in Computer Science, vol 7441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33275-3_89
Download citation
DOI: https://doi.org/10.1007/978-3-642-33275-3_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33274-6
Online ISBN: 978-3-642-33275-3
eBook Packages: Computer ScienceComputer Science (R0)