Gaussian Selection for Speaker Recognition Using Cumulative Vectors

Reyes Díaz, Flavio J.; Calvo de Lara, José Ramón; Hernández Sierra, Gabriel

doi:10.1007/978-3-642-33275-3_89

Gaussian Selection for Speaker Recognition Using Cumulative Vectors

Flavio J. Reyes Díaz¹⁹,
José Ramón Calvo de Lara¹⁹ &
Gabriel Hernández Sierra¹⁹

Conference paper

4380 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7441))

Abstract

Speaker recognition systems frequently use GMM - MAP method for modeling speakers. This method represents a speaker using a Gaussian mixture. However in this mixture not all the Gaussian components are truly representative of the speaker. In order to remove the model redundancy, this work proposes a Gaussian selection method to achieve a new GMM model only with the more representative Gaussian components. Speaker verification experiments applying the proposal show a similar performance to baseline; however the speaker models have a reduction of 80 % regarding the speaker model used for baseline. The application of this Gaussian selection method in real or embedded speaker verification systems could be very useful for reducing computational and memory cost.

Download to read the full chapter text

Chapter PDF

References

Reynolds, D.A.: Speaker identification and verification using Gaussian mixture speaker models. Speech Communication 17(1), 91–108 (1995)
Article Google Scholar
Reynolds, D.A., Quatieri, T., Dunn, R.: Speaker verification using adapted gaussian mixture models. Digital Signal Processing 10(1), 19–41 (2000)
Article Google Scholar
Saeidi, R., Sadegh Mohammadi, H.R., Ganchev, T., Rodman, R.D.: Particle Swarm Optimization for Sorted Adapted Gaussian Mixture Models. IEEE Trans. on Audio, Speech, and Language Processing 17(2), 344–353 (2009)
Article Google Scholar
Auckenthaler, R., Mason, J.: Gaussian selection applied to text independent speaker verification. In: Proceedings of Speaker Odyssey: the Speaker Recognition Workshop, Crete, Greece, pp. 83–88 (2001)
Google Scholar
Xiang, B., Berger, T.: Efficient text-independent speaker verification with structural Gaussian mixture models and neural network. IEEE Transactions on Speech and Audio Processing, 447–456 (2003)
Google Scholar
Kinnunen, T., Karpov, E., Franti, P.: Real-time speaker identification and verification. IEEE Transaction on Audio, Speech and Language Processing 14(1), 277–288 (2006)
Article Google Scholar
Mohammadi, H.R.S., Saeidi, R.: Efficient implementation of GMM based speaker verification using sorted Gaussian mixture model. In: Proc. EUSIPCO 2006, Florence, Italy (2006)
Google Scholar
Saeidi, R., Kinnunen, T., Mohammadi, H.R.S., Rodman, R., Fränti, P.: Joint frame and gaussian selection for text independent speaker verification. In: IEEE Trans. ICASSP 2010, pp. 4530–4533 (2010)
Google Scholar
Liu, Q., Huang, W., Xu, D., Cai, H., Dai, B.: A fast implementation of factor analysis for speaker verification. In: Interspeech 2010, pp. 1077–1080 (2010)
Google Scholar
Anguera, X., Bonastre, J.F.: A Novel Speaker Binary Key Derived from Anchor Models. In: Proceedings of Interspeech (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Advanced Technologies Application Center, Cuba
Flavio J. Reyes Díaz, José Ramón Calvo de Lara & Gabriel Hernández Sierra

Authors

Flavio J. Reyes Díaz
View author publications
You can also search for this author in PubMed Google Scholar
José Ramón Calvo de Lara
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Hernández Sierra
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Informatica y Sistemas, Universidad de Las Palmas de Gran Canaria, Campus de Tafira, 35017, Las Palmas de Gran Canaria, Spain
Luis Alvarez
Universidad de Buenos Aires, Argentina
Marta Mejail & Julio Jacobo &
Universidad de Las Palmas de Gran Canaria, Spain
Luis Gomez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Reyes Díaz, F.J., Calvo de Lara, J.R., Hernández Sierra, G. (2012). Gaussian Selection for Speaker Recognition Using Cumulative Vectors. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2012. Lecture Notes in Computer Science, vol 7441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33275-3_89

Download citation

DOI: https://doi.org/10.1007/978-3-642-33275-3_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33274-6
Online ISBN: 978-3-642-33275-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)