Speaker Recognition Using a Binary Representation and Specificities Models

  • Gabriel Hernández-Sierra
  • Jean-François Bonastre
  • José Ramón Calvo de Lara
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7441)

Abstract

State-of-the-art speaker recognition methods are mainly based on the GMM/UBM supervector paradigm. Recently, a simple representation of speech based on local binary decisions taken on each acoustic frame has been proposed, allowing a speech excerpt to be represented as a binary matrix. This article follows a similar approach. It proposes a new temporal block representation of the binary-transformed data, as well as three simple algorithms for obtaining an efficient similarity measure. The experimental results show that the proposed approach is more robust than classical approaches, with similar or better overall performance.
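
As a rough illustration of the general idea only, the sketch below turns a sequence of acoustic frames into a binary matrix, sums that matrix over temporal blocks, and compares two blocks. It is not the authors' algorithm: the abstract does not specify the binary decision rule, the block construction, or any of the three similarity algorithms. The reference points (`anchors`), the nearest-`top_k` decision rule, the `block_size` parameter, the cosine measure, and all function names are assumptions made for this example.

```python
import math


def binarize_frame(frame, anchors, top_k):
    """Assumed decision rule: set bit i to 1 if anchor i is among the
    top_k anchors closest (Euclidean distance) to this frame."""
    dists = [math.dist(frame, a) for a in anchors]
    nearest = sorted(range(len(anchors)), key=lambda i: dists[i])[:top_k]
    bits = [0] * len(anchors)
    for i in nearest:
        bits[i] = 1
    return bits


def binary_matrix(frames, anchors, top_k=1):
    """One binary row per acoustic frame (frames x anchors)."""
    return [binarize_frame(f, anchors, top_k) for f in frames]


def block_accumulate(matrix, block_size):
    """Sum the binary rows of each block of consecutive frames,
    giving one count vector per temporal block."""
    blocks = []
    for start in range(0, len(matrix), block_size):
        chunk = matrix[start:start + block_size]
        blocks.append([sum(col) for col in zip(*chunk)])
    return blocks


def cosine(u, v):
    """Cosine similarity, used here as a stand-in similarity measure."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Toy 2-D "frames" and reference points, invented for the example.
anchors = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]
frames = [(0.1, 0.0), (0.9, 1.1), (1.9, 0.1), (0.0, 0.2)]

matrix = binary_matrix(frames, anchors, top_k=1)
blocks = block_accumulate(matrix, block_size=2)
score = cosine(blocks[0], blocks[1])
```

Summing within blocks keeps some temporal locality that a single sum over the whole excerpt would discard. Whether this matches the block representation proposed in the paper cannot be determined from the abstract alone.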

Keywords

speaker recognition, binary values, accumulative vector

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Gabriel Hernández-Sierra (1, 2)
  • Jean-François Bonastre (2)
  • José Ramón Calvo de Lara (1)

  1. Advanced Technologies Application Center, Havana, Cuba
  2. LIA, University of Avignon, France
