A Continuous Unsupervised Adaptation Method For Speaker Verification

  • Alexandre Preti
  • Jean-François Bonastre
  • François Capman

Abstract

This paper addresses unsupervised model adaptation for speaker verification. We propose a new method that updates speaker models using all test data presented to the system. The adaptation is continuous and relies on the probability that the test trial belongs to the target speaker. The scheme is evaluated within the framework of the NIST SRE 2005. In the NIST unsupervised adaptation mode, it yields relative improvements of 15% in detection cost function (DCF) and 35% in equal error rate (EER).
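The abstract does not give the update equations, so the following is only a minimal sketch of what a probability-weighted model update could look like, not the authors' formulation. It assumes the verification score is a log-likelihood ratio, that a Bayes-rule sigmoid turns it into a target probability, and that this probability scales the contribution of the trial to a running mean. The names `posterior_target` and `adapt_mean`, the `prior_target` parameter, and the count-based update are all hypothetical.

```python
import math


def posterior_target(llr, prior_target=0.5):
    """Map a log-likelihood-ratio score to P(target | score).

    Assumption: the score is a calibrated log-likelihood ratio, so the
    posterior is a sigmoid of the score plus the prior log-odds.
    """
    log_odds = llr + math.log(prior_target / (1.0 - prior_target))
    # Numerically stable sigmoid for large positive or negative log-odds.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


def adapt_mean(mean, count, trial_mean, trial_frames, confidence):
    """Confidence-weighted running update of one mean vector.

    Assumption: the trial contributes `confidence * trial_frames`
    effective frames, so a trial with confidence 0 leaves the model
    unchanged and a trial with confidence 1 counts in full.
    """
    weighted = confidence * trial_frames
    new_count = count + weighted
    if new_count == 0:
        return list(mean), count
    new_mean = [
        (count * m + weighted * t) / new_count
        for m, t in zip(mean, trial_mean)
    ]
    return new_mean, new_count


# Hypothetical usage: every test trial updates the model, weighted by
# how likely it is to come from the target speaker.
model_mean, model_count = [0.0, 0.0], 100.0
for llr, trial_mean, frames in [(2.0, [1.0, 1.0], 50), (-3.0, [5.0, 5.0], 50)]:
    conf = posterior_target(llr)
    model_mean, model_count = adapt_mean(
        model_mean, model_count, trial_mean, frames, conf
    )
```

In this sketch no trial is discarded by a hard threshold; a low-scoring trial simply moves the model very little, which is one way to read "continuous" adaptation.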

Copyright information

© Springer 2007

Authors and Affiliations

  • Alexandre Preti
    • 1
  • Jean-François Bonastre
    • 1
  • François Capman
    • 2
  1. LIA, 339 chemin des Meinajaries, France
  2. Thales, MMP Laboratory, 160 Bd Valmy, France
