Abstract
Joint factor analysis (JFA) has been very successful in speaker recognition but its success depends on the choice of development data. In this work, we apply JFA to a very diverse set of recording conditions and conversation modes in NIST 2008 SRE, showing that having channel matched development data will give improvements of about 50% in terms of Equal Error Rate against a Maximum a Posteriori (MAP) system, while not having it will not give significant improvement. To provide robustness to the system, we estimate eigenchannels in two ways. First, we estimate the eigenchannels separately for each condition and stack them. Second, we pool all the relevant development data and obtain a single estimate. Both techniques show good performance, but the former leads to lower performance when working with low-dimension channel subspaces, due to the correlation between those subspaces.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Inter-Speaker Variability in Speaker Verification. IEEE Trans. Audio, Speech and Language Processing 16(5), 980–988 (2008)
Vogt, R., Baker, B., Sridharan, S.: Modelling session variability in text-independent speaker verification. In: Ninth European Conference on Speech Communication and Technology, ISCA (2005)
Brümmer, N.: SUN SDV system description for the NIST SRE 2008 evaluation, Montreal, Canada (2008)
JHU: Johns Hopkins University, Summer workshop, Robust Speaker ID, Fast scoring team, Baltimore, MD (2008)
NIST: The NIST year 2005 speaker recognition evaluation plan (April 2004), http://www.nist.gov/speech/tests/spk/2004/SRE-04_evalplan-v1a.pdf
Shriberg, E., Graciarena, M., Bratt, H., Kathol, A., Kajarekar, S., Jameel, H., Richey, C., Goodman, F.: Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification. In: Proceedings of Interspeech, Brisbane, Australia (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vaquero, C., Scheffer, N., Karajekar, S. (2009). Impact of Prior Channel Information for Speaker Identification. In: Tistarelli, M., Nixon, M.S. (eds) Advances in Biometrics. ICB 2009. Lecture Notes in Computer Science, vol 5558. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01793-3_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-01793-3_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01792-6
Online ISBN: 978-3-642-01793-3
eBook Packages: Computer ScienceComputer Science (R0)