Impact of Prior Channel Information for Speaker Identification

Vaquero, C.; Scheffer, N.; Karajekar, S.

doi:10.1007/978-3-642-01793-3_46

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5558)

Included in the following conference series:

International Conference on Biometrics

Abstract

Joint factor analysis (JFA) has been very successful in speaker recognition, but its success depends on the choice of development data. In this work, we apply JFA to a very diverse set of recording conditions and conversation modes in the NIST 2008 SRE. We show that channel-matched development data improves Equal Error Rate by about 50% over a Maximum a Posteriori (MAP) system, whereas without such data JFA gives no significant improvement. To make the system robust, we estimate eigenchannels in two ways. First, we estimate the eigenchannels separately for each condition and stack them. Second, we pool all the relevant development data and obtain a single estimate. Both techniques perform well, but the former performs worse with low-dimensional channel subspaces because of the correlation between those subspaces.
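The difference between the stacked and pooled estimates can be illustrated with a minimal sketch. It is not the authors' implementation: JFA normally estimates eigenchannels from GMM supervector statistics with an EM procedure, whereas this sketch assumes one fixed-length vector per recording and substitutes a plain PCA of within-speaker deviations. All function names, the condition labels, and the synthetic data are hypothetical.

```python
import numpy as np


def session_deviations(vectors, speaker_ids):
    """Subtract each speaker's mean so that only session variation remains."""
    vectors = np.asarray(vectors, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    dev = np.empty_like(vectors)
    for spk in np.unique(speaker_ids):
        mask = speaker_ids == spk
        dev[mask] = vectors[mask] - vectors[mask].mean(axis=0)
    return dev


def eigenchannels(deviations, rank):
    """Top `rank` principal directions of the deviations, one per column.

    Assumption: PCA stands in for the EM estimate used in real JFA.
    """
    _, _, vt = np.linalg.svd(deviations, full_matrices=False)
    return vt[:rank].T


def stacked_estimate(dev_sets, rank_per_condition):
    """Estimate a subspace per condition, then stack the columns side by side."""
    blocks = [
        eigenchannels(session_deviations(x, s), rank_per_condition)
        for x, s in dev_sets.values()
    ]
    return np.hstack(blocks)


def pooled_estimate(dev_sets, rank):
    """Pool the data of all conditions and estimate a single subspace."""
    vectors = np.vstack([x for x, _ in dev_sets.values()])
    speakers = np.concatenate([np.asarray(s) for _, s in dev_sets.values()])
    return eigenchannels(session_deviations(vectors, speakers), rank)


def subspace_cosines(u_a, u_b):
    """Cosines of the principal angles between two orthonormal bases."""
    return np.linalg.svd(u_a.T @ u_b, compute_uv=False)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dim, n_speakers, n_sessions = 8, 5, 6
    shared = rng.normal(size=dim)  # channel direction common to both conditions
    dev_sets = {}
    for name in ("telephone", "microphone"):  # hypothetical condition labels
        own = rng.normal(size=dim)  # channel direction specific to one condition
        speakers = np.repeat(np.arange(n_speakers), n_sessions)
        n = speakers.size
        speaker_part = rng.normal(size=(n_speakers, dim))[speakers]
        channel_part = (3.0 * rng.normal(size=(n, 1)) * shared
                        + 2.0 * rng.normal(size=(n, 1)) * own)
        noise = 0.1 * rng.normal(size=(n, dim))
        dev_sets[name] = (speaker_part + channel_part + noise, speakers)

    stacked = stacked_estimate(dev_sets, rank_per_condition=2)
    pooled = pooled_estimate(dev_sets, rank=4)
    print("stacked shape:", stacked.shape, "pooled shape:", pooled.shape)
    print("cosines between the two stacked blocks:",
          subspace_cosines(stacked[:, :2], stacked[:, 2:]))
```

A cosine close to 1 between the per-condition blocks means the stacked matrix spends two columns on nearly the same direction, which is one way to picture the correlation the abstract mentions. The pooled estimate returns orthonormal columns by construction, so it avoids that redundancy at the same total rank.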

Author information

Authors and Affiliations

University of Zaragoza, Maria de Luna 1, 50018, Zaragoza, Spain
C. Vaquero
SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025-3493, USA
C. Vaquero, N. Scheffer & S. Karajekar

Editor information

Editors and Affiliations

Computer Vision Laboratory, Facoltà di Architettura di Alghero, Dipartimento di Architettura e Pianificazione (DAP), Università di Sassari, Palazzo del Pou Salit, Piazza Duomo 6, 07041, Alghero (SS), Italy
Massimo Tistarelli
School of Electronics and Computer Science, University of Southampton, SO17 1BJ, Southampton, UK
Mark S. Nixon

About this paper

Cite this paper

Vaquero, C., Scheffer, N., Karajekar, S. (2009). Impact of Prior Channel Information for Speaker Identification. In: Tistarelli, M., Nixon, M.S. (eds) Advances in Biometrics. ICB 2009. Lecture Notes in Computer Science, vol 5558. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01793-3_46

DOI: https://doi.org/10.1007/978-3-642-01793-3_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01792-6
Online ISBN: 978-3-642-01793-3
eBook Packages: Computer Science, Computer Science (R0)
