Abstract
The concern in this study is the approach to evaluating the performance of the open-set speaker identification process. In essence, such a process involves first identifying the speaker model in the database that best matches the given test utterance, and then determining if the test utterance has actually been produced by the speaker associated with the best-matched model. Whilst, conventionally, the performance of each of these two sub-processes is evaluated independently, it is argued that the use of a measure of performance for the complete process can provide a more useful basis for comparing the effectiveness of different systems. Based on this argument, an approach to assessing the performance of open-set speaker identification is considered in this paper, which is in principle similar to the method used for computing the diarisation error rate. The paper details the above approach for assessing the performance of open-set speaker identification and presents an analysis of its characteristics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pillay, S., Ariyaeeinia, A., Sivakumaran, P., Pawlewski, M.: Open-Set Speaker Identification under Mismatch Conditions. In: Proc. 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), pp. 2347–2350 (2009)
Ariyaeeinia, A., Fortuna, J., Sivakumaran, P., Malegaonkar, A.: Verification Effectiveness in Open-Set Speaker Identification. IEE Proceedings Vision, Image and Signal Processing 153(5), 618–624 (2006)
Fortuna, J., et al.: Relative effectiveness of score normalisation methods in open-set speaker identification. In: Proc. the Speaker and Language Recognition Workshop (Odyssey), pp. 369–376 (2004)
Singer, E., Reynolds, D.: Analysis of multi-target detection for speaker and language recognition. In: Proc. the Speaker and Language Recognition Workshop (Odyssey), pp. 301–308 (2004)
Anguera Miró, X.: PhD Thesis, Speech Processing Group, Department of Signal Theory and Communications, Universitat Politècnica de Catalunya (2006), http://www.xavieranguera.com/phdthesis/node108.html
Reynolds, D., et al.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(1-3), 19–41 (2000)
Fortuna, J., et al.: Open set speaker identification using adapted Gaussian mixture models. In: Proc. Interspeech, pp. 1997–2000 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Malegaonkar, A., Ariyaeeinia, A. (2011). Performance Evaluation in Open-Set Speaker Identification. In: Vielhauer, C., Dittmann, J., Drygajlo, A., Juul, N.C., Fairhurst, M.C. (eds) Biometrics and ID Management. BioID 2011. Lecture Notes in Computer Science, vol 6583. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19530-3_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-19530-3_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19529-7
Online ISBN: 978-3-642-19530-3
eBook Packages: Computer ScienceComputer Science (R0)