U-NORM Likelihood Normalization in PIN-Based Speaker Verification Systems
This paper present a new likelihood normalization technique, entitled U-NORM, for speaker recognition systems based on short utterances. A comparison between this new approach and the widely used Z-NORM is reported and evaluated. Phonetic dependency between the speaker model and the test speech utterances is determined as the main impediment for a good performance of Z-NORM technique. A set of experiments are developed on a specifically acquired PIN-oriented real-users database showing the higher performance of the new technique for PIN based security applications. U-NORM provides a common likelihood scale for all system users allowing speaker independent thresholds that simplify the enrollment process and add robustness to PIN based security applications.
Unable to display preview. Download preview PDF.
- Douglas A. Reynolds et al., “Speaker Verification using Adapted Gaussian Mixture Models, Digital Signal Processing”, vol. 10, pp. 19–41 (2000).Google Scholar
- J.-B. Pierrot et al., “A Comparison of a Priori Threshold setting Procedures for Sperker Verification in the CAVE Project”, ICASSP’98.Google Scholar
- D. García-Romero et al., “ATVS-UPM Results and Presentation at NIST’2002 Speaker Recognition Evaluation”, Vienna, VA, 2002.Google Scholar