Abstract
The first step in discriminative training is to define an objective function. In this chapter, the relations among a class of discriminative training objectives is derived and discovered through our theoretical analysis. The objectives selected for our discussion are the minimum classification error (MCE), maximum mutual information (MMI), minimum error rate (MER), and generalized minimum error rate (GMER). The author’s analysis shows that all these objectives can be related to both minimum error rates and maximum a posteriori probability. In theory, the MCE and GMER objectives are more general and flexible than the MMI and MER objectives, and MCE and GMER are beyond the Bayesian decision theory. The results and the analytical methods used in this chapter can help in judging and evaluating discriminative objectives, and in defining new objectives for different tasks and better performances. We note that although our discussions are based on the applications of speaker recognition, the analysis can be further extended to speech recognition tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bahl, L.R., Brown, P.F., de Souza, P.V., and Mercer, R.L.: Maximum mutual information estimation of hidden Markov model parameters for speech recogni- tion, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Tokyo), pp. 49–52 (1986)
Chou, W.:Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition, Proceedings of the IEEE, vol. 88, pp. 1201–1222, August 2000
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, Second Edition. Wiley, New York (2001)
Gopalakrishnan, P.S., Kanevsky, D., Nadas, A., Nahamoo, D.: An inequality for rational functions with applications to some statistical estimation problems. IEEE Trans. on Information Theory 37, 107–113 (1991)
Juang, B.-H., Chou, W., Lee, C.-H.: Minimum classification error rate methods for speech recognition. IEEE Trans. on Speech and Audio Process 5, 257–265 (1997)
Juang, B.-H. Katagiri S.: Discriminative learning for minimum error clas- sification. IEEE Transactions on Signal Processing 40, 3043–3054 (1992)
Katagiri, S., Lee, C.-H., and Juang, B.-H., New discriminative algorithm based on the generalized probabilistic descent method, in Proceedings of IEEE Workshop on Neural Network for Signal Processing (Princeton), pp. 299–309, September 1991
Korkmazskiy, F. and Juang, B.-H., Discriminative adaptation for speaker verification, in Proceedings of Int. Conf. on Spoken Language Processing (Philadelphia), pp. 28–31 1996
Li, J., Yuan, M., and Lee, C.H., Soft margin estimation of hidden markov model parameters, in Proc. ICSLP, pp. 2422–2425 (2007)
Li, Q., Discovering relations among discriminative training objectives, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Montreal), p. 2004, May 2004
Li, Q. and Juang, B.-H., Fast discriminative training for sequential observations with application to speaker identification, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Hong Kong), April 2003
Li, Q. and Juang, B.-H., A new algorithm for fast discriminative training, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Orlando, FL), May 2002
Li, Q. Juang B.-H.: Study of a fast discriminative training algorithm for pattern recognition. IEEE Trans. on Neural Networks 17, 1212–1221 (2006)
Ma, C. and Chang, E., Comparison of discriminative training methods for speaker verification, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. I-192–I-195, 2003
Nadas, A., Nahamoo, D., and Picheny, M. A., On a model-robust training method for speech recognition, IEEE Transactions on Acoust., Speech, Signal Processing, vol. 36, pp. 1432–1436, Sept. 1988
Normandin, Y., Cardin, R., and Mori, R. D., High-performance connected digit recognition using maximum mutual information estimation, IEEE Trans. on Speech and Audio Processing, vol. 2, pp. 299–311, April 1994
Reichl, W. and Ruske, G., Discriminant training for continuous speech recog- nition, in Proceedings of Eurospeech, 1995
Schluter, R. and Macherey,W., Comparison of discriminative training criteria, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 493–497, 1998
Siohan, O., Rosenberg, A. E., and Parthasarathy, S., Speaker identification using minimum verification error training, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Seattle), pp. 109–112, May 1998
Siohan, O., Rosenberg, A., and Parthasarathy, S., Speaker identification using minimum classification error training, in Proc. IEEE Int. Conf. on Acoustic, Speech, and Signal Process, pp. 109–112, 1998
Vapnik, V.N.: The nature of statistical learning theory. Springer, NY (1995)
Yin, Y. and Li, Q., Soft frame margin estimation of Gaussian mixture models for speaker recognition with sparse training data, in ICASSP 2011 (2011)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Li, Q.(. (2012). Objectives for Discriminative Training. In: Speaker Authentication. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23731-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-23731-7_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23730-0
Online ISBN: 978-3-642-23731-7
eBook Packages: EngineeringEngineering (R0)