Objectives for Discriminative Training

Li, Qi (Peter)

doi:10.1007/978-3-642-23731-7_12

Qi (Peter) Li²

Part of the book series: Signals and Communication Technology ((SCT))

725 Accesses

Abstract

The first step in discriminative training is to define an objective function. In this chapter, the relations among a class of discriminative training objectives is derived and discovered through our theoretical analysis. The objectives selected for our discussion are the minimum classification error (MCE), maximum mutual information (MMI), minimum error rate (MER), and generalized minimum error rate (GMER). The author’s analysis shows that all these objectives can be related to both minimum error rates and maximum a posteriori probability. In theory, the MCE and GMER objectives are more general and flexible than the MMI and MER objectives, and MCE and GMER are beyond the Bayesian decision theory. The results and the analytical methods used in this chapter can help in judging and evaluating discriminative objectives, and in defining new objectives for different tasks and better performances. We note that although our discussions are based on the applications of speaker recognition, the analysis can be further extended to speech recognition tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bahl, L.R., Brown, P.F., de Souza, P.V., and Mercer, R.L.: Maximum mutual information estimation of hidden Markov model parameters for speech recogni- tion, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Tokyo), pp. 49–52 (1986)
Google Scholar
Chou, W.:Discriminant-function-based minimum recognition error rate pattern-recognition approach to speech recognition, Proceedings of the IEEE, vol. 88, pp. 1201–1222, August 2000
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, Second Edition. Wiley, New York (2001)
Google Scholar
Gopalakrishnan, P.S., Kanevsky, D., Nadas, A., Nahamoo, D.: An inequality for rational functions with applications to some statistical estimation problems. IEEE Trans. on Information Theory 37, 107–113 (1991)
Article MATH Google Scholar
Juang, B.-H., Chou, W., Lee, C.-H.: Minimum classification error rate methods for speech recognition. IEEE Trans. on Speech and Audio Process 5, 257–265 (1997)
Article Google Scholar
Juang, B.-H. Katagiri S.: Discriminative learning for minimum error clas- sification. IEEE Transactions on Signal Processing 40, 3043–3054 (1992)
Article MathSciNet MATH Google Scholar
Katagiri, S., Lee, C.-H., and Juang, B.-H., New discriminative algorithm based on the generalized probabilistic descent method, in Proceedings of IEEE Workshop on Neural Network for Signal Processing (Princeton), pp. 299–309, September 1991
Google Scholar
Korkmazskiy, F. and Juang, B.-H., Discriminative adaptation for speaker verification, in Proceedings of Int. Conf. on Spoken Language Processing (Philadelphia), pp. 28–31 1996
Google Scholar
Li, J., Yuan, M., and Lee, C.H., Soft margin estimation of hidden markov model parameters, in Proc. ICSLP, pp. 2422–2425 (2007)
Google Scholar
Li, Q., Discovering relations among discriminative training objectives, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Montreal), p. 2004, May 2004
Google Scholar
Li, Q. and Juang, B.-H., Fast discriminative training for sequential observations with application to speaker identification, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Hong Kong), April 2003
Google Scholar
Li, Q. and Juang, B.-H., A new algorithm for fast discriminative training, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Orlando, FL), May 2002
Google Scholar
Li, Q. Juang B.-H.: Study of a fast discriminative training algorithm for pattern recognition. IEEE Trans. on Neural Networks 17, 1212–1221 (2006)
Article Google Scholar
Ma, C. and Chang, E., Comparison of discriminative training methods for speaker verification, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. I-192–I-195, 2003
Google Scholar
Nadas, A., Nahamoo, D., and Picheny, M. A., On a model-robust training method for speech recognition, IEEE Transactions on Acoust., Speech, Signal Processing, vol. 36, pp. 1432–1436, Sept. 1988
Google Scholar
Normandin, Y., Cardin, R., and Mori, R. D., High-performance connected digit recognition using maximum mutual information estimation, IEEE Trans. on Speech and Audio Processing, vol. 2, pp. 299–311, April 1994
Google Scholar
Reichl, W. and Ruske, G., Discriminant training for continuous speech recog- nition, in Proceedings of Eurospeech, 1995
Google Scholar
Schluter, R. and Macherey,W., Comparison of discriminative training criteria, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 493–497, 1998
Google Scholar
Siohan, O., Rosenberg, A. E., and Parthasarathy, S., Speaker identification using minimum verification error training, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing (Seattle), pp. 109–112, May 1998
Google Scholar
Siohan, O., Rosenberg, A., and Parthasarathy, S., Speaker identification using minimum classification error training, in Proc. IEEE Int. Conf. on Acoustic, Speech, and Signal Process, pp. 109–112, 1998
Google Scholar
Vapnik, V.N.: The nature of statistical learning theory. Springer, NY (1995)
MATH Google Scholar
Yin, Y. and Li, Q., Soft frame margin estimation of Gaussian mixture models for speaker recognition with sparse training data, in ICASSP 2011 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Li Creative Technologies (LcT), Inc, Vreeland Road 30 A, Suite 130, 07932, Florham Park, NJ, USA
Qi (Peter) Li

Authors

Qi (Peter) Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qi (Peter) Li .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Li, Q.(. (2012). Objectives for Discriminative Training. In: Speaker Authentication. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23731-7_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-23731-7_12
Published: 30 September 2011
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23730-0
Online ISBN: 978-3-642-23731-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics