Probabilistic Prediction for Text-Prompted Speaker Verification Capable of Accepting Spoken Words with the Same Meaning but Different Pronunciations
So far, we have presented a method of probabilistic prediction using GEBI (Gibbs-distribution based Bayesian inference) for flexible text-prompted speaker verification. For more flexible and practical verification, this paper presents a method of verification capable of accepting spoken words with the same meaning but different pronunciations. For example, Japanese language has different pronunciations for a digit, such as /yon/ and /shi/ for 4, /nana/ and /shichi/ for 7, which are usually uttered via unintentional selection, and then it is a practical problem in speech verification of words involving digits, such as ID numbers. With several assumptions, we present a modification of GEBI for dealing with such words. By means of numerical experiments using recorded real speech data, we examine the properties of the present method and show the validity and the effectiveness.
KeywordsProbabilistic prediction Text-prompted speaker verification Gibbs-distribution-based extended Bayesian inference Words with the same meaning but different pronunciations
- 1.Kurogi, S., Sakashita, S., Takeguchi, S., Ueki, T., Matsuo, K.: Probabilistic prediction in multiclass classification derived for flexible text-prompted speaker verification. In: Arik, S., Huang, T., Lai, W.K., Liu, Q. (eds.) ICONIP 2015. LNCS, vol. 9489, pp. 216–225. Springer, Heidelberg (2015). doi: 10.1007/978-3-319-26532-2_24 CrossRefGoogle Scholar
- 4.Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proceedings of the SCI 2004, vol. V, pp. 24–28 (2004)Google Scholar
- 5.Kurogi, S., Mineishi, S., Sato, S.: An analysis of speaker recognition using bagging CAN2 and pole distribution of speech signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds.) ICONIP 2010. LNCS, vol. 6443, pp. 363–370. Springer, Heidelberg (2010). doi: 10.1007/978-3-642-17537-4_45 CrossRefGoogle Scholar
- 7.Kurogi, S., Ueki, T., Takeguchi, S., Mizobe, Y.: Properties of text-prompted multistep speaker verification using gibbs-distribution-based extended Bayesian inference for rejecting unregistered speakers. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds.) ICONIP 2014, Part II. LNCS, vol. 8835, pp. 35–43. Springer, Heidelberg (2014)Google Scholar