Probabilistic Prediction for Text-Prompted Speaker Verification Capable of Accepting Spoken Words with the Same Meaning but Different Pronunciations

  • Shota Sakashita
  • Satoshi Takeguchi
  • Kazuya Matsuo
  • Shuichi Kurogi
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9950)

Abstract

We have previously presented a method of probabilistic prediction using GEBI (Gibbs-distribution-based extended Bayesian inference) for flexible text-prompted speaker verification. To make verification more flexible and practical, this paper presents a method that accepts spoken words with the same meaning but different pronunciations. For example, Japanese has more than one pronunciation for some digits, such as /yon/ and /shi/ for 4 and /nana/ and /shichi/ for 7. Speakers usually choose between these pronunciations unintentionally, which is a practical problem when verifying speech that contains digits, such as ID numbers. Under several assumptions, we present a modification of GEBI that handles such words. Through numerical experiments on recorded real speech data, we examine the properties of the method and show its validity and effectiveness.

Keywords

Probabilistic prediction; Text-prompted speaker verification; Gibbs-distribution-based extended Bayesian inference; Words with the same meaning but different pronunciations

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Shota Sakashita (1)
  • Satoshi Takeguchi (1)
  • Kazuya Matsuo (1)
  • Shuichi Kurogi (1), corresponding author
  1. Kyushu Institute of Technology, Kitakyushu, Japan
