Skip to main content

Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem

  • Conference paper
Advances in Nonlinear Speech Processing (NOLISP 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7911))

Included in the following conference series:

Abstract

The definition of a phoneme as a fuzzy set of minimal speech units from the model database is proposed. On the basis of this definition and the Kullback-Leibler minimum information discrimination principle the novel phoneme recognition algorithm has been developed as an enhancement of the phonetic decoding method. The experimental results in the problems of isolated vowels recognition and word recognition in Russian are presented. It is shown that the proposed method is characterized by the increase of recognition accuracy and reliability in comparison with the phonetic decoding method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 72.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benesty, J., Sondh, M., Huang, Y. (eds.): Springer Handbook of Speech Recognition. Springer (2008)

    Google Scholar 

  2. Ramirez, J., Segura, J.C., Benitez, C., de la Torre, A., Rubio, A.J.: A new Kullback-Leibler VAD for speech recognition in noise. IEEE Signal Processing Letters 11(2), 266–269 (2004)

    Article  Google Scholar 

  3. Gruhn, R., Raab, M., Brueckner, R.: US Patent â„–8301445, Speech Recognition Based on a Multilingual Acoustic Model. Nuance Communications, Inc., Assignee (2012)

    Google Scholar 

  4. Kullback, S.: Information Theory and Statistics. Dover Pub. (1997)

    Google Scholar 

  5. Savchenko, V.V.: The Method of Words Phonetic Decoding in Automatic Speech Recognition Problem Using the Minimum Information Discrimination Principle, Izvestia vuzov Rossii. Radioelectronika 5, 31–41 (2009) (in Russian)

    Google Scholar 

  6. Qiao, Y., Shimomura, N., Minematsu, N.: Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3989–3992 (2008)

    Google Scholar 

  7. Rasipuram, R., Magimai-Doss, M.: Improving Articulatory Feature and Phoneme Recognition Using Multitask Learning. In: Honkela, T. (ed.) ICANN 2011, Part I. LNCS, vol. 6791, pp. 299–306. Springer, Heidelberg (2011)

    Google Scholar 

  8. Zadeh, L.A.: Fuzzy Sets. Information Control 8, 338–353 (1965)

    Article  MathSciNet  MATH  Google Scholar 

  9. Marple -Jr., S.L.: Digital Spectral Analysis: With Applications. Prentice-Hall Series in Signal Processing (1989)

    Google Scholar 

  10. Hill, J.E.: The Minimum of n Independent Normal Distributions, http://www.untruth.org/~josh/math/normal-min.pdf

  11. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press (1981)

    Google Scholar 

  12. Koutroumbas, K., Theodoridis, S.: Pattern Recognition, 4th edn. Elsevier Inc. (2008)

    Google Scholar 

  13. Ronzhin, A.L., Yusupov, R.M., Li, I.V., Leontieva, A.B.: Survey of Russian Speech Recognition Systems. In: SPECOM 2006, pp. 54–60 (2006)

    Google Scholar 

  14. Reddy, D.R.: Speech recognition by Machine: A Review. Proceedings of the IEEE 64(4), 501–531 (1976)

    Article  Google Scholar 

  15. Jensen, R., Cornelis, C.: Fuzzy-rough nearest neighbour classification and prediction. Theoretical Computer Science 412(42), 5871–5884 (2011)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Savchenko, L.V., Savchenko, A.V. (2013). Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38847-7_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38846-0

  • Online ISBN: 978-3-642-38847-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics