Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem

Savchenko, Lyudmila V.; Savchenko, Andrey V.

doi:10.1007/978-3-642-38847-7_23


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7911)

Included in the following conference series:

International Conference on Nonlinear Speech Processing


Abstract

A phoneme is defined as a fuzzy set of minimal speech units from the model database. On the basis of this definition and the Kullback-Leibler minimum information discrimination principle, a novel phoneme recognition algorithm is developed as an enhancement of the phonetic decoding method. Experimental results are presented for isolated vowel recognition and for word recognition in Russian. They show that the proposed method achieves higher recognition accuracy and reliability than the phonetic decoding method.
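
The abstract gives no formulas, so the following is only a minimal sketch of the two ideas it names, not the authors' algorithm. It assumes that each minimal speech unit is represented by a discrete distribution (for example, a normalized spectrum), that the crisp decision picks the model with the smallest Kullback-Leibler divergence, and that fuzzy membership grades can be obtained by exponentiating the negated divergences. The function names, the `sharpness` parameter, and the toy vowel data are all hypothetical.

```python
import math


def normalize(values):
    """Scale non-negative values so that they sum to 1."""
    total = sum(values)
    return [v / total for v in values]


def kl_divergence(p, q, eps=1e-12):
    """Kullback-Leibler divergence D(p || q) of two discrete distributions."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def decode(observation, models):
    """Crisp decision: the model unit with minimum information discrimination."""
    return min(models, key=lambda name: kl_divergence(observation, models[name]))


def fuzzy_memberships(observation, models, sharpness=1.0):
    """Assumed membership rule: grades proportional to exp(-sharpness * D).

    The grades lie in [0, 1] and sum to 1 over all model units.
    """
    weights = {
        name: math.exp(-sharpness * kl_divergence(observation, model))
        for name, model in models.items()
    }
    total = sum(weights.values())
    return {name: w / total for name, w in weights.items()}


# Hypothetical toy "model database" of three vowels (illustrative numbers only).
models = {
    "a": normalize([8, 3, 1, 1]),
    "o": normalize([3, 8, 2, 1]),
    "u": normalize([1, 3, 8, 2]),
}
observation = normalize([7, 4, 1, 1])

best = decode(observation, models)
memberships = fuzzy_memberships(observation, models)
print(best, memberships)
```

In this sketch the crisp decision and the largest membership grade always agree, because the grade decreases monotonically with the divergence; the fuzzy grades add information only about how close the competing units are.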

Author information

Authors and Affiliations

Nizhniy Novgorod State Linguistic University, Russia
Lyudmila V. Savchenko
National Research University Higher School of Economics, Nizhniy Novgorod, Russia
Andrey V. Savchenko

Editor information

Editors and Affiliations

TCTS Lab, University of Mons, 31, Boulevard Dolez, 7000, Mons, Belgium
Thomas Drugman
TCTS Lab, University of Mons, 31, Boulevard Dolez, 7000, Mons, Belgium
Thierry Dutoit

About this paper

Cite this paper

Savchenko, L.V., Savchenko, A.V. (2013). Fuzzy Phonetic Decoding Method in a Phoneme Recognition Problem. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science, vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_23

DOI: https://doi.org/10.1007/978-3-642-38847-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer Science, Computer Science (R0)
