An Image-Based System for Spoken-Letter Recognition

Saeed, Khalid; Kozłowski, Marcin

doi:10.1007/978-3-540-45179-2_61

Khalid Saeed⁶ &
Marcin Kozłowski⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2756))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

1408 Accesses
5 Citations

Abstract

A new trial on speech recognition from graphical point of view is introduced. Isolated spoken-letters and color-names words are considered. After recording, the speech signal is processed as an image by Power Spectrum Estimation. For feature extraction, classification and hence recognition, the algorithm of minimal eigenvalues of Toeplitz matrices together with other methods of speech processing and recognition are used. A number of examples on applications and comparisons are presented in the work. The efficiency of the method is very high in the case of the six Polish vowels and English color- names, and the results are encouraging to extend the algorithm to cover more word classes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burr, D.J.: Experiments on Neural Net Recognition of Spoken and Written Text. IEEE Transactions on Acoustic, Speech, and Signal Processing 36 (July 1988)
Google Scholar
MacDonald, J.L., Zucchini, W., Zucchi, W.: Hidden Markov and Other Models for Discrete-Valued Time Series. CRC Press, Boca Raton (1997)
MATH Google Scholar
Saeed, K.: Computer Graphics Analysis: A Criterion for Image Feature Extraction and Recognition. MGV - International Journal on Machine Graphics and Vision 10(2), 185–194 (2001); Institute of Computer Science, Polish Academy of Sciences, Warsaw
Google Scholar
Schafer, R.W., Rabiner, L.R.: System for Automatic Formant Analysis of Voiced Speech. J. Acoust. Soc. Amer. 47 (February 1970)
Google Scholar
Grad, L.: Obrazowa reprezentacja sygnału mowy. Biuletyn IAiR WAT, nr 11, Warsaw (2000)
Google Scholar
Basztura, C.: Modele analizy i procedury w komputerowym rozpoznawaniu głosów. Prace naukowe ITiA Politechniki Wrocławskiej, nr 30, Wrocław (1989)
Google Scholar
Marple, L.S.: Digital Spectral Analysis. Prentice Hall, Englewood Cliffs (1987)
Google Scholar
Saeed, K., Kozłowski, M., Kaczanowski, A.: Metoda do rozpoznawania obrazów akustycznych izolowanych liter mowy, Zeszyty Politechniki Białostockiej, Białystok, I-1/2002, pp. 181–207 (2002) (in Polish)
Google Scholar
Tadeusiewicz, R.: Sygnałmowy. WKiŁ, Warsaw (1988) (in Polish)
Google Scholar
Ingle, V.K., Proakis, J.G.: Digital Signal Processing Using MATLAB. Brooks Cole (July 1999)
Google Scholar
Levinson, N.: The Wiener RMS (Root Mean Square) Error Criterion in Filter Design and Prediction. Journal Math. Phys. 25 (1947)
Google Scholar
Durbin, J.: Efficient Estimation of Parameters in Moving Average Models. Biometrics 46(part 1, 2) (1969)
Google Scholar
Saeed, K.: Experimental Algorithm for Testing The Realization of Transfer Functions. In: Proceedings of the Fourteenth IASTED International Conference, Austria (1995)
Google Scholar
Niedzielski, R.: Kryterium do rozpoznawania znaków maszynowych alfabetu łukowego. MSc Thesis, Ins. Informatyki PB, Białystok (1999)
Google Scholar
Saeed, K., Dardzinska, A.: Language Processing: Word Recognition without Segmentation. JASIST - Journal of the American Society for Information Science and Technology 52(14), 1275–1279 (2001)
Article Google Scholar
Lyons, R.G.: Wprowadzenie do cyfrowego przetwarzania sygnałów. WKiŁ, Warsaw (1999) (in Polish)
Google Scholar
Furui, S.: Digital Speech Processing, Synthesis, and Recognition. Marcel Dekker, Inc., New York (2001)
Google Scholar
Saeed, K., Rybnik, M., Tabędzki, M.: More Results and Applications about the Algorithm of Thinning Images to One-Pixel-width. In: Skarbek, W. (ed.) CAIP 2001. LNCS, vol. 2124, pp. 601–609. Springer, Heidelberg (2001)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computer Science, Bialystok University of Technology, Wiejska 45a, 15-351, Bialystok, Poland
Khalid Saeed
Department of Informatics, Statistical Office in Bialystok, Krakowska 13, 15-959, Bialystok, Poland
Marcin Kozłowski

Authors

Khalid Saeed
View author publications
You can also search for this author in PubMed Google Scholar
Marcin Kozłowski
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Mathematics and Computing Science, University of Groningen, P.O.Box 800, 9700 AV, Groningen, The Netherlands
Nicolai Petkov
Institute of Mathematics and Computing Science, University of Groningen, Blauwborgje 3, 9747 AC, Groningen, The Netherlands
Michel A. Westenberg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saeed, K., Kozłowski, M. (2003). An Image-Based System for Spoken-Letter Recognition. In: Petkov, N., Westenberg, M.A. (eds) Computer Analysis of Images and Patterns. CAIP 2003. Lecture Notes in Computer Science, vol 2756. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45179-2_61

Download citation

DOI: https://doi.org/10.1007/978-3-540-45179-2_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40730-0
Online ISBN: 978-3-540-45179-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics