On the Extraction of the Valid Speech-Sound by the Merging Algorithm with the Discrete Wavelet Transform

Kim, Jin Ok; Paek, Han Wook; Chung, Chin Hyun; Hwang, Jun; Lee, Woongjae

doi:10.1007/3-540-44860-8_32

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2657)

Included in the following conference series:

International Conference on Computational Science

Abstract

A valid speech-sound block can be classified to provide important information for speech recognition. The classification relies on the multi-resolution analysis (MRA) property of the discrete wavelet transform (DWT), which is used to reduce the computation time of pre-processing for speech recognition. A merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It requires some numerical methods for an adaptive DWT implementation, and it performs unvoiced/voiced classification and denoising. Because the merging algorithm can determine processing parameters that relate to voice only and is independent of system noise, it is useful for extracting valid speech-sounds. The merging algorithm adapts to arbitrary system noise and achieves an excellent denoising SNR (signal-to-noise ratio).
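
The abstract names two generic building blocks, a multi-level DWT (the MRA property) and wavelet-domain denoising. The sketch below illustrates those two ideas only. It is not the authors' merging algorithm, which the abstract does not specify. The Haar wavelet, the soft-threshold rule, the fixed threshold `t`, and all function names are assumptions chosen for illustration.

```python
import math

_S = 1.0 / math.sqrt(2.0)  # orthonormal Haar scaling factor


def haar_dwt(signal):
    """One Haar DWT level: returns (approximation, detail). Length must be even."""
    approx = [(signal[i] + signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * _S for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleaves the reconstructed sample pairs."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * _S)
        out.append((a - d) * _S)
    return out


def wavedec(signal, levels):
    """Multi-level decomposition: returns [cA_n, cD_n, ..., cD_1].

    len(signal) must be divisible by 2**levels.
    """
    approx = list(signal)
    details = []
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        details.append(detail)
    return [approx] + details[::-1]


def waverec(coeffs):
    """Rebuild the signal from the output of wavedec."""
    approx = coeffs[0]
    for detail in coeffs[1:]:
        approx = haar_idwt(approx, detail)
    return approx


def soft_threshold(values, t):
    """Shrink each coefficient toward zero by t; magnitudes below t become 0."""
    return [math.copysign(max(abs(v) - t, 0.0), v) for v in values]


def denoise(signal, levels, t):
    """Soft-threshold the detail bands only, then reconstruct (assumed scheme)."""
    coeffs = wavedec(signal, levels)
    kept = [coeffs[0]] + [soft_threshold(d, t) for d in coeffs[1:]]
    return waverec(kept)
```

Each detail band returned by `wavedec` covers a different frequency range at a different time resolution, which is the property the abstract relies on to locate speech-sounds in both position and frequency. With `t = 0`, `denoise` reduces to decomposition followed by exact reconstruction.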

Author information

Authors and Affiliations

School of Information and Communication Engineering, Sungkyunkwan University, 300, Chunchun-dong, Jangan-gu, Suwon, Kyunggi-do, 440-746, KOREA
Jin Ok Kim
Department of Information and Control Engineering, Kwangwoon University, 447-1, Wolgye-dong, Nowon-gu, Seoul, 139-701, KOREA
Han Wook Paek & Chin Hyun Chung
Division of Information and Communication Engineering, Seoul Women’s University, 126, Kongnung2-dong, Nowon-gu, Seoul, 139-774, KOREA
Jun Hwang & Woongjae Lee

Editor information

Editors and Affiliations

Informatics Institute, Section of Computational Science, University of Amsterdam, Kruislaan 403, 1098 SJ, Amsterdam, The Netherlands
Peter M. A. Sloot
School of Computer Science and Software Engineering, Monash University, Wellington Road, Clayton, VIC, 3800, Australia
David Abramson
Institute for High-Performance Computing and Information Systems, Fontanka emb. 6, St. Petersburg, 191187, Russia
Alexander V. Bogdanov & Yuriy E. Gorbachev
Computer Science Dept., University of Tennessee and Oak Ridge National Laboratory, 1122 Volunteer Blvd., Knoxville, TN, 37996-3450, USA
Jack J. Dongarra
School of Information Technologies, The University of Sydney, CISCO Systems Madsen Building F09, Sydney, NSW, 2006, Australia
Albert Y. Zomaya

About this paper

Cite this paper

Kim, J.O., Paek, H.W., Chung, C.H., Hwang, J., Lee, W. (2003). On the Extraction of the Valid Speech-Sound by the Merging Algorithm with the Discrete Wavelet Transform. In: Sloot, P.M.A., Abramson, D., Bogdanov, A.V., Dongarra, J.J., Zomaya, A.Y., Gorbachev, Y.E. (eds) Computational Science — ICCS 2003. ICCS 2003. Lecture Notes in Computer Science, vol 2657. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44860-8_32

DOI: https://doi.org/10.1007/3-540-44860-8_32
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40194-0
Online ISBN: 978-3-540-44860-0
eBook Packages: Springer Book Archive
