Skip to main content

A Neural Network Phoneme Classification Based on Wavelet Features

  • Conference paper
Developments in Soft Computing

Part of the book series: Advances in Soft Computing ((AINSC,volume 9))

  • 137 Accesses

Summary

This paper presents the use of discrete wavelet transform for feature extraction of phoneme. Instead of using the conventional wavelet coefficients, energy per sample is calculated in different frequency bands and used as features. Training and test samples of the phonemes were obtained from the TIMIT database from the dialect region DR1 and DR2. Features extracted were updated every 8ms to account for the non-stationary property of the speech signal. For the classification of the phonemes two different classifiers were used based on Linear Discriminant Analysis (LDA) and Multi-Layer Perceptron (MLP). The results obtained show high speaker independent recognition rate by both the classifiers. The recognition rates obtained by using MLP classifier were found to be about 3–10% higher than the LDA for different number offeatures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Beng T. Tan, Minyue Fu, Andrew Spray and Phillip Dermody, “The use of wavelet transform for phoneme recognition”, Proceeding of 4th Int. Conf. of Spoken Language Processing Philadelphia, USA Oct. 3–6 1996, Vol. 4, pp. 2431–2434.

    Google Scholar 

  2. Stéphane Mallat, “A wavelet tour of signal processing”, Academic Press, 1998.

    MATH  Google Scholar 

  3. Sungwook Chang, Y. Kwon and Sung-il Yang, “Speech feature extracted from adaptive wavelet for speech recognition”, Electronic Letters, Vol 134, No. 23, 12th Nov. 1998, pp. 2211–2213.

    Google Scholar 

  4. TIMIT Acoustic-Phonetic Continuous Speech Corpus, National Institute of Standards and Technology, Speech Disc 1–1.1, Oct. 1990, NTIS Order No. PB91–505065.

    Google Scholar 

  5. Olivier Rioul and Martin Vetterli, “Wavelet and signal processing”, IEEE Signal Processing Mag., Oct. 1991, pp. 14–38.

    Google Scholar 

  6. C. J. Long and S. Datta, “Wavelet based feature extraction for phoneme recognition”, Proc. of 4th Int. Conf. of Spoken Language Processing Philadelphia, USA Oct. 3–6 1996, Vol. 1, pp. 264–267.

    Google Scholar 

  7. C. J. Long and S. Datta, “Discriminant wavelet basis construction for speech recognition”, Proc. of 5th Int. Conf. of Spoken Language Processing Sydney, Australia 30th Nov-4th Dec. 1998, Vol. 3, pp. 1047–1049.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Farooq, O., Datta, S. (2001). A Neural Network Phoneme Classification Based on Wavelet Features. In: John, R., Birkenhead, R. (eds) Developments in Soft Computing. Advances in Soft Computing, vol 9. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1829-1_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-7908-1829-1_9

  • Publisher Name: Physica, Heidelberg

  • Print ISBN: 978-3-7908-1361-6

  • Online ISBN: 978-3-7908-1829-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics