Dynamic Neural Network Model of Speech Perception

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 924)

Abstract

Neurobiological research indicates that the spatial organization of the somatosensory cortex, with its linear or planar topology, appears to underlie the internal representation of the environment. This paper examines the feasibility of constructing self-organizing feature maps (SOFMs) suitable for modeling speech perception. The objective was to construct a class of dynamic SOFMs that can extract the time–amplitude and time–frequency features of the phonemes that make up words. Two approaches are presented. The first is based on constructing time-based embedding maps. The second constructs a dynamic SOFM that uses the Gabor transform as its transfer function, which reveals the time–frequency features of the speech sounds. The results may be useful in speech recognition applications.
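The two feature types named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names, the 200 Hz test tone, and all parameter values (`dim`, `tau`, `sigma`, the 8 kHz sampling rate) are assumptions chosen for illustration, and the SOFM itself is omitted. The first function builds time-delay embedding vectors (time–amplitude features); the second evaluates one Gaussian-windowed Fourier term, i.e. a single Gabor coefficient (time–frequency feature).

```python
import cmath
import math


def delay_embed(signal, dim=3, tau=5):
    """Return delay vectors [x(t), x(t+tau), ..., x(t+(dim-1)*tau)]."""
    span = (dim - 1) * tau
    if len(signal) <= span:
        raise ValueError("signal too short for this dim and tau")
    return [
        [signal[t + k * tau] for k in range(dim)]
        for t in range(len(signal) - span)
    ]


def gabor_coefficient(signal, center, freq, sigma, sample_rate):
    """Gaussian-windowed Fourier term at one (time, frequency) point."""
    total = 0j
    for n, x in enumerate(signal):
        t = n / sample_rate
        # Gaussian window centred at `center` seconds, width `sigma` seconds.
        window = math.exp(-((t - center) ** 2) / (2 * sigma ** 2))
        total += x * window * cmath.exp(-2j * math.pi * freq * t)
    # Divide by the sampling rate so the sum approximates the integral.
    return total / sample_rate


# Toy stand-in for a voiced sound: a 200 Hz tone at 8 kHz (assumed values).
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 200 * n / SAMPLE_RATE) for n in range(800)]

# Time-amplitude features: 3-dimensional delay vectors with a lag of 10 samples.
vectors = delay_embed(tone, dim=3, tau=10)

# Time-frequency features: Gabor magnitude at the tone frequency and far from it.
mag_200 = abs(gabor_coefficient(tone, 0.05, 200, 0.01, SAMPLE_RATE))
mag_900 = abs(gabor_coefficient(tone, 0.05, 900, 0.01, SAMPLE_RATE))
```

In a setup of this kind, either the delay vectors or a grid of Gabor magnitudes would serve as the input vectors presented to a self-organizing map; how the paper couples them to the map is not stated in the abstract.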

Correspondence to Marius Crisan.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

Cite this paper

Crisan, M. (2019). Dynamic Neural Network Model of Speech Perception. In: Bhatia, S., Tiwari, S., Mishra, K., Trivedi, M. (eds) Advances in Computer Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 924. Springer, Singapore. https://doi.org/10.1007/978-981-13-6861-5_32
