Toward Simulated Audition in Open Environments

Port, Robert F.; Anderson, Sven E.; McAuley, J. Devin

doi:10.1007/978-1-4615-1919-5_4

Robert F. Port⁴,
Sven E. Anderson⁴ &
J. Devin McAuley⁴

110 Accesses

Abstract

This paper considers the problem of hearing in the broadest possible terms. What kind of information is available in sound about events in the environment? What kind of cognitive mechanisms could extract this information? To address such global problems, we think it is essential to construct simulations of auditory processing, but to evaluate progress, real-world audition is difficult to deal with. To test the simulations, it is useful to develop artificial auditory environments that are simplified, yet retain certain critical properties of natural auditory environments, such as the property of openness. In this paper, we shall schematize the general properties of auditory environments and some major classes of information about time that are relevant for perception. Our theme is that to understand auditory cognition, we not only need to understand auditory processing mechanisms, we also need a clear idea of the problems that must be solved by an auditory system. If the task can be clearly defined, some hypotheses can be formulated about the global dynamic properties of neural systems that are potentially capable of performing these tasks. These systems may then serve as a starting point for developing models of neural mechanisms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Anderson, S., 1992, Self-organization of auditory motion detectors, in: “Proceedings of the Fourteenth Annual Conference of the Cognitive Science Society”, Lawrence Erlbaum Associates., Hillsdale, NJ.
Google Scholar
Anderson, S., 1994, “A Computational Model of Auditory Pattern Recognition”, PhD thesis, Technical Report No. 112, Cognitive Science Program, Indiana University, Bloomington, IN.
Google Scholar
Baddeley, A., 1992, Working memory. Science, 255:556.
Article PubMed CAS Google Scholar
Barlow, H. B., and Levick, W. R., 1965, The mechanism of directionally selective units in a rabbit’s retina. J. Physiol., 173:477.
Google Scholar
Bregman, A. S., 1990. “Auditory Scene Analysis: The Perceptual Organization of Sound”, Bradford Books, MIT Press, Cambridge, MA.
Google Scholar
Carpenter, G., and Grossberg, S., 1987, A massively parallel architecture for a self-organizing neural pattern recognition machine. Computer Vision, Graphics and Image Processing, 37:54.
Article Google Scholar
Crowder, R., and Morton, J., 1969, Precategorical acoustic storage. Percept. Psychophys., 5:365.
Article Google Scholar
Delgutte, B., and Kiang, N. ,1984, Speech coding in the auditory nerve: I. Vowel-like sounds. J. Acoust. Soc. Am., 75:866.
Article PubMed CAS Google Scholar
Drake, C., and Botte, M.-C., 1993, Tempo sensitivity in auditory sequences: Evidence for a multiple-look model. Percept. Psychophys., 54:277.
Article PubMed CAS Google Scholar
Gaver, W. W., 1993, What in the world do we hear? An ecological approach to auditory perception. Ecol. Psychol., 5:1.
Article Google Scholar
Gibson, J. J., 1968, “The Senses Considered as Perceptual Systems”, Harcourt Brace, New York, NY.
Google Scholar
Govindarajan, K. K., Grossberg, S., Wyse, L. L., and Cohen, M. A., 1994, A neural network model of auditory scene analysis and source segregation, Technical Report CAS/CNS-TR-94-039, Center for Adaptive Systems, Boston University, Boston, MA.
Google Scholar
Green, D., 1976, “An Introduction to Hearing”, Lawrence Erlbaum Associates, Hillsdale, NJ.
Google Scholar
Grossberg, S., 1976, Adaptive pattern classification and universal recoding. Biol. Cybern., 23:121.
Article PubMed CAS Google Scholar
Grossberg, S., 1986, The adaptive self-organization of serial order in behavior: Speech language, and motor control, in: “Pattern Recognition by Humans and Machines: Speech Perception”, E. Schwab, and H. Nusbaum, eds., Academic Press, Orlando, FL.
Google Scholar
Grossberg, S., 1987, Competitive learning: From interactive activation to adaptive resonance. Cognitive Sci., 11:23.
Article Google Scholar
Grossberg, S., and Rudd, M. E. (1992). Cortical dynamics of visual motion perception: short-range and long-range apparent motion. Psychol. Rev., 99:78.
Article PubMed CAS Google Scholar
Handel, S., 1989, “Listening: An Introduction to the Perception of Auditory Events”, Bradford Books/MIT Press, Cambridge, MA.
Google Scholar
Handel, S., 1993, The effect of tempo and tone duration on rhythm discrimination. Percept. Psychophys., 54:370.
Article PubMed CAS Google Scholar
Hermansky, H., 1990, Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am., 87:1738.
Article PubMed CAS Google Scholar
Large, E. W., 1994, “Dynamic Representation of Musical Structure”, Ph.D. Thesis, The Ohio State University, Columbus, OH.
Google Scholar
Large, E. W., and Kolen, J. F., 1994, Resonance and the perception of musical meter. Connection Sci., 6:177.
Article Google Scholar
Lesser, V. R., Fennel, R. D., Erman, L. D., and Reddy, D. R., 1975, Organization of the Hearsay-II speech understanding system. International Conference on Acoustics, Speech, and Signal Processing, 23:11.
Article Google Scholar
Levitt, H., 1971, Transformed up-down methods in psychoacoustics. J. Acoust. Soc. Am., 49:467.
Article PubMed Google Scholar
Marshall, J., 1990, Self-organizing neural networks for perception of visual motion. Neural Networks, 3:45.
Article Google Scholar
Marshall, J., 1995, Adaptive perceptual pattern recognition by self-organizing neural networks: Context, uncertainty, multiplicity, and scale. Neural Networks, in press.
Google Scholar
Massaro, D. W., 1972, Perceptual images, processing time, and perceptual units in auditory perception. Psychol. Rev., 79:124.
Article PubMed CAS Google Scholar
McAuley, J. D., 1993, Learning to perceive and produce rhythmic patterns in an artificial neural network., Technical Report 371, Computer Science Department, Indiana University, Bloomington, IN.
Google Scholar
McAuley, J. D., 1994a, Finding metrical structure in time. in: “Proceedings of the 1993 Connectionist Models Summer School”, M.C. Mozer, P. Smolensky, D.S. Touretzky, J.L. Elman, and A.S. Weigend, eds, Lawrence Erlbaum Associates, Hillsdale, NJ.
Google Scholar
McAuley, J. D., 1994b, Time as phase: A dynamic model of time perception. in: “Proceedings of the Sixteenth Annual Meeting of the Cognitive Science Society”, Lawrence Erlbaum Associates, Hillsdale, NJ.
Google Scholar
McAuley, J. D., 1995, “On the Perception of Time as Phase: Toward an Adaptive Oscillator Model of Rhythm”. Ph.D. Thesis, Cognitive Science Technical Report, TR-137, Indiana University, Bloomington, IN.
Google Scholar
Moore, B. C. J., 1989, “An Introduction to Psychology of Hearing”, third edition, Harcourt Brace Jovanovich, New York, NY.
Google Scholar
Patterson, R. and Holdsworth, J., 1990, “An Introduction to Auditory Sensation Processing”. MRC Applied Psychology Unit, Cambridge, UK.
Google Scholar
Port, R., 1986, Invariance in phonetics, in: “Invariance and Variability in Speech Processes”, J. Perkell, and D Klatt., eds, Lawrence Erlbaum Associates, Hillsdale, NJ.
Google Scholar
Port, R., and Cummins, F., 1992, The English voicing contrast as velocity perturbation, in: “Proceedings of the 1992 International Conference on Spoken Language Processing”, J. Ohala, T. Nearey, B Derwing, M. Hodge, and G. Wiebe, eds, University of Alberta, Edmunton.
Google Scholar
Port, R., Cummins, F., and Gasser, M., 1995, A dynamical approach to rhythm in language: Toward a phonology of time. in: “Proceedings of the Chicago Linguistic Society”, Chicago Linguistic Society, Chicago, IL (in press).
Google Scholar
Port, R., Cummins, F., and McAuley, J. D., 1995, Naive time, temporal patterns and human audition, in: “Mind as Motion: Explorations in the Dynamics of Cognition”, R. Port, and T. van Gelder, eds, MIT Press, Cambridge, MA.
Google Scholar
Port, R. and Dalby, J., 1982, C/V ratio as a cue for voicing in English. J. Acoust. Soc. Am., 69:262.
Article Google Scholar
Port, R. F., 1990, Representation and recognition of temporal patterns. Connection Sci., 2:151..
Article Google Scholar
Port, R.F., Dalby, J., and O’Dell, M., 1987, Evidence for mora timing in Japanese. J. Acoust. Soc. Am., 81:1574.
Article PubMed CAS Google Scholar
Povel, D.-J. and Essens, P., 1985, Perception of temporal patterns. Music Perception, 2:411.
Article Google Scholar
Sankoff, D. and Kruskal, J. B., eds, 1983. “Time Warps, String Edits and Macromolecules: The Theory and Practice of Sequence Comparison”, Addison-Wesley, Reading, MA.
Google Scholar
Shamma, S. A., 1985a, Speech processing in the auditory system I. The representation of speech sounds in the responses of the auditory nerve. J. Acoust. Soc. Am., 78:1612.
Article PubMed CAS Google Scholar
Shamma, S. A., 1985b, Speech processing in the auditory system II. Lateral inhibition and the central processing of speech evoked activity in the auditory nerve. J. Acoust. Soc. Am., 78:1622.
Article PubMed CAS Google Scholar
Smythe, E., 1987, The detection of formant transitions in a connectionist network, in: “Proceedings of the First IEEE International Conference on Neural Networks”, San Diego, CA.
Google Scholar
Smythe, E. J., 1988, Temporal computation in connectionist models. Technical Report 251, Indiana University, Computer Science Department, Indiana University, Bloomington, IN.
Google Scholar
Torras, C., 1985, “Temporal-Pattern Learning in Neural Models”, Springer Verlag, Berlin.
Google Scholar
Vercoe, B. L., 1986, C-sound. Technical report, Experimental Music Studio, Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA.
Google Scholar
Watson, C. S. and Nichols, T. S., 1976, Detectability of auditory signals presented without defined observation intervals. J. Acoust. Soci. Am., 59:655.
Article CAS Google Scholar
Whitfield, I. C. and Evans, E. F., 1965, Responses of auditory cortical neurons to stimuli of changing frequency. J. Neurophysiol., 28:655.
PubMed CAS Google Scholar
Wilson, M. A., Bhalla, U. S., Uhley, J. D., and Bower, J. M., 1989, GENESIS: a system for simulating neural networks, in: “Advances in Neural Information Processing Systems I”, D.S. Touretzky, ed, Morgan Kaufmann, San Mateo, CA.
Google Scholar
Yost, W., 1991, Auditory image perception and analysis: the basis for hearing. Hearing Res., 56:244.
Article Google Scholar
Yost, W., 1992, Auditory perception and sound source determination. Psychol. Sci., 1:179.
Google Scholar
Yost, W. A. and Watson, C. S., eds , 1987, “Auditory Processing of Complex Sounds”, Lawrence Erlbaum Associates, Hillsdale, NJ.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Program in Cognitive Science, Indiana University, Bloomington, IN, 47405, USA
Robert F. Port, Sven E. Anderson & J. Devin McAuley

Authors

Robert F. Port
View author publications
You can also search for this author in PubMed Google Scholar
Sven E. Anderson
View author publications
You can also search for this author in PubMed Google Scholar
J. Devin McAuley
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Duke University Medical Center, Durham, North Carolina, USA
Ellen Covey
Office of Naval Research, Arlington, Virginia, USA
Harold L. Hawkins
Indiana University, Bloomington, Indiana, USA
Robert F. Port

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Port, R.F., Anderson, S.E., McAuley, J.D. (1995). Toward Simulated Audition in Open Environments. In: Covey, E., Hawkins, H.L., Port, R.F. (eds) Neural Representation of Temporal Patterns. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1919-5_4

Download citation

DOI: https://doi.org/10.1007/978-1-4615-1919-5_4
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5785-8
Online ISBN: 978-1-4615-1919-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics