Abstract
This paper considers the problem of hearing in the broadest possible terms. What kind of information is available in sound about events in the environment? What kind of cognitive mechanisms could extract this information? To address such global problems, we think it is essential to construct simulations of auditory processing, but to evaluate progress, real-world audition is difficult to deal with. To test the simulations, it is useful to develop artificial auditory environments that are simplified, yet retain certain critical properties of natural auditory environments, such as the property of openness. In this paper, we shall schematize the general properties of auditory environments and some major classes of information about time that are relevant for perception. Our theme is that to understand auditory cognition, we not only need to understand auditory processing mechanisms, we also need a clear idea of the problems that must be solved by an auditory system. If the task can be clearly defined, some hypotheses can be formulated about the global dynamic properties of neural systems that are potentially capable of performing these tasks. These systems may then serve as a starting point for developing models of neural mechanisms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Anderson, S., 1992, Self-organization of auditory motion detectors, in: “Proceedings of the Fourteenth Annual Conference of the Cognitive Science Society”, Lawrence Erlbaum Associates., Hillsdale, NJ.
Anderson, S., 1994, “A Computational Model of Auditory Pattern Recognition”, PhD thesis, Technical Report No. 112, Cognitive Science Program, Indiana University, Bloomington, IN.
Baddeley, A., 1992, Working memory. Science, 255:556.
Barlow, H. B., and Levick, W. R., 1965, The mechanism of directionally selective units in a rabbit’s retina. J. Physiol., 173:477.
Bregman, A. S., 1990. “Auditory Scene Analysis: The Perceptual Organization of Sound”, Bradford Books, MIT Press, Cambridge, MA.
Carpenter, G., and Grossberg, S., 1987, A massively parallel architecture for a self-organizing neural pattern recognition machine. Computer Vision, Graphics and Image Processing, 37:54.
Crowder, R., and Morton, J., 1969, Precategorical acoustic storage. Percept. Psychophys., 5:365.
Delgutte, B., and Kiang, N. ,1984, Speech coding in the auditory nerve: I. Vowel-like sounds. J. Acoust. Soc. Am., 75:866.
Drake, C., and Botte, M.-C., 1993, Tempo sensitivity in auditory sequences: Evidence for a multiple-look model. Percept. Psychophys., 54:277.
Gaver, W. W., 1993, What in the world do we hear? An ecological approach to auditory perception. Ecol. Psychol., 5:1.
Gibson, J. J., 1968, “The Senses Considered as Perceptual Systems”, Harcourt Brace, New York, NY.
Govindarajan, K. K., Grossberg, S., Wyse, L. L., and Cohen, M. A., 1994, A neural network model of auditory scene analysis and source segregation, Technical Report CAS/CNS-TR-94-039, Center for Adaptive Systems, Boston University, Boston, MA.
Green, D., 1976, “An Introduction to Hearing”, Lawrence Erlbaum Associates, Hillsdale, NJ.
Grossberg, S., 1976, Adaptive pattern classification and universal recoding. Biol. Cybern., 23:121.
Grossberg, S., 1986, The adaptive self-organization of serial order in behavior: Speech language, and motor control, in: “Pattern Recognition by Humans and Machines: Speech Perception”, E. Schwab, and H. Nusbaum, eds., Academic Press, Orlando, FL.
Grossberg, S., 1987, Competitive learning: From interactive activation to adaptive resonance. Cognitive Sci., 11:23.
Grossberg, S., and Rudd, M. E. (1992). Cortical dynamics of visual motion perception: short-range and long-range apparent motion. Psychol. Rev., 99:78.
Handel, S., 1989, “Listening: An Introduction to the Perception of Auditory Events”, Bradford Books/MIT Press, Cambridge, MA.
Handel, S., 1993, The effect of tempo and tone duration on rhythm discrimination. Percept. Psychophys., 54:370.
Hermansky, H., 1990, Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am., 87:1738.
Large, E. W., 1994, “Dynamic Representation of Musical Structure”, Ph.D. Thesis, The Ohio State University, Columbus, OH.
Large, E. W., and Kolen, J. F., 1994, Resonance and the perception of musical meter. Connection Sci., 6:177.
Lesser, V. R., Fennel, R. D., Erman, L. D., and Reddy, D. R., 1975, Organization of the Hearsay-II speech understanding system. International Conference on Acoustics, Speech, and Signal Processing, 23:11.
Levitt, H., 1971, Transformed up-down methods in psychoacoustics. J. Acoust. Soc. Am., 49:467.
Marshall, J., 1990, Self-organizing neural networks for perception of visual motion. Neural Networks, 3:45.
Marshall, J., 1995, Adaptive perceptual pattern recognition by self-organizing neural networks: Context, uncertainty, multiplicity, and scale. Neural Networks, in press.
Massaro, D. W., 1972, Perceptual images, processing time, and perceptual units in auditory perception. Psychol. Rev., 79:124.
McAuley, J. D., 1993, Learning to perceive and produce rhythmic patterns in an artificial neural network., Technical Report 371, Computer Science Department, Indiana University, Bloomington, IN.
McAuley, J. D., 1994a, Finding metrical structure in time. in: “Proceedings of the 1993 Connectionist Models Summer School”, M.C. Mozer, P. Smolensky, D.S. Touretzky, J.L. Elman, and A.S. Weigend, eds, Lawrence Erlbaum Associates, Hillsdale, NJ.
McAuley, J. D., 1994b, Time as phase: A dynamic model of time perception. in: “Proceedings of the Sixteenth Annual Meeting of the Cognitive Science Society”, Lawrence Erlbaum Associates, Hillsdale, NJ.
McAuley, J. D., 1995, “On the Perception of Time as Phase: Toward an Adaptive Oscillator Model of Rhythm”. Ph.D. Thesis, Cognitive Science Technical Report, TR-137, Indiana University, Bloomington, IN.
Moore, B. C. J., 1989, “An Introduction to Psychology of Hearing”, third edition, Harcourt Brace Jovanovich, New York, NY.
Patterson, R. and Holdsworth, J., 1990, “An Introduction to Auditory Sensation Processing”. MRC Applied Psychology Unit, Cambridge, UK.
Port, R., 1986, Invariance in phonetics, in: “Invariance and Variability in Speech Processes”, J. Perkell, and D Klatt., eds, Lawrence Erlbaum Associates, Hillsdale, NJ.
Port, R., and Cummins, F., 1992, The English voicing contrast as velocity perturbation, in: “Proceedings of the 1992 International Conference on Spoken Language Processing”, J. Ohala, T. Nearey, B Derwing, M. Hodge, and G. Wiebe, eds, University of Alberta, Edmunton.
Port, R., Cummins, F., and Gasser, M., 1995, A dynamical approach to rhythm in language: Toward a phonology of time. in: “Proceedings of the Chicago Linguistic Society”, Chicago Linguistic Society, Chicago, IL (in press).
Port, R., Cummins, F., and McAuley, J. D., 1995, Naive time, temporal patterns and human audition, in: “Mind as Motion: Explorations in the Dynamics of Cognition”, R. Port, and T. van Gelder, eds, MIT Press, Cambridge, MA.
Port, R. and Dalby, J., 1982, C/V ratio as a cue for voicing in English. J. Acoust. Soc. Am., 69:262.
Port, R. F., 1990, Representation and recognition of temporal patterns. Connection Sci., 2:151..
Port, R.F., Dalby, J., and O’Dell, M., 1987, Evidence for mora timing in Japanese. J. Acoust. Soc. Am., 81:1574.
Povel, D.-J. and Essens, P., 1985, Perception of temporal patterns. Music Perception, 2:411.
Sankoff, D. and Kruskal, J. B., eds, 1983. “Time Warps, String Edits and Macromolecules: The Theory and Practice of Sequence Comparison”, Addison-Wesley, Reading, MA.
Shamma, S. A., 1985a, Speech processing in the auditory system I. The representation of speech sounds in the responses of the auditory nerve. J. Acoust. Soc. Am., 78:1612.
Shamma, S. A., 1985b, Speech processing in the auditory system II. Lateral inhibition and the central processing of speech evoked activity in the auditory nerve. J. Acoust. Soc. Am., 78:1622.
Smythe, E., 1987, The detection of formant transitions in a connectionist network, in: “Proceedings of the First IEEE International Conference on Neural Networks”, San Diego, CA.
Smythe, E. J., 1988, Temporal computation in connectionist models. Technical Report 251, Indiana University, Computer Science Department, Indiana University, Bloomington, IN.
Torras, C., 1985, “Temporal-Pattern Learning in Neural Models”, Springer Verlag, Berlin.
Vercoe, B. L., 1986, C-sound. Technical report, Experimental Music Studio, Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA.
Watson, C. S. and Nichols, T. S., 1976, Detectability of auditory signals presented without defined observation intervals. J. Acoust. Soci. Am., 59:655.
Whitfield, I. C. and Evans, E. F., 1965, Responses of auditory cortical neurons to stimuli of changing frequency. J. Neurophysiol., 28:655.
Wilson, M. A., Bhalla, U. S., Uhley, J. D., and Bower, J. M., 1989, GENESIS: a system for simulating neural networks, in: “Advances in Neural Information Processing Systems I”, D.S. Touretzky, ed, Morgan Kaufmann, San Mateo, CA.
Yost, W., 1991, Auditory image perception and analysis: the basis for hearing. Hearing Res., 56:244.
Yost, W., 1992, Auditory perception and sound source determination. Psychol. Sci., 1:179.
Yost, W. A. and Watson, C. S., eds , 1987, “Auditory Processing of Complex Sounds”, Lawrence Erlbaum Associates, Hillsdale, NJ.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1995 Springer Science+Business Media New York
About this chapter
Cite this chapter
Port, R.F., Anderson, S.E., McAuley, J.D. (1995). Toward Simulated Audition in Open Environments. In: Covey, E., Hawkins, H.L., Port, R.F. (eds) Neural Representation of Temporal Patterns. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1919-5_4
Download citation
DOI: https://doi.org/10.1007/978-1-4615-1919-5_4
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5785-8
Online ISBN: 978-1-4615-1919-5
eBook Packages: Springer Book Archive