Abstract
The purpose of this workshop is to discuss the psychophysics of speech perception. The program includes a variety of topics that presumably fall under this heading and that demonstrate that the psychophysics of speech perception is alive and well. Yet it is not really obvious what the psychophysics of speech perception is, what its goals and limitations are, and whether it is indeed a circumscribed area of investigation. It seems useful, therefore, to pose these basic questions explicitly and to include them in our discussions along with the many specific issues addressed by our research. The purpose of my paper is to stimulate such discussion by presenting a particular, possibly controversial, view of speech perception, psychophysics, and the relation between the two.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Preparation of this paper was supported by NICHD Grant HD-01994 to Haskins Laboratories. I am grateful to Al Bregman, Bob Crowder, Jim Flege, Ignatius Mattingly, Robert Remez, and Michael Studdert-Kennedy for helpful comments on an earlier draft.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abramson, A., and Lisker, L. (1985). Relative power of cues: FO versus voice timing. In V.A. Fromkin (Ed.), Phonetic linguistics. Essays in honor of Peter Ladefoged. New York: Academic. Pp. 25–33.
Abramson, A.S., Nye, P.W., Henderson, J.B., and Marshall, C.W. (1981). Vowel height and the perception of consonantal nasality. Journal of the Acoustical Society of America, 70, 329–339.
Ainsworth, W.A., and Lindsay, D. (1986). Perception of pitch movement on tonic syllables in British English. Journal of the Acoustical Society of America, 79, 472–480.
Atal, B.S., Chang, J.J., Mathews, M.V., and Tukey, J.W. (1978). Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique. Journal of the Acoustical Society of America, 63, 1535–1555.
Bailey, P.J., and Summerfield, Q. (1980). Information in speech: Observations on the perception of [s]-stop clusters. Journal of Experimental Psychology: Human Perception and Performance, 6, 536–563.
Bladon, R.A.W., and Lindblom, B.E.F. (1981). Modeling the judgment of vowel quality. Journal of the Acoustical Society of America, 69, 1414–1422.
Blomberg, M., Carlson, R., Elenius, K. and Granström, B. (1986). Auditory models as front ends in speech recognition systems. In J.S. Perkell and D.H. Klatt (Eds.), Invariance and variability in speech processes. Hillsdale, N.J.: Erlbaum. Pp. 108–114.
Blumstein, S.E., and Stevens, K.N. (1979). Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants. Journal of the Acoustical Society of America, 66, 1001–1017.
Browman, C.P., and Goldstein, L.M. (1985). Dynamic modeling of phonetic structure. In V.A. Fromkin (Ed.), Phonetic linguistics. Essays in honor of Peter Ladefoged. New York: Academic. Pp. 35–53.
Bregman, A.S. (1977). Perception and behavior as compositions of ideals. Cognitive Psychology, 9, 250–292.
Chistovich, L.A. (1971). Problems of speech perception. In L.L. Hammerich, R. Jakobson, and E. Zwirner, (Eds.), Form & substance. Copenhagen: Akademisk Forlag. Pp. 83–93.
Chistovich, L.A. (1980). Auditory processing of speech. Language and Speech, 23, 67–75.
Chistovich, L.A. (1985). Central auditory processing of peripheral vowel spectra. Journal of the Acoustical Society of America, 77, 789–805.
Chistovich, L.A., Fant, G., de Serpa-Leitao, A., and Tjernlund, P. (1966). Mimicking and perception of synthetic vowels. Quarterly Progress and Status Report (Royal Technical University, Speech Transmission Laboratory, Stockholm), 2, 1–18.
Chomsky, N. (1965). Aspects of the theory of syntax. Cambridge, MA: MIT Press.
Chomsky, N. (1968). Language and mind. New York: Harcourt, Brace & World.
Darwin, C.J. (1984). Perceiving vowels in the presence of another sound: Constraints on formant perception. Journal of the Acoustical Society of America, 76, 1636–1647.
Denes, P.B., and Pinson, E.N. (1963). The speech chain. Bell Telephone Laboratories.
Derr, M.A., and Massaro, D.W. (1980). The contribution of vowel duration, F0 contour, and frication duration as cues to the /juz/-/jus/ distinction. Perception & Psychophysics, 27, 51–59.
Elliot, L.L., Longinotti, C, Clifton, L.-A., and Meyer, D. (1981). Detection and identification thresholds for consonant-vowel syllables. Perception & Psychophysics, 30, 411–416.
Elman, J.L., and McClelland, J.L. (1984). Speech perception as a cognitive process: The interactive activation model. In N.J. Lass (Ed.), Speech and Language: Advances in research and practice. Vol. 10. New York: Academic. Pp. 337–374.
Elman, J.L., and McClelland, J.L. (1986). Exploiting lawful variability in the speech wave. In J.S. Perkell and D.H. Klatt (Eds.), Invariance and variability in speech processes. Hillsdale, NJ: Erlbaum. Pp. 360–380.
Flege, J.E. (in press). The production and perception of foreign language speech sounds. In H. Winitz (Ed.), Human communication and its disorders, Vol. 1. Norwood, NJ: Ablex.
Fodor, J. (1983). The modularity of mind. Cambridge, MA: MIT Press.
Fónagy, I. (1961). Communication in poetry. Word, 17, 194–218.
Fowler, C.A. (1984). Segmentation of coarticulated speech in perception. Perception & Psychophysics, 36, 359–368.
Fujisaki, H., and Kawashima, T. (1969). On the modes and mechanisms of speech perception. Annual Report of the Engineering Research Institute (Faculty of Engineering, University of Tokyo), 28, 67–73.
Fujisaki, H., and Kawashima, T. (1970). Some experiments on speech perception and a model for the perceptual mechanism. Annual Report of the Engineering Research Institute (Faculty of Engineering, University of Tokyo), 29, 207–214.
Ganong, W.F., III, and Zatorre, R.J. (1980). Measuring phoneme boundaries four ways. Journal of the Acoustical Society of America, 68, 431–439.
Gibson, J.J. (1966). The senses considered as perceptual systems. Boston: Houghton Mifflin.
Green, K.P., and Miller, J.L. (1985). On the role of visual rate information in phonetic perception. Perception & Psychophysics, 38, 269–276.
Grossberg, S. (1986). The adaptive self-organization of serial order in behavior: Speech, language, and motor control. In E.C. Schwab and H.C. Nusbaum (Eds.), Pattern recognition by humans and machines, Vol. I. New York: Academic.
Hary, J.M., and Massaro, D.W. (1982). Categorical results do not imply categorical perception. Perception & Psychophysics, 32, 409–418.
Hayek, F.A. (1952). The sensory order. Chicago: University of Chicago Press.
Howell, P., and Rosen, S. (1983). Production and perception of rise time in the voiceless affricate/fricative distinction. Journal of the Acoustical Society of America, 73, 976–984.
Hrushovski, B. (1980). The meaning of sound patterns in poetry. Poetics Today, 2, 39–56.
Jusczyk, P.W. (1985). On characterizing the development of speech perception. In J. Mehler and R. Fox (Eds.), Neonate cognition: Beyond the blooming buzzing confusion. Hillsdale, NJ: Erlbaum. Pp. 199–229.
Jusczyk, P.W. (1986). Toward a model of the development of speech perception. In J.S. Perkell and D.H. Klatt (Eds.), Invariance and variability in speech processes. Hillsdale, NJ: Erlbaum. Pp. 1–18.
Kasuya, H., Takeuchi, S., Sato, S., and Kido, K. (1982). Articulatory parameters for the perception of bilabials. Phonetica, 39, 61–70.
Kelso, J.A.S., Vatikiotis-Bateson, E., Saltzman, E.L., and Kay, B. (1985). A qualitative dynamic analysis of reiterant speech production: Phase portraits, kinematics, and dynamic modeling. Journal of the Acoustical Society of America, 77, 266–280.
Kent, R.D. (1973). The imitation of synthetic vowels and some implications for speech memory. Phonetica, 28, 1–25.
Klatt, D.H. (1979). Speech perception: a model of acoustic-phonetic analysis and lexical access. Journal of Phonetics, 7, 276–312.
Klatt, D.H. (1982). Prediction of perceived phonetic distance from critical-band spectra: A first step. Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing, Paris, France. Pp. 1278–1281.
Klatt, D.H. (1986). Problem of variability in speech recognition and in models of speech perception. In J.S. Perkell and D.H. Klatt (Eds.), Invariance and variability in speech processes. Hillsdale, NJ: Erlbaum. Pp. 300–319.
Kuhl, P.K. (1979). Speech perception in early infancy: Perceptual constancy for spectrally dissimilar vowel categories. Journal of the Acoustical Society of America, 66, 1668–1679.
Kuhl, P.K. (1981). Discrimination of speech by nonhuman animals: Basic auditory sensitivities conducive to the perception of speech-sound categories. Journal of the Acoustical Society of America, 70, 340–349.
Kuhl, P.K. (1985). Categorization of speech by infants. In J. Mehler and R. Fox (Eds.), Neonate cognition: Beyond the blooming buzzing confusion. Hillsdale, NJ: Erlbaum. Pp. 231–262.
Kuhl, P.K., and Meltzoff, A.N. (1982). The bimodal perception of speech in infancy. Science, 218, 1138–1141.
Ladefoged, P., Harshman, R., Goldstein, L., and Rice, L. (1978). Generating vocal tract shapes from formant frequencies. Journal of the Acoustical Society of America, 64, 1027–1035.
Liberman, A.M. (1982). On finding that speech is special. American Psychologist, 37, 148–167.
Liberman, A.M., and Mattingly, I.G. (1985). The motor theory of speech perception revised. Cognition 21, 1–36.
Lindblom, B. (1983). Economy of speech gestures. In P.F. MacNeilage (Ed.), The production of speech. New York: Springer-Verlag. Pp. 207–246.
Lindblom, B., MacNeilage, P., and Studdert-Kennedy, M. (1983). Self-organizing processes and the explanation of phonological universals. In B. Butterworth, B. Comrie, and D. Dahl (Eds.), Explanations of linguistic universals. The Hague: Mouton. Pp. 181–203.
Lindblom, B.E.F., and Sundberg, J.E.F. (1971). Acoustical consequences of lip, tongue, jaw, and larynx movement. Journal of the Acoustical Society of America, 50, 1166–1179.
Linell, P. (1982). The concept of phonological form and the activities of speech production and speech perception. Journal of Phonetics, 10, 37–72.
Lisker, L., Liberman, A.M., Erickson, D.M., Dechovitz, D., and Mandler, R. (1977). On pushing the voice-onset-time (VOT) boundary about. Language and Speech, 20, 209–216.
MacKain, K.S., Best, C.T. and Strange, W. (1981). Categorical perception of English /r/ and /l/ by Japanese bilinguals. Applied Psycholinguistics, 2, 369–390.
Macmillan, N.A., Kaplan, H.L., and Creelman, C.D. (1977). The psychophysics of categorical perception. Psychological Review, 84, 452–471.
Marks, L.E. (1978). The unity of the senses. New York: Academic.
Marslen-Wilson, W.D., and Welsh, A. (1978). Processing interactions and lexical access during word recognition in continuous speech. Cognitive Psychology, 10, 29–63.
Massaro, D.W. (in press). Categorical partition: A fuzzy logical model of categorization behavior. In S.N. Harnad (Ed.), Categorical Perception. New York: Cambridge University Press.
Massaro, D.W., and Cohen, M.M. (1983a). Evaluation and integration of visual and auditory information in speech perception. Journal of Experimental Psychology: Human Perception and Performance, 9, 753–771.
Massaro, D.W., and Cohen, M.M. (1983b). Categorical or continuous speech perception: a new test. Speech Communication, 2, 15–35.
Massaro, D.W., and Oden, G.C. (1980a). Speech perception: A framework for research and theory. In N.J. Lass (Ed.), Speech and Language: Advances in research and practice. Vol.3. New York: Academic. Pp. 129–165.
Massaro, D.W., and Oden, G.C. (1980b). Evaluation and integration of acoustic features in speech perception. Journal of the Acoustical Society of America, 67, 996–1013.
Mattingly, I.G. (1972). Reading, the linguistic process, and linguistic awareness. In J.F. Kavanagh and I.G. Mattingly (Eds.), Language by ear and eye: The relationships between speech and reading. Cambridge, MA: MIT Press. Pp. 133–148.
McGurk, H., and MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264, 746–748.
Meltzoff, A.N., and Moore, M.K. (1985). Cognitive foundations and social functions of imitation and intermodal representation in infancy. In J. Mehler and R. Fox (Eds.), Neonate cognition: Beyond the blooming buzzing confusion. Hillsdale, NJ: Erlb Pp. 139–156.
Miller, G.A., and Nicely, P. (1955), An analysis of perceptual confusions among some English consonants. Journal of the Acoustical Society of America, 27, 338–352.
Morais, J., Cary, L, Alegria, J., and Bertelson, P. (1979). Does awareness of speech as a sequence of phones arise spontaneously? Cognition, 7, 323–331.
Neisser, U. (1976). Cognition and reality: Principles and implications of cognitive psychology. San Francisco: Freeman.
Ohala, J.J. (1981). The listener as a source of sound change. In C.S. Marek, R.A. Hendrick, and M.F. Miller (Eds.), Papers from the parasessipn in language and behaviour. Chicago: Chicago Linguistic Society. Pp. 178–203.
Ohala, J.J. (1983). The origin of sound patterns in vocal tract constraints. In P.F. MacNeilage (Ed.), The production of speech. New York: Springer-Verlag. Pp. 189–216.
Pisoni, D.B. (1973). Auditory and phonetic memory codes in the discrimination of consonants and vowels. Perception & Psychophysics, 13, 253–260.
Pisoni, D.B., and Lazarus, J.H. (1974). Categorical and noncategorical modes of speech perception along the voicing continuum. Journal of the Acoustical Society of America, 55, 328–333.
Remez, R.E., Rubin, P.E., Pisoni, D.B., and Carrell, T.D. (1981). Speech perception without traditional speech cues. Science, 212, 947–950.
Repp, B.H. (1981). On levels of description in speech research. Journal of the Acoustical Society of America, 69, 1462–1464.
Repp, B.H. (1982). Phonetic trading relations and context effects: New experimental evidence for a speech mode of perception. Psychological Bulletin, 92, 81–110.
Repp, B.H. (1983). Trading relations among acoustic cues in speech perception are largely a result of phonetic categorization. Speech Communication, 2, 341–362.
Repp, B.H. (1984). Categorical perception: Issues, methods, findings. In N.J. Lass (Ed.), Speech and language: Advances in Research and practice. Vol.10. New York: Academic. Pp. 243–335.
Repp, B.H., and Liberman, A.M. (in press). Phonetic category boundaries are flexible. In S.N. Harnad (Ed.), Categorical Perception. New York: Cambridge University Press.
Repp, B.H., and Williams, D.R. (1985). Categorical trends in vowel imitation: Preliminary observations from a replication experiment. Speech Communication, 4, 105–120.
Rosen, S., and Howell, P. (in press). Auditory, articulatory, and learning explanations of categorical perception in speech. In S.N. Harnard (Ed.), Categorical Perception. New York: Cambridge University Press.
Rosner, B.S. (1984). Perception of voice-onset-time continua: A signal detection analysis. Journal of the Acoustical Society of America, 75, 1231–1242.
Rubin, P., Baer, T., and Mermelstein, P. (1981). An articulatory synthesizer for perceptual research. Journal of the Acoustical Society of America, 70, 321–328.
Sachs, R.M., and Grant, K.W. (1976). Stimulus correlates in the perception of voice onset time (VOT): II. Discrimination of speech with high and low stimulus uncertainty. Journal of the Acoustical Society of America, 60, (Suppl. No. 1), S91. (Abstract)
Samuel, A.G. (1977). The effect of discrimination training on speech perception: Noncategorical perception. Perception & Psychophysics, 22, 321–330.
Samuel, A.G. (1982). Phonetic prototypes. Perception & Psychophysics, 31, 307–314.
Schroeder, M.R., and Strube, H.W. (1979). Acoustic measurements of articulator motions. Phonetica, 36, 302–313.
Shepard, R.N. (1980). Psychophysical complementarity. In M. Kubovy and J.R. Pomerantz (Eds.), Perceptual organization. Hillsdale, NJ: Erlbaum. Pp. 279–341.
Shepard, R.N. (1984). Ecological constraints on internal representation: Resonant kinematics of perceiving, imagining, thinking, and dreaming. Psychological Review, 91, 417–447.
Sidwell, A., and Summerfield, Q. (1985). The effect of enhanced spectral contrast on the internal representation of vowel-shaped noise. Journal of the Acoustical Society of America, 78, 495–506.
Silman, S., Gelfand, S.A., and Silverman, K.E.A. (1984). Late-onset auditory deprivation: Effects of monaural versus binaural hearing aids. Journal of the Acoustical Society of America, 76, 1357–1362.
Soli, S.D. (1983). The role of spectral cues in discrimination of voice onset time differences. Journal of the Acoustical Society of America, 73, 2150–2165.
Stevens, K.N., and Blumstein, S.E. (1978). Invariant cues for place of articulation in stop consonants. Journal of the Acoustical Society of America, 64, 1358–1368.
Stevens, K.N., and Blumstein, S.E. (1981). The search for invariant acoustic correlates of phonetic features. In P.D. Eimas and J.L. Miller (Eds.), Perspectives in the study of speech. Hillsdale, NJ: Erlbaum. Pp. 1–38.
Studdert-Kennedy, M. (1982). On the dissociation of auditory and phonetic perception. In R. Carlson and B. Granström (Eds.), The representation of speech in the peripheral auditory system. Amsterdam: Elsevier. Pp. 9–26.
Studdert-Kennedy, M. (1985). Perceiving phonetic events. In W.H. Warren and R.E. Shaw (Eds.), Persistence and change: Proceedings of the first international conference on event perception. Hillsdale, NJ: Erlbaum.
Studdert-Kennedy, M., Liberman, A.M., Harris, K.S., and Cooper, F.S. (1970). Motor theory of speech perception: A reply to Lanefs critical review. Psychological Review, 77, 234–249.
Summerfield, Q. (1979). Use of visual information for phonetic perception. Phonetica, 36, 314–331.
Summerfield, Q. (in press). Preliminaries to a comprehensive account of audio-visual speech perception. In B. Dodd and R. Campbell (Eds.), Hearing by eye. Hilldale, NJ: Erlbaum.
Tallal, P., and Stark, R. (1981). Speech acoustic-cue discrimination abilities of normally developing and language-impaired children. Journal of the Acoustical Society of America, 69, 568–574.
Tatham, M.A.A. (1980). Phonology and phonetics as part of the language encoding/decoding system. In N.J. Lass (Ed.), Speech and language: Advances in research and practice. Vol.3. New York: Academic. Pp. 35–73.
Titchener, E.B. (1909). Lectures on the elementary psychology of the thought-process.
Toulmin, S. (1972). Human understanding: The collective use and evolution of concepts. Princeton: Princeton University Press.
Tuller, B., and Fowler, C.A. (1980). Some articulatory correlates of perceptual isochrony. Perception & Psychophysics, 27, 277–283.
Tyler, R.S., Summerfield, Q., Wood, E.J. and Fernandes, M.A. (1982). Psychoacoustic and phonetic temporal processing in normal and hearing-impaired listeners. Journal of the Accoustical Society of America, 72, 740–752.
Vinegrad, M.D. (1972). A direct magnitude scaling method to investigate categorical versus continuous modes of speech perception. Language and Speech, 15, 114–121.
Warren, R.M. (1981). Chairman’s comments. In T. Myers, J. Laver, and J. Anderson (Eds.), The cognitive representation of speech. Amsterdam: North Holland. Pp. 34–37.
Watson, C.S., Kewley-Port, D., and Foyle, D.C. (1985). Temporal acuity for speech and nonspeech sounds: The role of stimulus uncertainty. Journal of the Acoustical Society of America, 77, (Suppl. No. 1), S27. (Abstract)
Welsh, L.W., Welsh, J.J., and Healy, M.P. (1983). Effect of sound deprivation on central hearing. Laryngoscope, 93, 1569–1575.
Werker, J.F., and Tees, R.C. (1984). Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior and Development, 7, 49–63.
Werker, J.F., Gilbert, J.H.V., Humphrey, K., and Tees, R.C. (1981). Developmental aspects of cross-language speech perception. Child Development, 52, 349–355.
Wood, C.C. (1976). Discrimination, response bias, and phoneme categories in discrimination of voice onset time. Journal of the Acoustical Society of America, 60, 1381–1389.
Yates, J. (1985). The content of awareness is a model of the world. Psychological Review 92, 249–284.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1987 Martinus Nijhoff Publishers, Dordrecht
About this chapter
Cite this chapter
Repp, B.H. (1987). The Role of Psychophysics in Understanding Speech Perception. In: Schouten, M.E.H. (eds) The Psychophysics of Speech Perception. NATO ASI Series, vol 39. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3629-4_1
Download citation
DOI: https://doi.org/10.1007/978-94-009-3629-4_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-8123-8
Online ISBN: 978-94-009-3629-4
eBook Packages: Springer Book Archive