Perceptual Separation of Speech from Concurrent Sounds

Chapter in: The Psychophysics of Speech Perception

Part of the book series: NATO ASI Series (ASID, volume 39)

Abstract

Hearing has evolved in a hostile environment. Between the sound source and the ear, the different frequencies of a sound are differentially absorbed by the air, reflected by surfaces and mixed in with other sounds. Despite all these distortions and additions, the brain achieves a remarkable constancy of percept, especially in the perception of speech. If we take this ability seriously, we are led to difficult but crucial issues concerning the relationship between phonetic knowledge and the representation of sound in the auditory system. These issues never arise if we only consider the perception of speech as the perception of the sound produced by a single speaker, or synthesiser, heard in a sound-proof room, yet they are crucial in understanding the computational problem faced by hearing, and of great practical significance in constructing robust speech recognition devices or hearing aids. Speech perception — come out of the closet and join the cocktail-party!

Gardner’s salary was provided by SERC grants GR/C 8522.1 and GR/C 65930. Research facilities were provided by SERC grant GR/D 6009.9.

Copyright information

© 1987 Martinus Nijhoff Publishers, Dordrecht

About this chapter

Cite this chapter

Darwin, C.J., Gardner, R.B. (1987). Perceptual Separation of Speech from Concurrent Sounds. In: Schouten, M.E.H. (ed.) The Psychophysics of Speech Perception. NATO ASI Series, vol 39. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3629-4_7

  • DOI: https://doi.org/10.1007/978-94-009-3629-4_7

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-8123-8

  • Online ISBN: 978-94-009-3629-4

  • eBook Packages: Springer Book Archive
