Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Assmann, P. and Summerfield, Q. (in press) The perception of speech under adverse acoustic conditions. In Speech Processing in the Auditory System, Springer Handbook of Auditory Research Vol. 14, S. Greenberg and W. Ainsworth, eds., Springer.
Barker, J., Cooke, M.P., and Ellis, D.P.W. (in press) Decoding speech in the presence of other sources, Speech Communication.
Bregman, A.S., 1990, Auditory Scene Analysis. Cambridge, MA: MIT Press.
Brown, G.J. and Cooke, M.P., 1994, Computational auditory scene analysis, Computer Speech and Language, 8, 297–336.
Brungart, D.S., Simpson, B.D., Ericson, M.A., and Scott, K.R., 2001, Informational and energetic masking effects in the perception of multiple simultaneous talkers, J. Acoust. Soc. Am., 110, 2527–2538.
de Cheveigné, A., 1993, Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing, J. Acoust. Soc. Am., 93, 3271–3290.
Cooke, M.P., 2003a, Glimpsing Speech, Journal of Phonetics.
Cooke, M.P., 2003b, A glimpsing model of speech perception, Proc. Int. Cong. Phonetic Sciences.
Cooke, M.P., Green, P.D., and Crawford, M.D., 1994, Handling missing data in speech recognition. Proc. ICSLP, 1555–1558.
Cooke, M.P., Green, P.D., Josifovski, L., and Vizinho, A., 2001, Robust automatic speech recognition with missing and uncertain acoustic data, Speech Communication, 34, 267–285.
Cooke, M.P. and Ellis, D.P.W., 2001, The auditory organization of speech and other sources in listeners and computational models, Speech Communication, 35, 141–177.
Darwin, C.J., 1981, Perceptual grouping of speech components differing in fundamental frequency and onset-time, Quarterly Journal of Experimental Psychology, 3A, 185–207.
Miller, G.A., 1947, The masking of speech, Psych. Bull., 44, 105–129.
Nakatani, T., Okuno, H.G., Goto, M., and Ito, T., 1998, Multiagent based binaural sound stream segregation, in: Readings in Computational Auditory Scene Analysis, D. Rosenthal and H. Okuno, eds., Lawrence Erlbaum.
Remez, R.E., Rubin, P.E., Berns, S.M., Pardo, J.S., and Lang, J.M., 1994, On the perceptual organization of speech, Psychological Review, 101(1), 129–156.
Roweis, S.T., 2003, Factorial models and refiltering for speech separation and denoising, Proc. Eurospeech, 1009–1012.
Shannon, R.V., Jensvold, A., Padilla, M., Robert, M.E., and Wang, X., 1999, Consonant recordings for speech testing, J. Acoust. Soc. Am., 106, L71–L74.
Varga, A.P. and Moore, R.K., 1990, Hidden Markov Model decomposition of speech and noise, Proc. ICASSP, 845–848.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science + Business Media, Inc.
About this chapter
Cite this chapter
Cooke, M. (2005). Making Sense of Everyday Speech: a Glimpsing Account. In: Divenyi, P. (eds) Speech Separation by Humans and Machines. Springer, Boston, MA. https://doi.org/10.1007/0-387-22794-6_21
Download citation
DOI: https://doi.org/10.1007/0-387-22794-6_21
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-8001-2
Online ISBN: 978-0-387-22794-8
eBook Packages: EngineeringEngineering (R0)