Skip to main content

Techniques for Robust Speech Recognition in Noisy and Reverberant Conditions

  • Chapter
Book cover Speech Separation by Humans and Machines

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Assmann, P. and Summerfield, Q., 2003, The perception of speech under adverse acoustic conditions, in: Speech processing in the auditory system (Springer handbook of auditory research vol. 18), Greenberg, S., Ainsworth, W., eds. Springer-Verlag.

    Google Scholar 

  • Barker, J., Cooke, M.P., and Green, P.D., 2001, Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise. Proc. EUROSPEECH, 2001, pp. 213–217.

    Google Scholar 

  • Bregman, A.S., 1990, Auditory Scene Analysis. MIT Press, Cambridge, MA.

    Google Scholar 

  • Brown, G. J., Barker, J., and Wang, D. L., 2001, A neural oscillator sound separator for missing data speech recognition. Proc. IJCNN 2001, pp. 2907–2912.

    Google Scholar 

  • Cooke, M.P., Green, P.D., Josifovski, L., and Vizinho, A., 2001, Robust automatic speech recognition with missing and unreliable acoustic data. Speech Comm., 34, pp. 267–285.

    Google Scholar 

  • Culling, J.F. and Summerfield, Q., 1995, Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay. J. Acoust. Soc. Am., 98(2), pp. 785–797.

    Google Scholar 

  • Darwin, C.J. and Hukin, R.W., 2000, Effects of reverberation on spatial, prosodic and vocaltract size cues to selective attention. J. Acoust. Soc. Am., 108(1), pp. 335–342.

    Article  Google Scholar 

  • Hermansky, H., 1998, Should recognisers have ears? Speech Comm., 25, pp. 3–27.

    Google Scholar 

  • Hukin, R.W. and Darwin, C.J., 1995, Effects of contralateral presentation and of interaural time differences in segregating a harmonic from a vowel. J. Acoust. Soc. Am., 98(3), pp. 1380–1387.

    Article  Google Scholar 

  • Kingsbury, B.E.D., 1998, Perceptually inspired signal-processing strategies for robust speech recognition in reverberant environments. PhD thesis, Univ. California, Berkeley.

    Google Scholar 

  • Lippmann, R.P., 1997, Speech recognition by machines and humans. Speech Comm., 22, pp. 1–15.

    Google Scholar 

  • Litovsky, R.Y., Colburn, S.H., Yost, W.A., and Guzman, S.J., 1999, The precedence effect. J. Acoust. Soc. Am., 106(4), pp. 1633–1654.

    Article  Google Scholar 

  • Palomäki, K.J., Brown, G.J., and Barker, J., 2002, Missing data speech recognition in reverberant conditions. Proc. ICASSP, Orlando, 13th–17th May, pp. 65–68.

    Google Scholar 

  • Palomäki, K.J., Brown, G.J., and Wang, D.L., 2004a, A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation. Speech Comm., in press.

    Google Scholar 

  • Palomäki, K.J., Brown, G.J., and Barker, J., 2004b, Techniques for handling convolutional distortion with ‘missing data’ automatic speech recognition. Speech Comm., in press.

    Google Scholar 

  • Shamsoddini, A. and Denbigh, P.N., 2001, A sound segregation algorithm for reverberant conditions. Speech Comm., 33, pp. 179–196.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer Science + Business Media, Inc.

About this chapter

Cite this chapter

Brown, G.J., Palomäki, K.J. (2005). Techniques for Robust Speech Recognition in Noisy and Reverberant Conditions. In: Divenyi, P. (eds) Speech Separation by Humans and Machines. Springer, Boston, MA. https://doi.org/10.1007/0-387-22794-6_14

Download citation

  • DOI: https://doi.org/10.1007/0-387-22794-6_14

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-8001-2

  • Online ISBN: 978-0-387-22794-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics