Skip to main content

Auditory Time-Frequency Masking: Psychoacoustical Data and Application to Audio Representations

  • Conference paper
Speech, Sound and Music Processing: Embracing Research in India (CMMR 2011, FRSM 2011)

Abstract

In this paper, the results of psychoacoustical experiments on auditory time-frequency (TF) masking using stimuli (masker and target) with maximal concentration in the TF plane are presented. The target was shifted either along the time axis, the frequency axis, or both relative to the masker. The results show that a simple superposition of spectral and temporal masking functions does not provide an accurate representation of the measured TF masking function. This confirms the inaccuracy of simple models of TF masking currently implemented in some perceptual audio codecs. In the context of audio signal processing, the present results constitute a crucial basis for the prediction of auditory masking in the TF representations of sounds. An algorithm that removes the inaudible components in the wavelet transform of a sound while causing no audible difference to the original sound after re-synthesis is proposed. Preliminary results are promising, although further development is required.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 72.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agerkvist, F.T.: A time-frequency auditory model using wavelet packets. J. Audio Eng. Soc. 44(1/2), 37–50 (1996)

    Google Scholar 

  2. Balazs, P., Dörfler, M., Holighaus, N., Jaillet, F., Velasco, G.: Theory, implementation and applications of nonstationary Gabor frames. J. Comput. Appl. Math. 236(6), 1481–1496 (2011)

    Article  MathSciNet  MATH  Google Scholar 

  3. Balazs, P., Laback, B., Eckel, G., Deutsch, W.A.: Time-frequency sparsity by removing perceptually irrelevant components using a simple model of simultaneous masking. IEEE Trans. Audio Speech Lang. Process. 18(1), 34–49 (2010)

    Article  Google Scholar 

  4. Daubechies, I.: Ten Lectures on Wavelets, 1st edn. CMB-NSF Lecture Notes nr. 61. SIAM, Philadelphia (1992)

    Google Scholar 

  5. Delgutte, B.: Physiological mechanisms of psychophysical masking: Observations from auditory-nerve fibers. J. Acoust. Soc. Am. 87(2), 791–809 (1990)

    Article  Google Scholar 

  6. Duifhuis, H.: Consequences of peripheral frequency selectivity for nonsimultaneous masking. J. Acoust. Soc. Am. 54(6), 1471–1488 (1973)

    Article  Google Scholar 

  7. Fastl, H.: Temporal masking effects: III. Pure tone masker. Acustica 43(5), 282–294 (1979)

    Google Scholar 

  8. Florentine, M.: Level discrimination of tones as a function of duration. J. Acoust. Soc. Am. 79(3), 792–798 (1986)

    Article  Google Scholar 

  9. Glasberg, B.R., Moore, B.C.J.: Development and evaluation of a model for predicting the audibility of time-varying sounds in the presence of background sounds. J. Audio Eng. Soc. 53(10), 906–918 (2005)

    Google Scholar 

  10. Gröchening, K.: Foundations of time-frequency analysis, 1st edn. Birkhaüser, Boston (2001)

    Google Scholar 

  11. Hamdi, K.N., Ali, M., Tewfik, A.H.: Low bit rate high quality audio coding with combined harmonic and wavelet representations. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing (ICASSP 1996), Atlanta, GA, USA, vol. 2, pp. 1045–1048 (1996)

    Google Scholar 

  12. He, X., Scordilis, M.S.: Psychoacoustic music analysis based on the discrete wavelet packet transform. Res. Let. Signal Process. 2008(4), 1–5 (2008)

    Google Scholar 

  13. van der Heijden, M., Kohlrausch, A.: Using an excitation-pattern model to predict auditory masking. Hear. Res. 80, 38–52 (1994)

    Article  Google Scholar 

  14. Huang, Y.H., Chiueh, T.D.: A new audio coding scheme using a forward masking model and perceptually weighted vector quantization. IEEE Trans. Audio Speech Lang. Process. 10(5), 325–335 (2002)

    Article  Google Scholar 

  15. Jaillet, F., Balazs, P., Dörfler, M.: Nonstationary Gabor frames. In: Proc. of the 8th International Conference on Sampling Theory and Applications (SAMPTA 2009), Marseille, France (May 2009)

    Google Scholar 

  16. Jeong, H., Ih, J.: Implementation of a new algorithm using the STFT with variable frequency resolution for the time-frequency auditory model. J. Audio Eng. Soc. 47(4), 240–251 (1999)

    Google Scholar 

  17. Jepsen, M., Ewert, S.D., Dau, T.: A computational model of human auditory signal processing and perception. J. Acoust. Soc. Am. 124(1), 422–438 (2008)

    Article  Google Scholar 

  18. Kidd Jr., G., Feth, L.L.: Patterns of residual masking. Hear. Res. 5(1), 49–67 (1981)

    Article  Google Scholar 

  19. Laback, B., Balazs, P., Necciari, T., Savel, S., Meunier, S., Ystad, S., Kronland-Martinet, R.: Additivity of nonsimultaneous masking for short Gaussian-shaped sinusoids. J. Acoust. Soc. Am. 129(2), 888–897 (2011)

    Article  Google Scholar 

  20. Moore, B.C.J.: An introduction to the psychology of hearing, 5th edn. Academic Press, London (2003)

    Google Scholar 

  21. Moore, B.C.J., Alcántara, J.I., Glasberg, B.R.: Behavioural measurement of level-dependent shifts in the vibration pattern on the basilar membrane. Hear. Res. 163, 101–110 (2002)

    Article  Google Scholar 

  22. Moore, B.C.J., Alcántara, J.I., Dau, T.: Masking patterns for sinusoidal and narrow-band noise maskers. J. Acoust. Soc. Am. 104(2), 1023–1038 (1998)

    Article  Google Scholar 

  23. Necciari, T.: Auditory time-frequency masking: Psychoacoustical measures and application to the analysis-synthesis of sound signals. Ph.D. thesis, University of Provence Aix-Marseille I, France (October 2010)

    Google Scholar 

  24. O’Donovan, J.J., Dermot, J.F.: Perceptually motivated time-frequency analysis. J. Acoust. Soc. Am. 117(1), 250–262 (2005)

    Article  Google Scholar 

  25. Oxenham, A.J.: Forward masking: Adaptation or integration? J. Acoust. Soc. Am. 109(2), 732–741 (2001)

    Article  Google Scholar 

  26. Patterson, R.D., Allerhand, M.H., Giguère, C.: Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform. J. Acoust. Soc. Am. 98, 1890–1894 (1995)

    Article  Google Scholar 

  27. Plack, C.J., Oxenham, A.J., Drga, V.: Linear and nonlinear processes in temporal masking. Acta Acust. United Ac. 88(3), 348–358 (2002)

    Google Scholar 

  28. Plack, C.J., Oxenham, A.J.: Basilar-membrane nonlinearity and the growth of forward masking. J. Acoust. Soc. Am. 103(3), 1598–1608 (1998)

    Article  Google Scholar 

  29. Robles, L., Ruggero, A.: Mechanics of the mammalian cochlea. Physiol. Rev. 81(3), 1305–1352 (2001)

    Google Scholar 

  30. van Schijndel, N.H., Houtgast, T., Festen, J.M.: Intensity discrimination of Gaussian-windowed tones: Indications for the shape of the auditory frequency-time window. J. Acoust. Soc. Am. 105(6), 3425–3435 (1999)

    Article  Google Scholar 

  31. Soderquist, D.R., Carstens, A.A., Frank, G.J.H.: Backward, simultaneous, and forward masking as a function of signal delay and frequency. J. Aud. Res. 21, 227–245 (1981)

    Google Scholar 

  32. Spanias, P., Painter, T., Atti, V.: Audio Signal Processing and Coding. Wiley-Interscience, Hoboken (2007)

    Book  Google Scholar 

  33. Terhardt, E.: Calculating virtual pitch. Hear. Res. 1, 155–182 (1979)

    Article  Google Scholar 

  34. Vafin, R., Andersen, S.V., Kleijn, W.B.: Exploiting time and frequency masking in consistent sinusoidal analysis-synthesis. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing (ICASSP 2000), Istanbul, Turkey, vol. 2, pp. 901–904 (2000)

    Google Scholar 

  35. Vetterli, M., Kovačević, J.: Wavelets and subband coding. Prentice Hall PTR, Englewood Cliffs (1995)

    MATH  Google Scholar 

  36. Zwicker, E.: Dependence of post-masking on masker duration and its relation to temporal effects in loudness. J. Acoust. Soc. Am. 75(1), 219–223 (1984)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Necciari, T. et al. (2012). Auditory Time-Frequency Masking: Psychoacoustical Data and Application to Audio Representations. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR FRSM 2011 2011. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31980-8_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31979-2

  • Online ISBN: 978-3-642-31980-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics