Auditory Time-Frequency Masking: Psychoacoustical Data and Application to Audio Representations

Necciari, Thibaud; Balazs, Peter; Kronland-Martinet, Richard; Ystad, Sølvi; Laback, Bernhard; Savel, Sophie; Meunier, Sabine

doi:10.1007/978-3-642-31980-8_12

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7172)

Abstract

In this paper, the results of psychoacoustical experiments on auditory time-frequency (TF) masking using stimuli (masker and target) with maximal concentration in the TF plane are presented. The target was shifted either along the time axis, the frequency axis, or both relative to the masker. The results show that a simple superposition of spectral and temporal masking functions does not provide an accurate representation of the measured TF masking function. This confirms the inaccuracy of simple models of TF masking currently implemented in some perceptual audio codecs. In the context of audio signal processing, the present results constitute a crucial basis for the prediction of auditory masking in the TF representations of sounds. An algorithm that removes the inaudible components in the wavelet transform of a sound while causing no audible difference to the original sound after re-synthesis is proposed. Preliminary results are promising, although further development is required.
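
As an illustration only, the kind of simple superposition model that the abstract reports to be inaccurate can be sketched as follows. This is not the authors' algorithm or data: the `Atom` container, the function names, the linear-in-dB decay along each axis, and every numeric default are assumptions made for this sketch. It predicts a masked threshold by adding a temporal decay and a spectral decay, then drops the components that fall below that threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Atom:
    """One time-frequency component (hypothetical container for this sketch)."""
    time_ms: float   # temporal position
    freq_erb: float  # spectral position on an ERB-like scale
    level_db: float  # level in dB


def masked_threshold_db(masker, atom, peak_drop_db=20.0,
                        time_slope=1.0, freq_slope=10.0, quiet_db=0.0):
    """Masked threshold of `atom` under a simple superposition model.

    Assumption: masking decays linearly in dB with the time shift and with
    the frequency shift, and the two decays simply add. All defaults are
    placeholders, not measured values. Only forward masking is modelled:
    atoms preceding the masker are treated as unmasked.
    """
    dt = atom.time_ms - masker.time_ms
    if dt < 0:
        return quiet_db
    df = abs(atom.freq_erb - masker.freq_erb)
    predicted = masker.level_db - peak_drop_db - time_slope * dt - freq_slope * df
    return max(predicted, quiet_db)


def prune(atoms, masker):
    """Keep the masker and every atom whose level exceeds its masked threshold."""
    kept = []
    for atom in atoms:
        if atom is masker or atom.level_db > masked_threshold_db(masker, atom):
            kept.append(atom)
    return kept


masker = Atom(time_ms=0.0, freq_erb=10.0, level_db=80.0)
near = Atom(time_ms=5.0, freq_erb=10.0, level_db=40.0)   # close to the masker
far = Atom(time_ms=30.0, freq_erb=14.0, level_db=40.0)   # far in time and frequency
kept = prune([masker, near, far], masker)
```

With these placeholder values the nearby component is discarded and the distant one is kept. The measurements summarized in the abstract indicate that such an additive model misrepresents the true TF masking function, so a usable implementation would replace `masked_threshold_db` with a function fitted to the measured data.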

Author information

Authors and Affiliations

Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12–14, A-1040, Vienna, Austria
Thibaud Necciari, Peter Balazs & Bernhard Laback
Laboratoire de Mécanique et d’Acoustique, CNRS-UPR 7051, Aix-Marseille Univ., Centrale Marseille, F-13402, Marseille Cedex 20, France
Thibaud Necciari, Richard Kronland-Martinet, Sølvi Ystad, Sophie Savel & Sabine Meunier

Editor information

Editors and Affiliations

CNRS - LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Sølvi Ystad, Mitsuko Aramaki & Richard Kronland-Martinet
Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen
North Orissa University, Sriram Chandra Vihar, Takatpur, 757003, Baripada, Orissa, India
Sanghamitra Mohanty

About this paper

Cite this paper

Necciari, T. et al. (2012). Auditory Time-Frequency Masking: Psychoacoustical Data and Application to Audio Representations. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR 2011, FRSM 2011. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_12

DOI: https://doi.org/10.1007/978-3-642-31980-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31979-2
Online ISBN: 978-3-642-31980-8
eBook Packages: Computer Science, Computer Science (R0)
