Noise Suppression Method Based on Modulation Spectrum Analysis

Isoyama, Takuto; Unoki, Masashi

doi:10.1007/978-3-319-99579-3_25

Takuto Isoyama¹⁶ &
Masashi Unoki¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11096))

Included in the following conference series:

International Conference on Speech and Computer

1460 Accesses

Abstract

Conventional methods for noise suppression can successfully reduce stationary noise. However, non-stationary noise such as intermittent and impulsive noise cannot be sufficiently suppressed since these methods do not focus on temporal features of noise. This paper proposes a method for suppressing both stationary and non-stationary noise based on modulation spectrum analysis. Modulation spectra (MS) of the stationary, intermittent, and impulsive noise were investigated by using the time/frequency/modulation analysis techniques to characterize the MS features. These features were then used to suppress the stationary and non-stationary noise components from the observed signals. Using the proposed method, the direct-current components of the MS in the stationary noise, harmonicity of the MS in the intermittent noise, and higher modulation-frequency components of the MS in the impulsive noise were removed. The following advantages of the proposed method were confirmed: (1) sound pressure level of the noise was dramatically reduced, (2) signal-to-noise ratio of the noisy speech was improved, and (3) loudness, sharpness, and roughness of the restored speech were enhanced. These results indicate that the stationary as well as non-stationary noise can be successfully suppressed using the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Boll, S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27, 113–120 (1979)
Article Google Scholar
Takehara, R., Kawamura, A., Iiguni, Y.: Impulsive noise suppression using interpolated zero phase signal. In: APSIPA2017, pp. 1382–1389 (2017)
Google Scholar
Zhiyao, D., Gautham, J.M., Paris, S.: Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments. In: Proceedings of Interspeech 2012, pp. 595–598 (2012)
Google Scholar
Stephan, D.E., Torsten, D.: Characterizing frequency selectivity for envelope fluctuations. J. Acoust. Soc. Am. 108, 1181 (2000)
Article Google Scholar
Patterson, R., Nimmo-Smith, L., Holdsworth, J., Rice, P.: An auditory filter bank based on the gammatone function. Paper Presented at a Meeting of the IOC Speech Group on Auditory Modelling at RSRE, pp. 14–15 (1987)
Google Scholar
Kondo, T., Amano, S., Sakamoto, S., Susuki, Y.: Development of familiarity-controlled word-lists (FW07). IEICE Tech. Rep. 107(436), 43–48 (2008)
Google Scholar
Varga, A., Steeneken, J.M.H.: Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems. Speech Commun. 12(13), 247–251 (1993)
Article Google Scholar
Atlas, L., Greenberg, S., Hermansky, H.: The Modulation Spectrum and Its Application to Speech Science and Technology. Interspeech Tutorial, Antwerp (2007)
Google Scholar
Kanai, Y., Morita, S., Unoki, M.: Concurrent processing of voice activity detection and noise reduction using empirical mode decomposition and modulation spectrum analysis. In: Proceedings of INTERSPEECH, pp. 742–746 (2013)
Google Scholar
Zwicker, F.: Psychoacoustics: Facts and Models. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-68888-4
Book Google Scholar

Download references

Acknowledgments

This work was supported by the Secom Science and Technology Foundation by the Suzuki Foundation, and by a Grant in Aid for Innovative Areas (No. 16H01669, and 18H05004) from MEXT, Japan.

Author information

Authors and Affiliations

Japan Advanced Institute of Science and Technology, 1–1 Asahidai, Nomi, Ishikawa, 923–1292, Japan
Takuto Isoyama & Masashi Unoki

Authors

Takuto Isoyama
View author publications
You can also search for this author in PubMed Google Scholar
Masashi Unoki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Takuto Isoyama or Masashi Unoki .

Editor information

Editors and Affiliations

SPIIRAS, St. Petersburg, Russia
Alexey Karpov
Leipzig University of Telecommunications, Leipzig, Germany
Oliver Jokisch
Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Isoyama, T., Unoki, M. (2018). Noise Suppression Method Based on Modulation Spectrum Analysis. In: Karpov, A., Jokisch, O., Potapova, R. (eds) Speech and Computer. SPECOM 2018. Lecture Notes in Computer Science(), vol 11096. Springer, Cham. https://doi.org/10.1007/978-3-319-99579-3_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-99579-3_25
Published: 25 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99578-6
Online ISBN: 978-3-319-99579-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics