Audio Effect for Highlighting Speaker’s Voice Corrupted by Background Noise on Portable Digital Imaging Devices

Kang, Jin Ah; Chun, Chan Jun; Kim, Hong Kook; Kim, Ji Woon; Kim, Myeong Bo

doi:10.1007/978-3-642-20998-7_5


Part of the book series: Communications in Computer and Information Science (CCIS, volume 151)

Included in the following conference series:

International Conference on Ubiquitous Computing and Multimedia Applications

Abstract

In this paper, an audio effect (AE) algorithm is proposed for portable digital imaging devices so that video content can be enjoyed more effectively. To highlight the speaker’s voice, the proposed AE algorithm enhances speech signals corrupted by background noise, based on audio content classification (ACC) and signal-to-noise ratio (SNR) estimation. The ACC classifies each short segment of audio content as speech, non-speech, or mixed signal, using parameters such as signal energy, sub-band energy, and the residual signal energy obtained from linear prediction analysis. The signals are then adaptively scaled according to the classification result and the estimated SNR. To show the effectiveness of the proposed AE algorithm, an informal listening test is performed that compares the original audio content with the versions processed by the proposed AE algorithm. The test shows that the proposed AE algorithm significantly improves audio quality.
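The pipeline outlined in the abstract (per-segment features, three-way classification, SNR-dependent scaling) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no thresholds, filter designs, or gain rules, so every function name, the two-tap low-pass used as a stand-in for sub-band analysis, the voting rule, and all numeric constants below are assumptions chosen only for illustration.

```python
import math
import random

def energy(x):
    """Mean-square energy of one frame."""
    return sum(v * v for v in x) / len(x)

def low_band_ratio(x):
    """Share of energy left after a 2-tap averaging low-pass (assumed sub-band proxy)."""
    prev, low = 0.0, 0.0
    for v in x:
        y = 0.5 * (v + prev)
        low += y * y
        prev = v
    total = sum(v * v for v in x)
    return low / total if total > 0.0 else 0.0

def lp_residual_energy(x, order=8):
    """Linear-prediction error energy via Levinson-Durbin (autocorrelation method)."""
    n = len(x)
    r = [sum(x[i] * x[i + k] for i in range(n - k)) / n for k in range(order + 1)]
    err = r[0]
    if err <= 0.0:
        return 0.0
    a = [0.0] * (order + 1)
    for m in range(1, order + 1):
        k = (r[m] - sum(a[j] * r[m - j] for j in range(1, m))) / err
        prev = a[:]
        a[m] = k
        for j in range(1, m):
            a[j] = prev[j] - k * prev[m - j]
        err *= 1.0 - k * k
        if err <= 1e-12:
            break
    return max(err, 0.0)

def features(frame, order=8, floor=1e-8):
    """Return (energy, low-band ratio, prediction gain) for one frame."""
    e = energy(frame)
    if e < floor:
        return e, 0.0, 1.0
    res = lp_residual_energy(frame, order)
    return e, low_band_ratio(frame), e / max(res, floor)

def classify(e, low_ratio, pred_gain, silence=1e-6, ratio_thr=0.8, gain_thr=4.0):
    """Hypothetical voting rule: two votes = speech, one = mixed, none = non-speech."""
    if e < silence:
        return "non-speech"
    votes = (low_ratio > ratio_thr) + (pred_gain > gain_thr)
    return ("non-speech", "mixed", "speech")[votes]

def estimate_snr_db(frame_energy, noise_energy, eps=1e-12):
    """SNR in dB from frame energy and a separately tracked noise-energy estimate."""
    signal = max(frame_energy - noise_energy, eps)
    return 10.0 * math.log10(signal / max(noise_energy, eps))

def scale_gain(label, snr_db, max_boost=2.0, min_gain=0.5):
    """Assumed gain rule: boost speech more at low SNR, attenuate non-speech."""
    if label == "non-speech":
        return min_gain
    need = min(max((20.0 - snr_db) / 20.0, 0.0), 1.0)  # 0 at >= 20 dB, 1 at <= 0 dB
    boost = 1.0 + (max_boost - 1.0) * need
    return boost if label == "speech" else 0.5 * (1.0 + boost)

def enhance(frame, noise_energy):
    """Classify one frame, estimate its SNR, and apply the resulting gain."""
    e, ratio, gain = features(frame)
    label = classify(e, ratio, gain)
    g = scale_gain(label, estimate_snr_db(e, noise_energy))
    return [g * v for v in frame]

# Synthetic 20 ms frames at an assumed 8 kHz sampling rate.
tone = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(160)]
rng = random.Random(0)
noise = [rng.uniform(-1.0, 1.0) for _ in range(160)]
```

In this sketch a low-frequency tone, which is both highly predictable and concentrated in the low band, stands in for voiced speech, while white noise stands in for background content. A real system would presumably smooth the per-frame gains over time to avoid audible pumping, which the sketch omits.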

Author information

Authors and Affiliations

School of Information and Communications, Gwangju Institute of Science and Technology (GIST), Gwangju, 500-712, Korea
Jin Ah Kang, Chan Jun Chun & Hong Kook Kim
Digital Imaging Business, Samsung Electronics, Suwon-si, Gyenggi-do, 443-742, Korea
Ji Woon Kim & Myeong Bo Kim

Editor information

Editors and Affiliations

Multimedia Engineering Department, Hannam University, 133 Ojeong-dong, Daeduk-gu, Daejeon, Korea
Tai-hoon Kim, Rosslin John Robles & Maricel Balitanas
The Ohio State University, 470 Hitchcock Hall, 2070 Neil Avenue, 43210-1275, Columbus, OH, USA
Hojjat Adeli

About this paper

Cite this paper

Kang, J.A., Chun, C.J., Kim, H.K., Kim, J.W., Kim, M.B. (2011). Audio Effect for Highlighting Speaker’s Voice Corrupted by Background Noise on Portable Digital Imaging Devices. In: Kim, Th., Adeli, H., Robles, R.J., Balitanas, M. (eds) Ubiquitous Computing and Multimedia Applications. UCMA 2011. Communications in Computer and Information Science, vol 151. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20998-7_5

DOI: https://doi.org/10.1007/978-3-642-20998-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20997-0
Online ISBN: 978-3-642-20998-7
eBook Packages: Computer Science, Computer Science (R0)
