Skip to main content

Dynamic Subtitle Authoring Method Based on Audio Analysis for the Hearing Impaired

  • Conference paper
Computers Helping People with Special Needs (ICCHP 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8547))

Included in the following conference series:

Abstract

The broadcasting and the Internet are important parts of modern society that a life without media is now unimaginable. However, hearing impaired people have difficulty in understanding media content due to the loss of audio information. If subtitles are available, subtitling with video can be helpful. In this paper, we propose a dynamic subtitle authoring method based on audio analysis for the hearing impaired. We analyze the audio signal and explore a set of audio features that include STE, ZCR, Pitch and MFCC. Using these features, we align the subtitle with the speech and match extracted speech features to subtitle as different text colors, sizes and thicknesses. Furthermore, it highlights the text via aligning them with the voice and tagging the speaker ID using the speaker recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. BSeries, B.T.: Accessibility to broadcasting services for persons with disabilities (2011)

    Google Scholar 

  2. Abrahamian, S.: N. T. S. C. In: EIA-608 and EIA-708 closed captioning (2006)

    Google Scholar 

  3. Boyd, J., Vader, E.A.: Captioned television for the deaf. Am. Ann. Hearing Impaired 117(1), 32–37 (1972)

    Google Scholar 

  4. Hong, R., et al.: Dynamic captioning: video accessibility enhancement for hearing impairment. In: Proceedings of the International Conference on Multimedia. ACM (2010)

    Google Scholar 

  5. Seto, S., et al.: Subtitle system visualizing non-verbal expressions in voice for hearing impaired-Ambient Font. In: Proceeding of the 10th Asia-Pacific Industrial Engineering and Management Systems (2010)

    Google Scholar 

  6. Ververidis, D., Kotropoulos, C.: Emotional speech recognition: Resources, features, and methods. Speech Communication 48(9), 1162–1181 (2006)

    Article  Google Scholar 

  7. Jalil, M., Butt, F.A., Malik, A.: Short-time energy, magnitude, zero crossing rate and autocorrelation measurement for discriminating voiced and unvoiced segments of speech signals. In: International Conference on Technological Advances in Electrical, Electronics and Computer Engineering (2013)

    Google Scholar 

  8. Hess, W.: Pitch Determination of Speech Signals. Springer (1983)

    Google Scholar 

  9. Hasan, M.R., Jamil, M., Rabbani, M.G., Rahman, M.S.: Speaker identification using mel frequency cepstral coefficients (2004)

    Google Scholar 

  10. https://instruct1.cit.cornell.edu/courses/ece576/FinalProjects/f2008/pae26_jsc59/pae26_jsc59/

  11. Kim, N.: A Study on Multimedia Application Service using DTV Closed Caption Data. Journal of Broadcast Engineering (2009)

    Google Scholar 

  12. Peter, O.L.: Making Television Accessible. Report published by the International Tele-communications Union, in collaboration with The Global Initiative for Inclusive Information and Communication Technologies. ITU. Media accessibility 101 (2011)

    Google Scholar 

  13. Maryon, E.: The Science of Tone-Color. CC Birchard & Co., Boston (1924)

    Google Scholar 

  14. Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(1) (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lim, W., Jang, I., Ahn, C. (2014). Dynamic Subtitle Authoring Method Based on Audio Analysis for the Hearing Impaired. In: Miesenberger, K., Fels, D., Archambault, D., Peňáz, P., Zagler, W. (eds) Computers Helping People with Special Needs. ICCHP 2014. Lecture Notes in Computer Science, vol 8547. Springer, Cham. https://doi.org/10.1007/978-3-319-08596-8_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08596-8_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08595-1

  • Online ISBN: 978-3-319-08596-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics