Skip to main content

Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video

  • Conference paper
  • First Online:
Recent Advances in Visual Information Systems (VISUAL 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2314))

Included in the following conference series:

Abstract

In this paper, a novel approach of automatic closed caption detection and font size differentiation among localized text regions in I-frames of MPEG videos is proposed. The approach consists of five modules: video segmentation, shot selection, caption frame detection, caption localization and font size differentiation. Rather than directly examines scene cut frame by frame, the module of video segmentation first verifies video streams GOP by GOP and then finds out the actual scene boundaries in the frame level. Tennis videos are selected as the case study and the module of shot selection is designed to automatically select specific type of shot for further closed caption detection. The noise of potential captions is filtered out based on its long-term consistency over consecutive frames. While the general closed captions are localized, we select the specific caption that is discriminated utilizing the module of font size differentiation. The detected closed captions can support video structuring, video browsing, high-level video indexing and video content description in MPEG-7. Experimental results show the effectiveness and the feasibility of the proposed scheme.

The research is partially supported by Lee & MTI Center, National Chiao-Tung University, Taiwan and National Science Council, Taiwan.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. H. Wang and S. F. Chang, “A Highly Efficient System for Automatic Face Region Detection in MPEG Video,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 7, No. 4, Aug. 1997, pp. 615–628.

    Article  MathSciNet  Google Scholar 

  2. Y. Zhong, H. Zhang and A. K. Jain, “Automatic Caption Localization in Compressed Video,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 4, Apr. 2000, pp. 385–392.

    Article  Google Scholar 

  3. H. Luo and A. Eleftheriadis, “On Face Detection in the Compressed Domain,” Proc. of ACM Multimedia 2000, pp. 285–294.

    Google Scholar 

  4. Y. Zhang and T. S. Chua, “Detection of Text Captions in Compressed Domain Video,” Proc. of ACM Multimedia Workshop, 2000, pp. 201–204.

    Google Scholar 

  5. S. W. Lee, Y. M. Kim and S. W. Choi, “Fast Scene Change Detection using Direct Feature Extraction from MPEG Compressed Videos,” IEEE Transactions on Multimedia, Vol. 2, No. 4, Dec. 2000, pp. 240–254.

    Article  Google Scholar 

  6. X. Chen and H. Zhang, “Text Area Detection from Video Frames,” Proc. of 2nd IEEE Pacific Rim Conference on Multimedia, Oct. 2001, pp. 222–228.

    Google Scholar 

  7. J. Nang, O. Kwon and S. Hong, “Caption Processing for MPEG Video in MC-DCT Compressed Domain,” Proc of ACM Multimedia Workshop, 2000, pp. 211–214.

    Google Scholar 

  8. S. Y. Lee, J. L. Lian and D. Y. Chen, “Video Summary and Browsing Based on Story-Unit for Video-on-Demand Service,” Proc. International Conference on ICICS, Oct. 2001.

    Google Scholar 

  9. J. L. Mitchell, W. B. Pennebaker, Chad E. Fogg, and Didier J. LeGall, “MPEG VIDEO COMPRESSION STANDARD,” Chapman&Hall, NY, USA, 1997.

    Google Scholar 

  10. J. Meng, Y. Juan, S.F. Chang, “Scene Change Detection in a MPEG Compressed Video Sequence,” Proc. IS&T/SPIE, Vol. 2419, 1995, pp.14–25.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Duan-Yu, C., Ming-Ho, H., Suh-Yin, L. (2002). Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video. In: Chang, SK., Chen, Z., Lee, SY. (eds) Recent Advances in Visual Information Systems. VISUAL 2002. Lecture Notes in Computer Science, vol 2314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45925-1_26

Download citation

  • DOI: https://doi.org/10.1007/3-540-45925-1_26

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43358-3

  • Online ISBN: 978-3-540-45925-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics