Skip to main content

A Video Text Detection Method Based on Key Text Points

  • Conference paper
Advances in Multimedia Information Processing - PCM 2010 (PCM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6297))

Included in the following conference series:

Abstract

This paper proposes a novel video text detection method based on the key text points. For text detection, the keyframes is decomposed by wavelet transform. The key text points (KTPs) are determined by three resulting high-frequency subbands, and merged by the morphological operations. The anti-texture-direction-projection method is proposed for text line localization and verification. A fast text tracking scheme is proposed, in which text detection is only implemented on the first keyframe of an identical text line in the duration. The appearing (disappearing) frame is determined by a fast search method. Experimental results show that the proposed text detection method is robust to the font size, style, color and alignment of texts. The proposed text tracking greatly speeds up the text detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tang, X., Gao, X., Liu, J., Zhang, H.: A spatial-temporal approach for video caption detection and recognition. IEEE Transaction on Neural Networks 13, 961–971 (2002)

    Article  Google Scholar 

  2. Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing 23, 565–576 (2005)

    Article  Google Scholar 

  3. Hase, H., Shinokawa, T., Yoneda, M., Suen, C.Y.: Character string extraction from color documents. Pattern Recognition 34, 1349–1365 (2001)

    Article  MATH  Google Scholar 

  4. Qian, X., Liu, G., Wang, H., Su, R.: Text detection, localization, and tracking in compressed video. Signal Processing: Image communication 22, 752–768 (2007)

    Article  Google Scholar 

  5. Lyu, M.R., Song, J.Q., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Transaction on Circuits and Systems for Video Technology 15, 243–255 (2005)

    Article  Google Scholar 

  6. Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recognition 37, 977–997 (2004)

    Article  Google Scholar 

  7. Chen, T.: Text localization using DWT fusion algorithm. In: IEEE International Conference on Communication Technology, pp. 722–725 (2008)

    Google Scholar 

  8. Chen, D., Odobez, J., Thiran, J.: A localization/ verification scheme for finding text in images and video frames based on contrast independent features and machine learning methods. Signal Processing: Image Communication 19, 205–217 (2004)

    Article  Google Scholar 

  9. Hua, X.S., Yin, P., Zhang, H.J.: Efficient video text recognition using multiple frame integration. In: IEEE International Conference on Image Processing, vol. 2, pp. 397–400 (2002)

    Google Scholar 

  10. Wang, R., Jin, W., Wu, L.: A novel video caption detection approach using multi-frame integration. In: International Conference on Pattern Recognition, pp. 449–452 (2004)

    Google Scholar 

  11. Sato, T., Kanade, T.: Video OCR: Indexing digital news libraries by recognition of superimposed captions. Multimedia Systems 7, 385–395 (1999)

    Article  Google Scholar 

  12. Lienhart, R., Effelsberg, W.: Automatic text segmentation and text recognition for video indexing. Multimedia Systems 8, 69–81 (2000)

    Article  Google Scholar 

  13. Tanaka, M., Goto, H.: Text-tracking wearable camera system for visually-impaired people. In: International Conference on Pattern Recognition, pp. 1–4 (2008)

    Google Scholar 

  14. Gargi, U., Crandall, D., Antani, S., Gandhi, T., Keener, R., Kasturi, R.: A system for automatic text detection in video. In: International Conference on Document Analysis and Recognition, pp. 29–32 (1999)

    Google Scholar 

  15. Jiang, H., Liu, G., Qian, X., Nan, N., Guo, D., Li, Z., Sun, L.: A fast and effective text tracking in compressed video. In: IEEE International Symposium on Multimedia (ISM), pp. 136–141 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, Z., Liu, G., Qian, X., Wang, C., Ma, Y., Yang, Y. (2010). A Video Text Detection Method Based on Key Text Points. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15702-8_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15702-8_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15701-1

  • Online ISBN: 978-3-642-15702-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics