Advertisement

Entity Detection for Information Retrieval in Video Streams

  • Sanghee Lee
  • Kanghyun Jo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10956)

Abstract

The growing amount of video data has raised the need for automatic semantic information indexing and retrieval systems. To accomplish to these needs, the text information in images and videos is proved to be an important source of high-level semantics. This paper discusses the video OCR system designed for overlay text based automatic indexing and retrieval in the video streams. The proposed framework consists of the video segmentation, the video key-frame extraction, the video text recognition, and the entity detection. The experimental results on Korean television news programs show that the proposed method efficiently realizes the automatic indexing in the video streams.

Keywords

Overlay text Video OCR Indexing Named entity NLP 

Notes

Acknowledgment

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Grand Information Technology Research Center support program (IITP-2018-2016-0-00318) supervised by the IITP (Institute for Information & communications Technology Promotion).

References

  1. 1.
    Jung, C., Kim, J.: Player information extraction for semantic annotation in golf videos. IEEE Tran. Broadcast. 55(1), 79–83 (2009)MathSciNetCrossRefGoogle Scholar
  2. 2.
    Saidane, Z., Garcia, C.: An automatic method video character segmentation. In: International Conference Image Analysis and Recognition, pp. 557–566 (2014)Google Scholar
  3. 3.
    Lee, S., Ahn, J., Jo, K.: Comparison of text beginning frame detection methods in news video sequences. J. Broadcast. Eng. 21(3), 307–318 (2016)CrossRefGoogle Scholar
  4. 4.
    Lee, S., Ahn, J., Lee, Y., Jo, K.: Beginning frame and edge based name text localization in news interview videos. In: Huang, D.-S., Han, K., Hussain, A. (eds.) ICIC 2016. LNCS (LNAI), vol. 9773, pp. 583–594. Springer, Cham (2016).  https://doi.org/10.1007/978-3-319-42297-8_54CrossRefGoogle Scholar
  5. 5.
    Otus, N.: A thresholding selection method from gray level histogram. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRefGoogle Scholar
  6. 6.
    ABBYY Cloud OCR SDK. ocrsdk.com
  7. 7.
    NLTK Language Toolkit. www.nltk.org

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.School of Electrical EngineeringUniversity of UlsanUlsanKorea

Personalised recommendations