Entity Detection for Information Retrieval in Video Streams

  • Sanghee Lee
  • Kanghyun JoEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10956)


The growing amount of video data has raised the need for automatic semantic information indexing and retrieval systems. To accomplish to these needs, the text information in images and videos is proved to be an important source of high-level semantics. This paper discusses the video OCR system designed for overlay text based automatic indexing and retrieval in the video streams. The proposed framework consists of the video segmentation, the video key-frame extraction, the video text recognition, and the entity detection. The experimental results on Korean television news programs show that the proposed method efficiently realizes the automatic indexing in the video streams.


Overlay text Video OCR Indexing Named entity NLP 



This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the Grand Information Technology Research Center support program (IITP-2018-2016-0-00318) supervised by the IITP (Institute for Information & communications Technology Promotion).


  1. 1.
    Jung, C., Kim, J.: Player information extraction for semantic annotation in golf videos. IEEE Tran. Broadcast. 55(1), 79–83 (2009)MathSciNetCrossRefGoogle Scholar
  2. 2.
    Saidane, Z., Garcia, C.: An automatic method video character segmentation. In: International Conference Image Analysis and Recognition, pp. 557–566 (2014)Google Scholar
  3. 3.
    Lee, S., Ahn, J., Jo, K.: Comparison of text beginning frame detection methods in news video sequences. J. Broadcast. Eng. 21(3), 307–318 (2016)CrossRefGoogle Scholar
  4. 4.
    Lee, S., Ahn, J., Lee, Y., Jo, K.: Beginning frame and edge based name text localization in news interview videos. In: Huang, D.-S., Han, K., Hussain, A. (eds.) ICIC 2016. LNCS (LNAI), vol. 9773, pp. 583–594. Springer, Cham (2016). Scholar
  5. 5.
    Otus, N.: A thresholding selection method from gray level histogram. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRefGoogle Scholar
  6. 6.
    ABBYY Cloud OCR SDK.
  7. 7.
    NLTK Language Toolkit.

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.School of Electrical EngineeringUniversity of UlsanUlsanKorea

Personalised recommendations