Text Detection Through Hidden Markov Random Field and EM-Algorithm

  • H. T. BasavarajuEmail author
  • V. N. Manjunath Aradhya
  • D. S. Guru
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 862)


The text is a dominant source and delivers semantic information about a particular content of the respective image or video. Human often gives importance to the text than any other objects in an image or a video frame. Text detection is one of the prime part of the text information extraction process. Text detection process is an exciting and emerging research area in the zone of pattern recognition, and computer vision due to the complex background, illumination, and arbitrary orientation. In this paper, the Hidden Markov Random Field (HMRF) method and Expectation-Maximization (EM) algorithm are employed to detect the arbitrarily oriented multilingual text present in an image or a video frame. The proposed method calculates the max-min cluster to maximize the discrimination between textual and non-textual region. HMRF separates the textual region. EM algorithm maximizes the likelihood of the parameters. Laplacian of Gaussian process is used to identify the potential text information. The double line structure concept is employed to extract the true text region. The proposed method is evaluated on Hua’s dataset, arbitrarily oriented dataset, and horizontal dataset with performance measures recall, precision, and f-measure. The outcome shows that the approach is promising and encouraging.


Hidden Markov Random Field EM-algorithm Multilingual text Arbitrary oriented Laplacian of Gaussian 


  1. 1.
    Y. Wu, W. Wang, S. Palaiahnakote, T. Lu, A robust symmetry-based method for scene/video text detection through neural network, in 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 1249–1254 (2017)Google Scholar
  2. 2.
    X. He, Y. Song, Y. Zhang, Scene text detection based on skeleton-cut detector, in IEEE International Conference on Image Processing (ICIP), pp. 3375–3379 (2017)Google Scholar
  3. 3.
    M. Zarechensky, Text detection in natural scenes with multilingual text, in Proceedings of the Tenth Spring Researcher’s Colloquium on Database and Information Systems, pp. 32–35 (2013)Google Scholar
  4. 4.
    N. Sharma, P. Shivakumara, U. Pal, M. Blumenstein, C.L. Tan, A new method for word segmentation from arbitrarily-oriented video text lines, in International Conference on Digital Image Computing Techniques and Applications (DICTA), pp. 1–8 (2012)Google Scholar
  5. 5.
    J. Lim, J. Park, G.G. Medioni, Text segmentation in color images using tensor voting. Image Vis. Comput. 25(5), 671–685 (2007)CrossRefGoogle Scholar
  6. 6.
    M.S. Pavithra, V.N.M. Aradhya, A comprehensive of transforms, Gabor filter and k-means clustering for text detection in images and video. Appl. Comput. Inform. 1–15 (2014)Google Scholar
  7. 7.
    P. Shivakumara, D.S. Guru, H.T. Basavaraju, Color and Gradient Features for Text Segmentation from Video Frames, in International Conference on Multimedia Processing, Communication and Computing Applications, pp. 267–278 (2013)Google Scholar
  8. 8.
    K. Minemura, S. Palaiahnakote, K. Wong, Multi-oriented text detection for intra-frame in H. 264/AVC video, in International Symposium on Intelligent Signal Processing and Communication Systems, pp. 330–335 (2014)Google Scholar
  9. 9.
    M. Saric, Scene text segmentation using low variation extremal regions and sorting based character grouping. Neurocomputing 266, 56–65 (2017)CrossRefGoogle Scholar
  10. 10.
    P. Shivakumara, H.T. Basavaraju, D.S. Guru, C.L. Tan, Detection of curved text in video: quad tree based method, in 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 594–598 (2013)Google Scholar
  11. 11.
    J. Ma, W. Shao, H. Ye, L. Wang, H. Wang, Y. Zheng, X. Xue, Arbitrary-oriented scene text detection via rotation proposals. arXiv preprint arXiv:1703.01086 (2017)
  12. 12.
    X.S. Hua, L. Wenyin, H.J. Zhang, An automatic performance evaluation protocol for video text detection algorithms. IEEE Trans. CSVT, 498–507 (2004)CrossRefGoogle Scholar
  13. 13.
    P. Shivakumara, T.Q. Phan, C.L. Tan, New Fourier-Statistical Features in RGB Space for Video Text Detection, in IEEE Transaction on CSVT, pp. 1520–1532 (2010)CrossRefGoogle Scholar
  14. 14.
    J. Zhou, L. Xu, B. Xiao, R. Dai, A robust system for text extraction in video. In: Proceeding ICMV, pp. 119–124 (2007)Google Scholar
  15. 15.
    C. Lu, C. Wang, R. Dai, Text detection in images based on unsupervised classification of edge-based features, in Proceedings ICDAR, pp. 610–614 (2005)Google Scholar
  16. 16.
    E.K. Wong, M. Chen, A new robust algorithm for video text extraction. Pattern Recognition 1397–1406 (2003)CrossRefGoogle Scholar
  17. 17.
    H.T. Basavaraju, V.N.M. Aradhya, D.S. Guru, A novel arbitrary-oriented multilingual text detection in images/video. Inf. Decis. Sci. 519 –529 (2018)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • H. T. Basavaraju
    • 1
    Email author
  • V. N. Manjunath Aradhya
    • 1
  • D. S. Guru
    • 2
  1. 1.Department of Master of Computer ApplicationsJSS Science and Technology University (Sri Jayachamarajendra College of Engineering)MysoreIndia
  2. 2.Department of Studies in Computer ScienceUniversity of MysoreMysoreIndia

Personalised recommendations