Skip to main content

Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs

  • Conference paper
  • First Online:
MultiMedia Modeling (MMM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10132))

Included in the following conference series:

Abstract

Text detection in complex background images is an important prerequisite for many image content analysis tasks. Actually, nearly all the widely-used methods of text detection focus on English and Chinese while some minority languages, such as Uyghur language, are paid less attention by researchers. In this paper, we propose a system which detects Uyghur language text in complex background images. First, component candidates are detected by the channel-enhanced Maximally Stable Extremal Regions (MSERs) algorithm. Then, most non-text regions are removed by a two-layer filtering mechanism. Next, the remaining component regions are connected into short chains, and the short chains are expanded by an expansion algorithm to connect the missed MSERs. Finally, the chains are identified by a Random Forest classifier. Experimental comparisons on the proposed dataset prove that our algorithm is effective for detecting Uyghur language text in complex background images. The F-measure is 84.8%, much better than the state-of-the-art performance of 75.5%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Xie, H., Zhang, Y., Tan, J., Guo, L., Li, J.: Contextual query expansion for image retrieval. IEEE Trans. Multimedia 16(4), 1104–1114 (2014)

    Article  Google Scholar 

  2. Nie, L., Yan, S., Wang, M., Hong, R., Chua, T.S.: Harvesting visual concepts for image search with complex queries. In: ACM MM (2012)

    Google Scholar 

  3. Yin, X.C., Yin, X., et al.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2013)

    Google Scholar 

  4. Huang, W., Qiao, Yu., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10593-2_33

    Google Scholar 

  5. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: IEEE CVPR, vol. 157, no. 10, pp. 3538–3545 (2012)

    Google Scholar 

  6. Yao, C., et al.: Detecting texts of arbitrary orientations in natural images. In: IEEE CVPR, pp. 1083–1090 (2012)

    Google Scholar 

  7. Zhang, Z., Shen, W., Yao, C., Bai, X.: Symmetry-based text line detection in natural scenes. In: IEEE CVPR, Boston, MA, June 2015

    Google Scholar 

  8. Xie, H., Gao, K., Zhang, Y., Tang, S., Li, J., Liu, Y.: Efficient feature detection and effective post-verification for large scale near-duplicate image search. IEEE Trans. Multimedia 13(6), 1319–1332 (2011)

    Article  Google Scholar 

  9. http://en.wikipedia.org/wiki/Uyghur_language

  10. Matas, J., et al.: Robust wide-baseline stereo from maximally stable extremal regions. IVC 2(10), 761–767 (2002)

    Google Scholar 

  11. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE CVPR, vol. 1, pp. 886–893 (2005)

    Google Scholar 

  12. Leo, B.: Random forests. Mach. Learn. 45(6), 422–432 (2001)

    MATH  Google Scholar 

  13. Wang, T., Wu, D.J., Coates, A., et al.: End-to-end text recognition with convolutional neural networks. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 3304–3308. IEEE (2012)

    Google Scholar 

  14. Chen, Z., Feng, B., Xie, H., Zheng, R., Xu, B.: Video to article hyperlinking by multiple tag property exploration. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014. LNCS, vol. 8325, pp. 62–73. Springer, Heidelberg (2014). doi:10.1007/978-3-319-04114-8_6

    Chapter  Google Scholar 

  15. Chen, Z., Chen, Y., Gao, X., et al.: Unobtrusive sensing incremental social contexts using fuzzy class incremental learning. In: International Conference on Data Mining (2015)

    Google Scholar 

  16. Gao, X., Chen, Z., Tang, S., Zhang, Y., Li, J.: Adaptive weighted imbalance learning with application to abnormal activity recognition. Neurocomputing 173, 1927–1935 (2016)

    Article  Google Scholar 

Download references

Acknowledgement

This work is supported by the National Nature Science Foundation of China (61303171, 61502477, 61502479), the “Strategic Priority Research Program” of the Chinese Academy of Sciences (XDA06031000).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongtao Xie .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Liu, S., Xie, H., Zhou, C., Mao, Z. (2017). Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science(), vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-51811-4_40

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-51810-7

  • Online ISBN: 978-3-319-51811-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics