Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs

Liu, Shun; Xie, Hongtao; Zhou, Chuan; Mao, Zhendong

doi:10.1007/978-3-319-51811-4_40

Shun Liu¹⁸,
Hongtao Xie¹⁸,
Chuan Zhou^18,19 &
…
Zhendong Mao¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10132))

Included in the following conference series:

International Conference on Multimedia Modeling

3240 Accesses
4 Citations

Abstract

Text detection in complex background images is an important prerequisite for many image content analysis tasks. Actually, nearly all the widely-used methods of text detection focus on English and Chinese while some minority languages, such as Uyghur language, are paid less attention by researchers. In this paper, we propose a system which detects Uyghur language text in complex background images. First, component candidates are detected by the channel-enhanced Maximally Stable Extremal Regions (MSERs) algorithm. Then, most non-text regions are removed by a two-layer filtering mechanism. Next, the remaining component regions are connected into short chains, and the short chains are expanded by an expansion algorithm to connect the missed MSERs. Finally, the chains are identified by a Random Forest classifier. Experimental comparisons on the proposed dataset prove that our algorithm is effective for detecting Uyghur language text in complex background images. The F-measure is 84.8%, much better than the state-of-the-art performance of 75.5%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Xie, H., Zhang, Y., Tan, J., Guo, L., Li, J.: Contextual query expansion for image retrieval. IEEE Trans. Multimedia 16(4), 1104–1114 (2014)
Article Google Scholar
Nie, L., Yan, S., Wang, M., Hong, R., Chua, T.S.: Harvesting visual concepts for image search with complex queries. In: ACM MM (2012)
Google Scholar
Yin, X.C., Yin, X., et al.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2013)
Google Scholar
Huang, W., Qiao, Yu., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10593-2_33
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: IEEE CVPR, vol. 157, no. 10, pp. 3538–3545 (2012)
Google Scholar
Yao, C., et al.: Detecting texts of arbitrary orientations in natural images. In: IEEE CVPR, pp. 1083–1090 (2012)
Google Scholar
Zhang, Z., Shen, W., Yao, C., Bai, X.: Symmetry-based text line detection in natural scenes. In: IEEE CVPR, Boston, MA, June 2015
Google Scholar
Xie, H., Gao, K., Zhang, Y., Tang, S., Li, J., Liu, Y.: Efficient feature detection and effective post-verification for large scale near-duplicate image search. IEEE Trans. Multimedia 13(6), 1319–1332 (2011)
Article Google Scholar
http://en.wikipedia.org/wiki/Uyghur_language
Matas, J., et al.: Robust wide-baseline stereo from maximally stable extremal regions. IVC 2(10), 761–767 (2002)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE CVPR, vol. 1, pp. 886–893 (2005)
Google Scholar
Leo, B.: Random forests. Mach. Learn. 45(6), 422–432 (2001)
MATH Google Scholar
Wang, T., Wu, D.J., Coates, A., et al.: End-to-end text recognition with convolutional neural networks. In: 2012 21st International Conference on Pattern Recognition (ICPR), pp. 3304–3308. IEEE (2012)
Google Scholar
Chen, Z., Feng, B., Xie, H., Zheng, R., Xu, B.: Video to article hyperlinking by multiple tag property exploration. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014. LNCS, vol. 8325, pp. 62–73. Springer, Heidelberg (2014). doi:10.1007/978-3-319-04114-8_6
Chapter Google Scholar
Chen, Z., Chen, Y., Gao, X., et al.: Unobtrusive sensing incremental social contexts using fuzzy class incremental learning. In: International Conference on Data Mining (2015)
Google Scholar
Gao, X., Chen, Z., Tang, S., Zhang, Y., Li, J.: Adaptive weighted imbalance learning with application to abnormal activity recognition. Neurocomputing 173, 1927–1935 (2016)
Article Google Scholar

Download references

Acknowledgement

This work is supported by the National Nature Science Foundation of China (61303171, 61502477, 61502479), the “Strategic Priority Research Program” of the Chinese Academy of Sciences (XDA06031000).

Author information

Authors and Affiliations

National Engineering Laboratory for Information Security Technologies, Institute of Information Engineering, Chinese Academy of Sciences, Beijing, 100093, China
Shun Liu, Hongtao Xie, Chuan Zhou & Zhendong Mao
University of Chinese Academy of Sciences, Beijing, 100049, China
Chuan Zhou

Authors

Shun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hongtao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Chuan Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Zhendong Mao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongtao Xie .

Editor information

Editors and Affiliations

CNRS–IRISA, Rennes, France
Laurent Amsaleg
Reykjavík University, Reykjavik, Iceland
Gylfi Þór Guðmundsson
Dublin City University, Dublin, Ireland
Cathal Gurrin
Reykjavik University, Reykjavik, Ireland
Björn Þór Jónsson
National Institute of Informatics, Tokyo, Japan
Shin’ichi Satoh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, S., Xie, H., Zhou, C., Mao, Z. (2017). Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science(), vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_40

Download citation

DOI: https://doi.org/10.1007/978-3-319-51811-4_40
Published: 31 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51810-7
Online ISBN: 978-3-319-51811-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics