Text Extraction from Mail Images with Complex Background

Wang, Qingqing; Tu, Xiao; Lu, Shujing; Lu, Yue

doi:10.1007/978-981-10-8108-8_1

Text Extraction from Mail Images with Complex Background

Qingqing Wang¹²,
Xiao Tu¹³,
Shujing Lu¹³ &
…
Yue Lu^12,13

Conference paper
First Online: 03 February 2018

1816 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 815))

Abstract

A novel method is proposed for text extraction from mail images with complex background. Firstly, wavelet transform and Laplacian operator are applied to generate the features of regions which are obtained by dividing input image with sliding window. Then, support vector machine (SVM) is utilized to classify these regions into texts and non-texts according to the features. Bootstrap strategy is used to build the training database. Finally, connected components analysis (CCA) is employed to merge text regions into text candidates which can be processed by following steps to get the delivery address. Experimental results involving 534 mail images show the effectiveness and robustness of the proposed method, and comparison results with other methods demonstrate the advantages of the selected features.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

He, P., Huang, W., He, T., Zhu, Q., Qiao, Y., Li, X.: Single shot textdetector with regional attention. In: International Conference on Computer Vision (2017)
Google Scholar
He, T., Huang, W., Qiao, Y., Yao, J.: Accurate text localization in natural image with cascaded convolutional text network. arXiv:1603.09423 (2016)
He, T., Huang, W., Qiao, Y., Yao, J.: Text-attention convolutional neural networks for scene text detection. IEEE Trans. Image Process. 25, 2529–2541 (2016)
Article MathSciNet Google Scholar
Iqbal, K., Yin, X., Yin, X., Ali, H., Hao, H.: Classifier comparison for MSER-based text classification in scene images. In: International Joint Conference on Neural Networks, pp. 1–6 (2013)
Google Scholar
Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., Fu, P., Luo, Z.: R2CNN: Rotation region CNN for orientation robust scene text detection. arXiv:1706.09579v2 (2017)
Koo, K., Kim, D.: Scene text detection via connected component clustering and nontext filtering. IEEE Trans. Image Process. 22(6), 2296–2305 (2013)
Google Scholar
Liao, M., Shi, B., Bai, X, Wang, X., Liu, W.: Textboxes: a fast textdetector with a single deep neural network. In: The 31th AAAI Conference on Artificial Intelligence, pp. 4161–4167 (2017)
Google Scholar
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–268 (2002)
Article Google Scholar
Liu, C., Wang, C., Dai, R.: Text detection in images based on unsupervised classification of edge-based features. In: The 18th International Conference on Document Analysis and Recognition, pp. 610–614 (2005)
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3538–3545 (2012)
Google Scholar
Pan, Y.F., Hou, X., Liu, C.L.: A hybrid approach to detect and localize texts in natural scene images. IEEE Trans. Image Process. 20(3), 800–813 (2011)
Article MathSciNet MATH Google Scholar
Shi, C., Wang, C., Xiao, B., Zhang, Y., Gao, S.: Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recogn. Lett. 34(2), 107–116 (2013)
Article Google Scholar
Shivakumara, P., Trung, Q.P., Tan, C.L.: A robust wavelet transform based technique for video text detection. In: The 10th International Conference on Document Analysis and Recognition, pp. 1285–1289 (2009)
Google Scholar
Sung, K., Poggio, T.: Example-based learning for view-based human face detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(1), 39–51 (1998)
Article Google Scholar
Tu, X., Lu, Y.: Run-based approach to labeling connected components in document images. In: The 2th International Workshop on ETCS, pp. 206–209 (2010)
Google Scholar
Ye, Q., Gao, W., Wang, W., Zeng, W.: A robust text detection algorithm in images and video frames. In: IEEE ICICS-PCM, pp. 802–806 (2003)
Google Scholar
Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans. Image Process. 20(9), 2594–2605 (2011)
Article MathSciNet MATH Google Scholar
Yin, X., Yin, X., Huang, K., Hao, H.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)
Article Google Scholar
Zhang, J., Kasturi, R.: Text detection using edge gradient and graph spectrum. In: The 20th International Conference on Pattern Recognition, pp. 3979–3982 (2010)
Google Scholar
Zini, L., Destrero, A., Odone, F.: A classification architecture based on connected components for text detection in unconstrained environments. In: The 6th IEEE International Conference on Digital Object Identifier, pp. 176–181 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, East China Normal University, Shanghai, 200062, China
Qingqing Wang & Yue Lu
ECNU-SRI Joint Lab for Pattern Analysis and Intelligent, Shanghai Research Institute of China Post Group, Shanghai, 200062, China
Xiao Tu, Shujing Lu & Yue Lu

Authors

Qingqing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Tu
View author publications
You can also search for this author in PubMed Google Scholar
Shujing Lu
View author publications
You can also search for this author in PubMed Google Scholar
Yue Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qingqing Wang .

Editor information

Editors and Affiliations

Shanghai Jiao Tong University , Shanghai, China
Guangtao Zhai
Shanghai Jiao Tong University , Shanghai, China
Jun Zhou
Jiao Tong University , Shanghai, China
Xiaokang Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Q., Tu, X., Lu, S., Lu, Y. (2018). Text Extraction from Mail Images with Complex Background. In: Zhai, G., Zhou, J., Yang, X. (eds) Digital TV and Wireless Multimedia Communication. IFTC 2017. Communications in Computer and Information Science, vol 815. Springer, Singapore. https://doi.org/10.1007/978-981-10-8108-8_1

Download citation

DOI: https://doi.org/10.1007/978-981-10-8108-8_1
Published: 03 February 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8107-1
Online ISBN: 978-981-10-8108-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics