DeepITQA: Deep Based Image Text Quality Assessment

Li, Hongyu; Zhu, Fan; Qiu, Junhua

doi:10.1007/978-3-030-04224-0_34

Hongyu Li¹⁵,
Fan Zhu¹⁵ &
Junhua Qiu¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11306))

Included in the following conference series:

International Conference on Neural Information Processing

1818 Accesses
1 Citations

Abstract

To predict the OCR accuracy of document images, text related image quality assessment is necessary and of great value, especially in online business processes. Such quality assessment is more interested in text and aims to compute the quality score of an image through predicting the degree of degradation at textual regions. In this paper, we propose a deep based framework to achieve image text quality assessment, which is composed of three stages: text detection, text quality prediction, and weighted pooling. Text detection is used to find potential text lines and the quality is solely estimated on detected text lines. To predict text line quality, we train a deep neural network model with our synthetic samples. The overall text quality of an image can be computed through pooling the quality of all detected text lines by way of weighted averaging. The proposed method has been tested on two benchmarks and our collected pictures. Experimental results show that the proposed method is feasible and promising in image text quality assessment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bong, D.B.L., Khoo, B.E.: Blind image blur assessment by using valid reblur range and histogram shape difference. Signal Process. Image Commun. 29(6), 699–710 (2014)
Article Google Scholar
Bong, D.B.L., Khoo, B.E.: Objective blur assessment based on contraction errors of local contrast maps. Multimedia Tools Appl. 74(17), 7355–7378 (2015). https://doi.org/10.1007/s11042-014-1983-5
Article Google Scholar
Bosse, S., Maniry, D., Wiegand, T., Samek, W.: A deep neural network for image quality assessment. In: 2016 IEEE International Conference on Image Processing (ICIP), September 2016, pp. 3773–3777 (2016)
Google Scholar
Buta, M., Neumann, L., Matas, J.: Fastext: Efficient unconstrained scene text detector. In: IEEE International Conference on Computer Vision, pp. 1206–1214 (2015)
Google Scholar
De, K., Masilamani, V.: Discrete orthogonal moments based framework for assessing blurriness of camera captured document images. In: Vijayakumar, V., Neelanarayanan, V. (eds.) Proceedings of the 3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC – 16’). SIST, vol. 49, pp. 227–236. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30348-2_18
Chapter Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, June 2016
Google Scholar
Hou, W., Gao, X., Tao, D., Li, X.: Blind image quality assessment via deep learning. IEEE Trans. Neural Netw. Learn. Syst. 26(6), 1275–1286 (2015)
Article MathSciNet Google Scholar
Kang, L., Ye, P., Li, Y., Doermann, D.: A deep learning approach to document image quality assessment. In: IEEE International Conference on Image Processing, pp. 2570–2574 (2014)
Google Scholar
Kumar, J., Chen, F., Doermann, D.: Sharpness estimation for document and scene images. In: International Conference on Pattern Recognition, pp. 3292–3295 (2013)
Google Scholar
Nayef, N.: Metric-based no-reference quality assessment of heterogeneous document images. In: SPIE Electronic Imaging, pp. 94020L–94020L-12 (2015)
Google Scholar
Nayef, N., Luqman, M.M., Prum, S., Eskenazi, S., Chazalon, J., Ogier, J.M.: SmartDoc-QA: a dataset for quality assessment of smartphone captured document images - single and multiple distortions. In: International Conference on Document Analysis and Recognition, pp. 1231–1235 (2015)
Google Scholar
Peng, X., Cao, H., Natarajan, P.: Document image quality assessment using discriminative sparse representation. In: Document Analysis Systems, pp. 227–232 (2016)
Google Scholar
Rusinol, M., Chazalon, J., Ogier, J.M.: Combining focus measure operators to predict OCR accuracy in mobile-captured document images. In: IAPR International Workshop on Document Analysis Systems, pp. 181–185 (2014)
Google Scholar
Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: European Conference on Computer Vision, pp. 56–72 (2016)
Chapter Google Scholar
Ye, P., Doermann, D.: Document image quality assessment: a brief survey. In: International Conference on Document Analysis and Recognition, pp. 723–727 (2013)
Google Scholar
Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)
Article Google Scholar
Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks, pp. 4159–4167 (2016)
Google Scholar
Zhou, X., et al.: EAST: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 2642–2651 (2017)
Google Scholar
Zhu, Y., Yao, C., Bai, X.: Scene text detection and recognition: recent advances and future trends. Front. Comput. Sci. 10(1), 19–36 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

AI Lab, ZhongAn Information Technology Service Co., Ltd., Shanghai, China
Hongyu Li, Fan Zhu & Junhua Qiu

Authors

Hongyu Li
View author publications
You can also search for this author in PubMed Google Scholar
Fan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Junhua Qiu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongyu Li .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, H., Zhu, F., Qiu, J. (2018). DeepITQA: Deep Based Image Text Quality Assessment. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11306. Springer, Cham. https://doi.org/10.1007/978-3-030-04224-0_34

Download citation

DOI: https://doi.org/10.1007/978-3-030-04224-0_34
Published: 18 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04223-3
Online ISBN: 978-3-030-04224-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics