Abstract
To predict the OCR accuracy of document images, text related image quality assessment is necessary and of great value, especially in online business processes. Such quality assessment is more interested in text and aims to compute the quality score of an image through predicting the degree of degradation at textual regions. In this paper, we propose a deep based framework to achieve image text quality assessment, which is composed of three stages: text detection, text quality prediction, and weighted pooling. Text detection is used to find potential text lines and the quality is solely estimated on detected text lines. To predict text line quality, we train a deep neural network model with our synthetic samples. The overall text quality of an image can be computed through pooling the quality of all detected text lines by way of weighted averaging. The proposed method has been tested on two benchmarks and our collected pictures. Experimental results show that the proposed method is feasible and promising in image text quality assessment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bong, D.B.L., Khoo, B.E.: Blind image blur assessment by using valid reblur range and histogram shape difference. Signal Process. Image Commun. 29(6), 699–710 (2014)
Bong, D.B.L., Khoo, B.E.: Objective blur assessment based on contraction errors of local contrast maps. Multimedia Tools Appl. 74(17), 7355–7378 (2015). https://doi.org/10.1007/s11042-014-1983-5
Bosse, S., Maniry, D., Wiegand, T., Samek, W.: A deep neural network for image quality assessment. In: 2016 IEEE International Conference on Image Processing (ICIP), September 2016, pp. 3773–3777 (2016)
Buta, M., Neumann, L., Matas, J.: Fastext: Efficient unconstrained scene text detector. In: IEEE International Conference on Computer Vision, pp. 1206–1214 (2015)
De, K., Masilamani, V.: Discrete orthogonal moments based framework for assessing blurriness of camera captured document images. In: Vijayakumar, V., Neelanarayanan, V. (eds.) Proceedings of the 3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC – 16’). SIST, vol. 49, pp. 227–236. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30348-2_18
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, June 2016
Hou, W., Gao, X., Tao, D., Li, X.: Blind image quality assessment via deep learning. IEEE Trans. Neural Netw. Learn. Syst. 26(6), 1275–1286 (2015)
Kang, L., Ye, P., Li, Y., Doermann, D.: A deep learning approach to document image quality assessment. In: IEEE International Conference on Image Processing, pp. 2570–2574 (2014)
Kumar, J., Chen, F., Doermann, D.: Sharpness estimation for document and scene images. In: International Conference on Pattern Recognition, pp. 3292–3295 (2013)
Nayef, N.: Metric-based no-reference quality assessment of heterogeneous document images. In: SPIE Electronic Imaging, pp. 94020L–94020L-12 (2015)
Nayef, N., Luqman, M.M., Prum, S., Eskenazi, S., Chazalon, J., Ogier, J.M.: SmartDoc-QA: a dataset for quality assessment of smartphone captured document images - single and multiple distortions. In: International Conference on Document Analysis and Recognition, pp. 1231–1235 (2015)
Peng, X., Cao, H., Natarajan, P.: Document image quality assessment using discriminative sparse representation. In: Document Analysis Systems, pp. 227–232 (2016)
Rusinol, M., Chazalon, J., Ogier, J.M.: Combining focus measure operators to predict OCR accuracy in mobile-captured document images. In: IAPR International Workshop on Document Analysis Systems, pp. 181–185 (2014)
Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: European Conference on Computer Vision, pp. 56–72 (2016)
Ye, P., Doermann, D.: Document image quality assessment: a brief survey. In: International Conference on Document Analysis and Recognition, pp. 723–727 (2013)
Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)
Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks, pp. 4159–4167 (2016)
Zhou, X., et al.: EAST: an efficient and accurate scene text detector. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017, pp. 2642–2651 (2017)
Zhu, Y., Yao, C., Bai, X.: Scene text detection and recognition: recent advances and future trends. Front. Comput. Sci. 10(1), 19–36 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, H., Zhu, F., Qiu, J. (2018). DeepITQA: Deep Based Image Text Quality Assessment. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11306. Springer, Cham. https://doi.org/10.1007/978-3-030-04224-0_34
Download citation
DOI: https://doi.org/10.1007/978-3-030-04224-0_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04223-3
Online ISBN: 978-3-030-04224-0
eBook Packages: Computer ScienceComputer Science (R0)