Abstract
The binarization of badly degraded document images is very challenging job due to the presence of various degradations such as ink bleed through, stains, smear, blur, low contrast and nonuniform illumination. Many binarization techniques are proposed in the literature, most of these techniques are threshold based. This chapter proposes the binarization technique which uses the contrast feature to compute the threshold value with minimum parameter tuning. It computes the local contrast image using maximum and minimum pixel values in the neighbourhood. The high contrast text pixels in the image are detected using global binarization. Finally, the local thresholds are computed using high contrast image pixels within the local window to binarize the document image. It has been tested on benchmark datasets H-DIBCO-2010 and H-DIBCO-2016 in terms of F-measure, PSNR and NRM. The results are compared with the Bernsen’s and LMM contrast based binarization techniques and found to be outperforming these methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
N. Otsu, A threshold selection method from gray level histograms. IEEE Trans. Sys. Man Cybern. 9(1), 62–66 (1979)
W. Niblack, An Introduction to Image Processing (Prentice-Hall, Englewood Cliffs, NJ, 1986), pp. 115–116
P. Wellner, Adaptive Thresholding for the Digital Desk. Xerox, EPC1993–110 (1993)
J. Sauvola, M. Pietaksinen, Adaptive document image binarization. Pattern Recogn. Lett. 33(1), 225–236 (1997)
C. Wolf, J.M. Jolion, Extraction and recognition of artificial text in multimedia document. Pattern Anal. Appl. 6(4), 309–326 (2003)
K. Khurshid, I. Siddiqi, C. Faure, N. Vincent, Comparison of Niblack inspired methods for ancient document. in 16th IEEE International Conference on Document Recognition and Retrieval 7247 (2009), pp. 1–10
J. Bernsen, Dynamic thresholding of gray level images. in Proceedings of IEEE International Conference on Pattern Recognition (1986), pp. 1251–1255
B. Su, S. Lu, C.L. Tan, Binarization of historical handwritten document images using local maximum and minimum filter. Int. Workshop Doc. Anal. Sys. pp. 159–165 (2010)
B. Gatos, I. Pratikakis, S.J. Perantonis, Adaptive degraded document image binarization. Pattern Recogn. 39(3), 317–327 (2006)
S. Zhou, C. Liu, Z. Cui, S. Gong, An improved adaptive document image binarization method. in Proceedings of 2nd IEEE International Congress on Image and Signal Processing (2009), pp. 1–5
H. Kawano, H. Oohama, H. Maeda, Y. Okada, N. Ikoma, Degraded document image binarization combining local statistics. in IEEE International Joint Conference (ICROS-SICE) (2009), pp. 439–443
I.K. Kim, D.W. Jung, R.H. Park, Document image binarization based on topographic analysis using a water flow model. Pattern Recogn. 3, 265–277 (2002)
H.H. Oh, K.T. Lim, S.I. Hien, An improved binarization algorithm based on a water flow model for document image with inhomogeneous backgrounds. Pattern Recogn. 38, 2612–2625 (2005)
M. Valizadeh, E. Kabir, An adaptive water flow model for binarization of degraded document images. IJDAR 16(2), 165–176 (2013)
W. Jiangtao, L. Shumo, S. Jiedi, A new binarization method for non- uniform illuminated document images. Pattern Recogn. 46(6), 1670–1690 (2013)
A. Elazab, F. Jia, J. Wu, Q. Hu, Segmentation of brain tissues from magnetic resonance images using adaptively regularized kernel-based Fuzzy C-Means Clustering. in Computational and Mathematical Methods in Medicine (Hindawi Publishing Corporation, 2015)
A. Farahmand, A. Sarrafzadeh, J. Shanbehzadeh, Noise removal and binarization of scanned document images using clustering of features. in Proceedings of the International MultiConference of Engineers and Computer Scientists, vol. 1 (2017)
H. Michalak, K. Okarma, Region based adaptive binarization for optical character recognition purposes. In: International Interdisciplinary Ph.D. Workshop (IIPhDW) (Swinoujście, 2018) pp. 361–366
N. Ouafek, M. Kholladi, A binarization method for degraded document image using artificial neural network and interpolation inpainting. in Proceedings of 4th International Conference on Optimization and Applications (ICOA) (Mohammedia, 2018) pp. 1–5
B. Su, S. Lu, C.L. Tan, Combination of document image binarization techniques. in International Conference on Document Analysis and Recognition (Beijing, China, 2011) pp. 22–26
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Rani, U., Kaur, A., Josan, G. (2020). A New Contrast Based Degraded Document Image Binarization. In: Mallick, P., Pattnaik, P., Panda, A., Balas, V. (eds) Cognitive Computing in Human Cognition. Learning and Analytics in Intelligent Systems, vol 17. Springer, Cham. https://doi.org/10.1007/978-3-030-48118-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-48118-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-48117-9
Online ISBN: 978-3-030-48118-6
eBook Packages: Computer ScienceComputer Science (R0)