Abstract
This paper describes a method for binarization of document images from a live camera stream. The method is based on histogram matching over partial images (referred to as tiles). A method developed previously has been applied successfully to images with artificially added noise. Here, an improved method is presented, in which the user has more direct control over the specification of the binarizer. The resulting system is then taken a step further, by considering the more difficult case of binarization of live camera images. It is demonstrated that the improved method works well for this case, even when the image stream is obtained using a (slightly modified) low-cost web camera with low resolution. For typical images obtained this way, a standard OCR reader is capable of reading the binarized images, detecting around 87.5 % of all words without any error, and with mostly minor, correctable errors for the remaining words.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011)
González, A., Bergasa, L.: A text reading algorithm for natural images. Image vis. comput. 31, 255–274 (2013)
Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comput. Sci. 14(18), 3011–3030 (2008)
Shi, J., Ray, N., Zhang, H.: Shape based local thresholding for binarization of document images. Pattern Recogn. Lett. 33, 24–32 (2012)
Valizadeh, M., Kabir, E.: An adaptive water flow model for binarization of degraded document images. Int. J. Doc. Anal. Recogn. 16(2), 165–176 (2013)
Wahde, M.: A method for document image binarization based on histogram matching and repeated contrast enhancement. In: Duval, B., van der Herik, J., Loiseau, S., Filipe, J. (eds.) Proceedings of the 6th International Conference on Agents and Artificial Intelligence (ICAART 2014), pp. 34–41 (2014)
Chen, K.-N., Chen, C.-H., Chang, C.-C.: Efficient illumination compensation techniques for text images. Digit. Signal Process. 22, 726–733 (2012)
Lu, S., Su, B., Tan, C.: Document image binarization using background estimation and stroke edges. Int. J. Doc. Anal. Recogn. 13(4), 303–314 (2010)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man. Cybern. 9, 62–66 (1979)
Niblack, W.: An Introduction to Image Processing. Prentice-Hall, Englewood Cliffs (1986)
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2010)
Pele, O., Werman, M.: The Quadratic-Chi Histogram Distance Family. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 749–762. Springer, Heidelberg (2010)
FreeOCR, accessed 20140722. www.free-ocr.com
Acknowledgements
The author gratefully acknowledges financial support from De blindas vänner.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Wahde, M. (2015). A Method for Binarization of Document Images from a Live Camera Stream. In: Duval, B., van den Herik, J., Loiseau, S., Filipe, J. (eds) Agents and Artificial Intelligence. ICAART 2014. Lecture Notes in Computer Science(), vol 8946. Springer, Cham. https://doi.org/10.1007/978-3-319-25210-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-25210-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25209-4
Online ISBN: 978-3-319-25210-0
eBook Packages: Computer ScienceComputer Science (R0)