Skip to main content

A Method for Binarization of Document Images from a Live Camera Stream

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8946))

Abstract

This paper describes a method for binarization of document images from a live camera stream. The method is based on histogram matching over partial images (referred to as tiles). A method developed previously has been applied successfully to images with artificially added noise. Here, an improved method is presented, in which the user has more direct control over the specification of the binarizer. The resulting system is then taken a step further, by considering the more difficult case of binarization of live camera images. It is demonstrated that the improved method works well for this case, even when the image stream is obtained using a (slightly modified) low-cost web camera with low resolution. For typical images obtained this way, a standard OCR reader is capable of reading the binarized images, detecting around 87.5 % of all words without any error, and with mostly minor, correctable errors for the remaining words.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Neumann, L., Matas, J.: A method for text localization and recognition in real-world images. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) ACCV 2010, Part III. LNCS, vol. 6494, pp. 770–783. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  2. González, A., Bergasa, L.: A text reading algorithm for natural images. Image vis. comput. 31, 255–274 (2013)

    Article  Google Scholar 

  3. Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comput. Sci. 14(18), 3011–3030 (2008)

    Google Scholar 

  4. Shi, J., Ray, N., Zhang, H.: Shape based local thresholding for binarization of document images. Pattern Recogn. Lett. 33, 24–32 (2012)

    Article  Google Scholar 

  5. Valizadeh, M., Kabir, E.: An adaptive water flow model for binarization of degraded document images. Int. J. Doc. Anal. Recogn. 16(2), 165–176 (2013)

    Article  Google Scholar 

  6. Wahde, M.: A method for document image binarization based on histogram matching and repeated contrast enhancement. In: Duval, B., van der Herik, J., Loiseau, S., Filipe, J. (eds.) Proceedings of the 6th International Conference on Agents and Artificial Intelligence (ICAART 2014), pp. 34–41 (2014)

    Google Scholar 

  7. Chen, K.-N., Chen, C.-H., Chang, C.-C.: Efficient illumination compensation techniques for text images. Digit. Signal Process. 22, 726–733 (2012)

    Article  Google Scholar 

  8. Lu, S., Su, B., Tan, C.: Document image binarization using background estimation and stroke edges. Int. J. Doc. Anal. Recogn. 13(4), 303–314 (2010)

    Article  Google Scholar 

  9. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man. Cybern. 9, 62–66 (1979)

    Article  Google Scholar 

  10. Niblack, W.: An Introduction to Image Processing. Prentice-Hall, Englewood Cliffs (1986)

    Google Scholar 

  11. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2010)

    Article  Google Scholar 

  12. Pele, O., Werman, M.: The Quadratic-Chi Histogram Distance Family. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 749–762. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  13. FreeOCR, accessed 20140722. www.free-ocr.com

Download references

Acknowledgements

The author gratefully acknowledges financial support from De blindas vänner.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mattias Wahde .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Wahde, M. (2015). A Method for Binarization of Document Images from a Live Camera Stream. In: Duval, B., van den Herik, J., Loiseau, S., Filipe, J. (eds) Agents and Artificial Intelligence. ICAART 2014. Lecture Notes in Computer Science(), vol 8946. Springer, Cham. https://doi.org/10.1007/978-3-319-25210-0_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25210-0_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25209-4

  • Online ISBN: 978-3-319-25210-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics