Skip to main content

Recognizing Text in Low Resolution Born-Digital Images

  • Conference paper
Ubiquitous Information Technologies and Applications

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 280))

Abstract

Since born-digital images usually have low resolution, they are distinctly different from natural scene images. Extracting text information from born-digital images has been an increasing interest in document analysis and recognition field. We propose an automatic method to recognize word from low-resolution color image. First, the image is smoothed by using the bilateral filter, which preserves edge information. Then, it is binarized using global thresholding method and cleaned from noise. Finally, the open source Optical Character Recognition engine, with the incorporation of a post-processor trained on knowledge of English language, is applied to obtain meaningful words from the binary image. We experiment the proposed system on ICDAR 2011 and music sheet dataset, and the result shows better performance than several previous works.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Karatzas, D., Robles Mestre, S., Mas, J., Nourbakhsh, F., Pratim Roy, P.: ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email). In: 11th International Conference of Document Analysis and Recognition (ICDAR 2011), pp. 1485–1490. IEEE CPS (2011)

    Google Scholar 

  2. Leavit, N.: Vendors Fight Spam’s Sudden Rise. IEEE Computer 40(3), 16–19 (2007)

    Article  Google Scholar 

  3. Aradhye, H.B., Meyers, G.K., Herson, J.A.: Image analysis for efficient categorization of image-based spam e-mail. In: 8th Int. Conf. on Document Analysis and Recognition, pp. 914–918 (2005)

    Google Scholar 

  4. Fumera, G., Pillai, I., Roli, F.: Spam fitlering based on the analysis of text information embedded into images. In: 1st Int. Symposium on Information and Communication Technologies, pp. 291–296 (2003)

    Google Scholar 

  5. TH-OCR, http://www.wintone.com.cn/en/Products/detail118.aspx

  6. Su, B., Lu, S., Phan, T.Q., Tan, C.L.: Character Extraction in Web Images for Text Recognition. In: International Conference on Pattern Recognition (2012)

    Google Scholar 

  7. Sauvola, J., Pietikainen, M.: Adaptive document image binarization. Pattern Recognition Journal 33(2), 225–236 (2000)

    Article  Google Scholar 

  8. Tesseract-OCR, http://code.google.com/p/tesseract-ocr/

  9. Commercial ABBY FineReader OCR, http://finereader.abbyy.com/

  10. González, A., Bergasa, L.M.: A Text Reading Algorithm for Natural Images. Image and Vision Computing 31(3), 255–274 (2013)

    Article  Google Scholar 

  11. Kumar, D., Ramakrishnan, A.G.: Power-law Transformation for Enhanced Recognition of Born-Digital Word Images. In: Conf. on Signal Processing and Communications, SPCOM (2012)

    Google Scholar 

  12. Yang, H., Quehl, B., Sack, H.: A Framework for Improved Video Text Detection and Recognition. Journal of Multimedia Tools and Applications (2012) ISSN: 1380-7501, ISSN: 1573-7721

    Google Scholar 

  13. Tomasi, C., Manduchi, R.: Bilateral filtering for gray and color images. In: International Conference on Computer Vision, pp. 839–846 (1998)

    Google Scholar 

  14. Otsu, N.: A threshold selection method from grey level histogram. IEEE Transactions on System, Man and Cybernetics 9, 62–66 (1979)

    Article  Google Scholar 

  15. Schilling, R.J.: Fundamentals of Robotics Analysis and Control. Prentice-Hall, Englewood Cliffs (1990)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Minh Hieu Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nguyen, M.H., Kim, SH., Lee, G. (2014). Recognizing Text in Low Resolution Born-Digital Images. In: Jeong, YS., Park, YH., Hsu, CH., Park, J. (eds) Ubiquitous Information Technologies and Applications. Lecture Notes in Electrical Engineering, vol 280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41671-2_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41671-2_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41670-5

  • Online ISBN: 978-3-642-41671-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics