Abstract
Since born-digital images usually have low resolution, they are distinctly different from natural scene images. Extracting text information from born-digital images has been an increasing interest in document analysis and recognition field. We propose an automatic method to recognize word from low-resolution color image. First, the image is smoothed by using the bilateral filter, which preserves edge information. Then, it is binarized using global thresholding method and cleaned from noise. Finally, the open source Optical Character Recognition engine, with the incorporation of a post-processor trained on knowledge of English language, is applied to obtain meaningful words from the binary image. We experiment the proposed system on ICDAR 2011 and music sheet dataset, and the result shows better performance than several previous works.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Karatzas, D., Robles Mestre, S., Mas, J., Nourbakhsh, F., Pratim Roy, P.: ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email). In: 11th International Conference of Document Analysis and Recognition (ICDAR 2011), pp. 1485–1490. IEEE CPS (2011)
Leavit, N.: Vendors Fight Spam’s Sudden Rise. IEEE Computer 40(3), 16–19 (2007)
Aradhye, H.B., Meyers, G.K., Herson, J.A.: Image analysis for efficient categorization of image-based spam e-mail. In: 8th Int. Conf. on Document Analysis and Recognition, pp. 914–918 (2005)
Fumera, G., Pillai, I., Roli, F.: Spam fitlering based on the analysis of text information embedded into images. In: 1st Int. Symposium on Information and Communication Technologies, pp. 291–296 (2003)
TH-OCR, http://www.wintone.com.cn/en/Products/detail118.aspx
Su, B., Lu, S., Phan, T.Q., Tan, C.L.: Character Extraction in Web Images for Text Recognition. In: International Conference on Pattern Recognition (2012)
Sauvola, J., Pietikainen, M.: Adaptive document image binarization. Pattern Recognition Journal 33(2), 225–236 (2000)
Tesseract-OCR, http://code.google.com/p/tesseract-ocr/
Commercial ABBY FineReader OCR, http://finereader.abbyy.com/
González, A., Bergasa, L.M.: A Text Reading Algorithm for Natural Images. Image and Vision Computing 31(3), 255–274 (2013)
Kumar, D., Ramakrishnan, A.G.: Power-law Transformation for Enhanced Recognition of Born-Digital Word Images. In: Conf. on Signal Processing and Communications, SPCOM (2012)
Yang, H., Quehl, B., Sack, H.: A Framework for Improved Video Text Detection and Recognition. Journal of Multimedia Tools and Applications (2012) ISSN: 1380-7501, ISSN: 1573-7721
Tomasi, C., Manduchi, R.: Bilateral filtering for gray and color images. In: International Conference on Computer Vision, pp. 839–846 (1998)
Otsu, N.: A threshold selection method from grey level histogram. IEEE Transactions on System, Man and Cybernetics 9, 62–66 (1979)
Schilling, R.J.: Fundamentals of Robotics Analysis and Control. Prentice-Hall, Englewood Cliffs (1990)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nguyen, M.H., Kim, SH., Lee, G. (2014). Recognizing Text in Low Resolution Born-Digital Images. In: Jeong, YS., Park, YH., Hsu, CH., Park, J. (eds) Ubiquitous Information Technologies and Applications. Lecture Notes in Electrical Engineering, vol 280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41671-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-41671-2_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41670-5
Online ISBN: 978-3-642-41671-2
eBook Packages: EngineeringEngineering (R0)