Abstract
Here, we present our recent study of a robust but simple approach to extraction of texts from camera-captured images. In the proposed approach, we first identify pixels which are highly specular. Connected components of this set of specular pixels are obtained. Pixels belonging to each such component are separately binarized using the well-known Otsu’s approach. We next apply smoothing on the whole image before obtaining its Canny edge representation. Bounding rectangle of each connected component of the Canny edge image is obtained and multiple components with pairwise overlapping bounding boxes are merged. Otsu’s thresholding technique is applied separately on different parts of input image defined by the resulting bounding boxes. Although Otsu’s thresholding approach does not generally provide acceptable performance on camera captured images, we observed its suitability when applied severally as in the above. The binarized specular components obtained at the initial stage replace the corresponding regions of the latter binarized image. Finally, a set of postprocessing operations is used to remove certain non-text components of the binarized image.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man Cybern. 9(1), 62–66 (1979)
Kittler, J., Illingworth, J., Foglein, J.: Threshold selection based on a simple image statistic. Comp. Vision Graph. Image Proc. 30(2), 125–147 (1985)
Sauvola, J.J., Pietikainen, M.: Adaptive document image binarization. Patt. Recog. 33(2), 225–236 (2000)
Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, New York (1986)
Stathis, P., Kavallieratou, E., Papamarkos, N.: An evaluation technique for binarization algorithms. J. Univ. Comp. Sci. 14(18), 3011–3030 (2008)
Peng, X., Setlur, S., Govindaraju, V., Sitaram, R.: Markov random field based binarization for hand-held devices captured document images. In: Proceedings of Indian Conference on Comp. Vision Graph. Image Proceedings, pp. 71–76 (2010)
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: Proceedings of the 7th Internationl Conference on Document Analysis and Recognition, pp. 682–687 (2003)
Shafer, S.A.: Using color to separate reflection components. Color Res. Appl. 10, 210–218 (1985)
He, Y., et al.: Enhancement of camera-based whiteboard images. In: XVII-DRR (SPIE Proceedings Series, vol. 7534, pp. 1–10 (2010)
Canny, J.: A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell. 8(6), 679–698 (1986)
Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Text detection of two major Indian scripts in natural scene images. In: Iwamura, M., Shafait, F. (eds.) CBDAR 2011. LNCS, vol. 7139, pp. 42–57. Springer, Heidelberg (2012)
Roy Chowdhury, A., Bhattacharya, U., Parui, S.K.: Scene text detection using sparse stroke information and MLP. In: Proceedings of International Conference on Pattern Recognition, pp. 294–297 (2012)
Kasar, T. et al.: Font and background color independent text binarization. In: Proceedings of CBDAR, pp. 3–9 (2007)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: Proceedings of CVPR, pp. 2963–2970 (2010)
Borgefors, G.: Distance transformations in digital images. Comp. Vis. Graph. Image Proc. 34, 344–371 (1986)
Chen, H., et al.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: Proceedings of ICIP (2011)
Merino-Gracia, C., Lenc, K., Mirmehdi, M: A head-mounted device for recognizing text in natural scenes. In: Proceedings of CBDAR, pp. 27–32 (2011)
Zhang, J., Kasturi, R.: Text detection using edge gradient and graph spectrum. In: Proceedings of ICPR, pp. 3979–3982 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Banerjee, S., Mullick, K., Bhattacharya, U. (2014). A Robust Approach to Extraction of Texts from Camera Captured Images. In: Iwamura, M., Shafait, F. (eds) Camera-Based Document Analysis and Recognition. CBDAR 2013. Lecture Notes in Computer Science(), vol 8357. Springer, Cham. https://doi.org/10.1007/978-3-319-05167-3_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-05167-3_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05166-6
Online ISBN: 978-3-319-05167-3
eBook Packages: Computer ScienceComputer Science (R0)