Skip to main content

Text Extraction from Images: A Review

  • Conference paper
  • First Online:
  • 908 Accesses

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 10))

Abstract

Multimedia, natural scenes, images are sources of textual information. Textual information extracted from these sources can be used for automatic image and video indexing, and image structuring. But, due to variations in text style, size, alignment of text, as well as orientation of text and low contrast of the image and complex background make challenging the extraction of text. From the past recent years, many methods for extraction of text are proposed. This paper provides with analysis, comparison of performance of various methods used for extraction of text information from images. It summarizes various methods for text extraction and various factors affecting the performance of these methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Keechul Jung, Kwang In Kim, Anil K. Jain “Text information extraction in images and video: a survey”, Elsevier, Pattern Recognition 37 (2004).

    Google Scholar 

  2. Xu-Cheng Yin, Member, IEEE, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao “Robust Text Detection in Natural Scene Images” IEEE transactions on pattern analysis and machine intelligence, VOL. 36, NO. 5, (2014).

    Google Scholar 

  3. Viet Phuong Le, Nibal Nayef, Muriel Visani, Jean-Marc Ogier and Cao De Trant “Text and Non-text Segmentation based on Connected Component Features” IEEE, 13th International Conference on Document Analysis and Recognition (ICDAR), (2015).

    Google Scholar 

  4. Ankit Vidyarthi, Namita Mittal, Ankita Kansal, “Text and Non-Text Region Identification Using Texture and Connected Components”, International Conference on Signal Propagation and Computer Technology (ICSPCT), IEEE (2014).

    Google Scholar 

  5. Yingying Zhu, Cong Yao, Xiang Bai “Scene text detection and recognition: recent advances and future trends” Front. Comput. Sci., (2016).

    Google Scholar 

  6. Qixiang Ye, and David Doermann, “Text Detection and Recognition in Imagery: A Survey” IEEE transactions on pattern analysis and machine intelligence, vol. 37, no. 7, (2015).

    Google Scholar 

  7. N. Senthilkumaran and R. Rajesh, “Edge Detection Techniques for Image Segmentation – A Survey of Soft Computing Approaches”, International Journal of Recent Trends in Engineering, Vol. 1, No. 2, (2009).

    Google Scholar 

  8. Zhong Y, Karu K, Jain A K. “Locating text in complex color images.” in Proceedings of the 3rd IEEE Conference on Document Analysis and Recognition, pp-146–149, IEEE (1995).

    Google Scholar 

  9. Kim K I, Jung K, Kim J H. “Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm.” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp-1631–1639, IEEE(2003).

    Google Scholar 

  10. Y. Zhong, H. J. Zhang, and A. K. Jain, “Automatic caption localization in compressed video,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 4, pp. 385–392, IEEE (2000).

    Google Scholar 

  11. Li H, Doermann D, Kia O. “Automatic text detection and tracking in digital video.”, 9(1): 147–156, IEEE Transactions on Image Processing, (2000).

    Google Scholar 

  12. K. I. Kim, K. Jung, and H. Kim, “Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, no. 12, pp. 1631–1639, (2003).

    Google Scholar 

  13. Chong Yu, Yonghong Song, Quan Meng, Yuanlin Zhang, Yang Liu, “Text detection and recognition in natural scene with edge analysis”, IET Comput. Vis., Vol. 9, Iss. 4, pp. 603–613, (2015).

    Google Scholar 

  14. Chucai Yi, Ying Li Tian, “Text String Detection From Natural Scenes by Structure-Based Partition and Grouping”, Vol. 20, No. 9, IEEE Transactions on Image Processing (2011).

    Google Scholar 

  15. Shijian Lu, Tao Chen, Shangxuan Tian, Joo-Hwee Lim, Chew-Lim Tan, “Scene text extraction based on edges and support vector regression”, 18:125–135, IJDAR (2015).

    Google Scholar 

  16. K. C. Kim, H. R. Byun, Y. J. Song, Y. W. Choi, S. Y. Chi, K. K. Kim, Y. K. Chung, “Scene Text Extraction in Natural Scene Images using Hierarchical Feature Combining and Verification”, 17th International Conference on Pattern Recognition (ICPR’04), IEEE (2004).

    Google Scholar 

  17. Mohammad Khodadadi, and Alireza Behrad, “Text Localization, Extraction and Inpainting in Color Images”, IEEE, 20th Iranian Conference on Electrical Engineering, (ICEE2012), (2012).

    Google Scholar 

  18. Anubhav Kumar “An Efficient Text Extraction Algorithm in Complex Images”, IEEE, (2013).

    Google Scholar 

  19. Lukáš Neumann, Jiří Matas, “On Combining Multiple Segmentations in Scene Text Recognition”, 12th International Conference on Document Analysis and Recognition, IEEE (2013).

    Google Scholar 

  20. L. Neumann and J. Matas, “Text localization in real-world images using efficiently pruned exhaustive search,” in Document Analysis and Recognition (ICDAR), 2011 International Conference, pp. 687–691, (2011).

    Google Scholar 

  21. L. Neumann and J. Matas, “Real-time scene text localization and recognition,” in Computer Vision and Pattern Recognition (CVPR), pp. 3538–3545, IEEE Conference (2012).

    Google Scholar 

  22. Hyung Il Ko, Duck Hoon Kim, “Scene Text Detection via Connected Component Clustering and Non-text Filtering”, Vol. 22, No. 6, IEEE Transactions on Image Processing (2013).

    Google Scholar 

  23. Huizhong Chen, Sam S. Tsai1, Georg Schroth, David M. Chen, Radek Grzeszczuk and Bernd Girod, “Robust Text Detection In Natural Images with Edge-Enhanced Maximally Stable Extremal Regions”, 18th IEEE International Conference on Image Processing (2013).

    Google Scholar 

  24. Kamrul Hasan Talukder, Tania Mallick, “Connected Component Based Approach for Text Extraction from Color Image”, IEEE, 7th International Conference on Computer and Information Technology (ICCIT)(2014).

    Google Scholar 

  25. Gang Zhou, Yuehu Liu, Liang Xu1, Zhenhong Jia, “Scene text detection method based on the hierarchical model”, Vol. 9, Iss. 4, pp. 500–510 IET Comput. Vis., (2015).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nitin Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, N., Nidhi (2018). Text Extraction from Images: A Review. In: Mishra, D., Nayak, M., Joshi, A. (eds) Information and Communication Technology for Sustainable Development. Lecture Notes in Networks and Systems, vol 10. Springer, Singapore. https://doi.org/10.1007/978-981-10-3920-1_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3920-1_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3919-5

  • Online ISBN: 978-981-10-3920-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics