Skip to main content

Visual Analytic-Based Technique for Handwritten Indic Script Identification—A Greedy Heuristic Feature Fusion Framework

  • Conference paper
  • First Online:
  • 766 Accesses

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 404))

Abstract

Script identification from multi-script handwritten document images has been a subject of considerable discussion in the literature. In this paper, a novel feature fusion framework (FFF) using structural appearance (SA) and directional morphological filter (DMF) is proposed based on the idea of visual analytic (VA). A dataset of 181 handwritten document pages distributed over 2450 line and 20,260 word images of Bangla, Devanagari, Roman, Oriya, and Urdu scripts is built and considered for experimentation. Experimental result shows a significant improvement of the identification rate by the VA-FFF over the SA and DMF technique if they had been applied individually.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Obaidullah, S.M., Das, S.K., Roy, K.: A system for handwritten script identification from indian document. J. Pattern Recognit. Res. 8(1), 1–12 (2013)

    Article  Google Scholar 

  2. Chaudhuri, B.B., Pal, U.: A complete printed Bangla OCR. Pattern Recognit. 31, 531–549 (1998)

    Article  Google Scholar 

  3. Ghosh, D., Dube, T., Shivprasad, S.P.: Script recognition—a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)

    Article  Google Scholar 

  4. Chaudhuri, B.B., Pal, U.: An OCR system to read two Indian language scripts: Bangla and Devanagari (Hindi). In: Proceedings of 4th International Conference on Document Analysis and Recognition, Uhn. pp. 18–20 (1997)

    Google Scholar 

  5. Hochberg, J., Kelly, P., Thomas, T., Kerns, L.: Automatic script identification from document images using cluster-based templates. In: IEEE Trans. Pattern Anal. Mach. Intell. 19, 176–181 (1997)

    Google Scholar 

  6. Chaudhury, S., Harit, G., Madnani, S., Shet, R. B.: Identification of scripts of Indian languages by combining trainable classifiers. In: Proceedings of Indian Conference on Computer Vision, Graphics and Image Processing, Bangalore, India (2000)

    Google Scholar 

  7. Dhanya, D., Ramakrishnan, A.G., Pati, P.B.: Script identification in printed bilingual documents. Sadhana 27(part-1), 73–82 (2002)

    Google Scholar 

  8. Pati, P.B., Ramakrishnan, A.G.: Word level multi-script identification. Pattern Recognit. Lett. 29(9), 1218–1229 (2008)

    Article  Google Scholar 

  9. Obaidullah, S.M., Mondal, A., Das, N., Roy, K.: Script Identification from printed Indian document images and performance evaluation using different classifiers. Appl. Comput. Intell. Soft Comput. 2014(Article ID 896128), 12 (2014)

    Google Scholar 

  10. Hochberg, J., Bowers, Cannon, K.M., Kelly, P.: Script and language identification for handwritten document images. Int. J. Doc. Anal. Recognit. 2(2–3), 45–52 (1999)

    Google Scholar 

  11. Zhou, L., Lu, Y., Tan, C.L.: Bangla/English Script identification based on analysis of connected component profiles. In: Lecture Notes in Computer Science, vol. 3872/2006, pp. 243–254 (2006)

    Google Scholar 

  12. Singhal, V., Navin, N., Ghosh, D.: Script-based classification of hand-written text document in a multilingual environment. In: Research Issues in Data Engineering, pp. 47–54 (2003)

    Google Scholar 

  13. Hangarge, M., Santosh, K.C., Pardeshi, R.: Directional discrete cosine transform for handwritten script identification. In: Proceedings of 12th International Conference on Document Analysis and Recognition, pp. 344–348 (2013)

    Google Scholar 

  14. Pardeshi, R., Chaudhury, B.B., Hangarge, M., Santosh, K.C.: Automatic handwritten indian scripts identification. In: Proceedings of 14th International Conference on Frontiers in Handwriting Recognition, pp. 375–380 (2014)

    Google Scholar 

  15. Roy, K., Banerjee, A., Pal, U.: A system for word-wise handwritten script identification for indian postal automation. In: Proceedings of IEEE India Annual Conference, pp. 266–271 (2004)

    Google Scholar 

  16. Bradski, G., Kaehler, A.: Learning OpenCV. O’Reilly Media, Sebastopol (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sk. Md. Obaidullah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer India

About this paper

Cite this paper

Obaidullah, S.M., Halder, C., Das, N., Roy, K. (2016). Visual Analytic-Based Technique for Handwritten Indic Script Identification—A Greedy Heuristic Feature Fusion Framework. In: Das, S., Pal, T., Kar, S., Satapathy, S., Mandal, J. (eds) Proceedings of the 4th International Conference on Frontiers in Intelligent Computing: Theory and Applications (FICTA) 2015. Advances in Intelligent Systems and Computing, vol 404. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2695-6_19

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-2695-6_19

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-2693-2

  • Online ISBN: 978-81-322-2695-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics