Abstract
In the advent of digital computers and era where work force is shifted to be inclined on robotic process, Optical Character Recognition (OCR) has immense potentials to ease some these processes. Segmentation is one of the pre-processing phases- the pivotal essence of the process where lingual scripts and their characteristics vary to a much larger extent. This paper focuses on techniques which facilitates segmentation in Devanagari script (Hindi) for offline handwritten words i.e. Headline detection in handwritten word images of Hindi for extracting upper and middle zone characters and cropping. Experiments are performed on the handwritten legal amount words ICDAR database [1] on 106 words by 80 writers and on Self created touching character database on 106 words by 15 writers. The proposed zoning technique i.e. CPT (Continuous pixel technique) and cropping techniques is implemented on 10070 and 530 legal amount words with 98.89% accuracy and 80.94% respectively.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Jayadevan, R., Kolhe, S.R., Patil, P.M., Pal, U.: Database development and recognition of handwritten Devanagari legal amount words. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 304–308 (2011)
Kumar, S.: An analysis of irregularities in Devanagari script writing—a machine recognition perspective. Int. J. Comput. Sci. Eng. 2, 274–279 (2010)
Choudhary, A., Rishi, R., Ahlawat, S.: New character segmentation approach for off-line cursive handwritten words. Procedia Comput. Sci. 17, 88–95 (2013)
Elnagar, A., Alhajj, R.: Segmentation of connected handwritten numeral strings. Pattern Recognit. 36, 625–634 (2003)
Jayarathna, U.K.S., Bandara, G.E.M.D.C.: A junction based segmentation algorithm for offline handwritten connected character segmentation. In: International Conference on Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, p. 147 (2006)
Kim, K.K., Kim, J.H., Suen, C.Y.: Segmentation-based recognition of handwritten touching pairs of digits using structural features. Pattern Recognit. Lett. 23, 13–24 (2002)
Saba, T., Sulong, G., Rehman, A.: Non-linear segmentation of touched roman characters based on genetic algorithm. Int. J. Comput. Sci. Eng. 2, 2167–2172 (2010)
Reddy, L.P., Babu, T.R., Rao, N.V., Babu, B.R.: Touching syllable segmentation using split profile algorithm. Int. J. Comput. Sci. Issues (IJCSI) 7(3), 1–10 (2010)
Bag, S., Bhowmick, P., Harit, G., Biswas, A.: Character segmentation of handwritten Bangla text by vertex characterization of isothetic covers. In: 2011 Third National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 21–24 (2011)
Venkatesh, M., Majjagi, V., Vijayasenan, D.: Implicit segmentation of Kannada characters in offline handwriting recognition using hidden Markov models. Implicit arXiv1410.4341, pp. 1–6 (2014)
Bag, A.S., Krishna: Character segmentation of Hindi unconstrained handwritten words. In: International Workshop on Combinatorial Image Analysis, vol. 9448, pp. 247–260. Springer, Cham (2015)
Garg, N.K., Kaur, L., Jindal, M.K.: The hazards in segmentation of handwritten Hindi Text. Int. J. Comput. Appl. 29, 30–34 (2011)
Palakollu, S., Rani, R.: Handwritten Hindi text segmentation techniques for lines and characters. In: Proceedings of the World Congress on Engineering and Computer Science (2012)
Garg, N.K.: A new method for line segmentation of handwritten Hindi text key words. In: Seventh International Conference on Information Technology, pp. 392–397 (2010)
Hanmandlu, M.B.L., Agrawal, P.: Segmentation of handwritten Hindi text: a structural approach. Int. J. Comput. Proc. Languages 22(01), 1–20 (2001)
Bhujade, M.V.G., Meshram, M.C.M.: A technique for segmentation of handwritten Hindi text. Int. J. Eng. Res. Technol. 3, 1491–1495 (2014)
Ramteke, A.S., Rane, M.E.: Offline handwritten devanagari script segmentation. Int. J. Sci. Res. 1, 142–145 (2012)
Garain, U., Chaudhuri, B.B.: Segmentation of touching and fused Devanagari characters. Pattern Recognit 32, 449–459 (2002)
Bansal, V., Sinha, R.M.K.: Segmentation of touching and fused Devanagari characters. Pattern Recognit. 35, 875–893 (2002)
Kumar, M.: Segmentation of isolated and touching characters in offline handwritten Gurmukhi script recognition. Int. J. Inf. Technol. Comput. Sci. 2, 58–63 (2014)
Acknowledgment
I am thankful to Jayadevan R., ICDAR for support and providing word database of offline handwritten words database in Hindi.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kohli, M., Kumar, S. (2019). Improved Zoning and Cropping Techniques Facilitating Segmentation. In: Luhach, A., Singh, D., Hsiung, PA., Hawari, K., Lingras, P., Singh, P. (eds) Advanced Informatics for Computing Research. ICAICR 2018. Communications in Computer and Information Science, vol 955. Springer, Singapore. https://doi.org/10.1007/978-981-13-3140-4_58
Download citation
DOI: https://doi.org/10.1007/978-981-13-3140-4_58
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3139-8
Online ISBN: 978-981-13-3140-4
eBook Packages: Computer ScienceComputer Science (R0)