Abstract
In a country like India where more number of scripts are in use, automatic identification of printed and handwritten script facilitates many important applications including sorting of document images and searching online archives of document images. In this paper, a multiple feature based approach is presented to identify the script type of the collection of handwritten documents. Eight popular Indian scripts are considered here. Features are extracted using Gabor filters, Discrete Cosine Transform, and Wavelets of Daubechies family. Experiments are performed to test the recognition accuracy of the proposed system at line level for bilingual scripts and later extended to trilingual scripts. We have obtained 100% recognition accuracy for bi-scripts at line level. The classification is done using k-nearest neighbour classifier.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dhandra, B.V., Hangarge, M.: Offline Handwritten Script Identification in Document Images. International Journal of Computer Applications 4(5), 1–5 (2010)
Elgammmal, A.M., Ismail, M.A.: Techniques for Language Identification for Hybrid Arabic-English Document Images. In: Proc. Sixth Int’l Conf. Document Analysis and Recognition, pp. 1100–1104 (2001)
Rajput, G.G., Anita, H.B.: A Two Step Approach for Deskewing Handwritten and Machine Printed Document Images using Histograms and Geometric features. In: Proc. of Second Intl. Conf. on Signal and Image Processing, pp. 414–417 (2009)
Rajput, G.G., Anita, H.B.: Handwritten Script Recognition using DCT and Wavelet Features at Block Level. IJCA, Special Issue on RTIPPR (3), 158–163 (2010)
Rajput, G.G., Anita, H.B.: Kannada, English, and Hindi Handwritten Script Recognition using multiple features. In: Proc. of National Seminar on Recent Trends in Image Processing and Pattern Recognition, pp. 149–152 (2010) ISBN: 93-80043-74-0
Gonzalez, Woods: Digital Image processing, 3/e, Pearson Education (2008)
Hochberg, J., Bowers, K., Cannon, M., Keely, P.: Script and language identification for handwritten document images. IJDAR 2, 45–52 (1999)
Roy, K., Banerjee, A., Pal, U.: A System for Wordwise Handwritten Script Identification for Indian Postal Automation. In: Proc. IEEE India Annual Conference 2004 (INDICON 2004), pp. 266–271 (2004)
Padma, M.C., Vijaya, P.A.: Script Identification From Trilingual Documents Using Profile Based Features. International Journal of Computer Science and Applications, Technomathematics Research Foundation 7(4), 16–33 (2010)
Otsu, N.: A Threshold Selection Method from Gray-Level Histogram. IEEE Transaction Systems, Man and Cybernetics 9(1), 62–66 (1979)
Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script. Journal of Computing 2(2) (2010) ISSN 2151-9617
Abirami, S., Manjula, D.: A Survey of Script Identification techniques for Multi-Script Document Images. International Journal of Recent Trends in Engineering 1(2) (2009)
Pal, U., Chaudhuri, B.B.: Automatic identification of English, Chinese, Arabic, Devanagari and Bangla script line. In: Proc. 6th Intl. Conf: Document Analysis and Recognition (ICDAR 2001), pp. 790–794 (2001)
Pal, U., Chaudhury, B.B.: Identification of Different Script Lines from Multi-Script Documents. Image and Vision Computing 20(13-14), 945–954 (2002)
Pal, U., Chaudhuri, B.B.: Script Line Separation from Indian Multi-Script Documents. In: 5th ICDAR, pp. 406–409 (1999)
Pal, U., Sinha, S., Chaudhuri, B.B.: Multi-Script Line identification from Indian Documents. In: Seventh International Conference on Document Analysis and Recognition, ICDAR, vol. 2, p. 880 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Rajput, G.G., Anita, H.B. (2012). Handwritten Script Recognition Using DCT, Gabor Filter and Wavelet Features at Line Level. In: Patnaik, S., Yang, YM. (eds) Soft Computing Techniques in Vision Science. Studies in Computational Intelligence, vol 395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25507-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-25507-6_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25506-9
Online ISBN: 978-3-642-25507-6
eBook Packages: EngineeringEngineering (R0)