Abstract
Recognition of multi-script documents, both printed and handwritten, remains a challenge because OCR is script dependent. Script identification is therefore a significant step in the design of a multi-script OCR system. In this paper, we focus on wordwise script identification, since several scripts are often mixed within a single line. We present a method that comprises three main steps: word extraction, feature computation, and classification. Words are extracted using morphological dilation. Radon and wavelet transforms are employed to extract features based on directional and multi-resolution analysis. For classification, the performance of LDA, SVM, and KNN classifiers is studied separately. Experiments on our dataset of Kannada and Roman words show that the presented method is robust for wordwise handwritten script identification.
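The first two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `gap` parameter, the four projection angles, the single-level Haar wavelet, and the mean-absolute-coefficient features are all assumptions chosen for illustration, and the Radon transform is approximated by rotating the image and summing its columns. The input is assumed to be an already binarized text line with foreground pixels set to True.

```python
import numpy as np
from scipy import ndimage


def extract_words(binary_line, gap=3):
    """Split a binarized text line into word crops (illustrative only).

    Horizontal dilation merges characters separated by at most 2 * gap
    background pixels; each connected blob is then treated as one word.
    """
    structure = np.ones((1, 2 * gap + 1), dtype=bool)
    merged = ndimage.binary_dilation(binary_line, structure=structure)
    labels, _ = ndimage.label(merged)
    # Bounding boxes of the merged blobs, ordered left to right.
    boxes = sorted(ndimage.find_objects(labels), key=lambda box: box[1].start)
    # Crop the original (undilated) pixels inside each box.
    return [binary_line[box] for box in boxes]


def haar_level1(signal):
    """One level of the Haar wavelet transform of a 1-D signal."""
    if signal.size % 2:
        signal = np.append(signal, signal[-1])  # pad to even length
    even, odd = signal[0::2], signal[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)


def radon_wavelet_features(word, angles=(0, 45, 90, 135)):
    """Directional projections followed by a wavelet decomposition.

    Each projection is a rotate-and-sum approximation of the Radon
    transform at one angle. The feature choice is an assumption.
    """
    image = word.astype(float)
    features = []
    for angle in angles:
        rotated = ndimage.rotate(image, angle, reshape=True, order=1)
        projection = rotated.sum(axis=0)
        approx, detail = haar_level1(projection)
        features += [np.abs(approx).mean(), np.abs(detail).mean()]
    return np.array(features)


# Synthetic line: two characters 2 pixels apart, then a distant third blob.
line = np.zeros((20, 60), dtype=bool)
line[5:15, 5:10] = True
line[5:15, 12:17] = True
line[5:15, 35:41] = True

words = extract_words(line, gap=3)
vectors = [radon_wavelet_features(word) for word in words]
```

On the synthetic line, the two nearby characters are merged into one word and the distant blob becomes a second word. The resulting feature vectors could then be passed to any of the classifiers the abstract mentions, for example scikit-learn's `KNeighborsClassifier`.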
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Veershetty, C., Pardeshi, R., Hangarge, M., Dhawale, C. (2018). Radon and Wavelet Transforms for Handwritten Script Identification. In: Perez, G., Tiwari, S., Trivedi, M., Mishra, K. (eds) Ambient Communications and Computer Systems. Advances in Intelligent Systems and Computing, vol 696. Springer, Singapore. https://doi.org/10.1007/978-981-10-7386-1_63
DOI: https://doi.org/10.1007/978-981-10-7386-1_63
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7385-4
Online ISBN: 978-981-10-7386-1
eBook Packages: Engineering, Engineering (R0)