Abstract
Optical character recognition (OCR) systems for the Gujrati language were among the earliest OCR systems and occupy a significant place in pattern recognition. They have been used successfully in a wide array of commercial applications. This chapter investigates the challenges involved in building OCR systems for the Gujrati language. The pre-processing activities, namely binarization, noise removal, skew detection, character segmentation and thinning, are performed on the datasets considered. Feature extraction is performed through fuzzy genetic algorithms (GA). Feature-based classification is performed through soft computing techniques, viz. rough fuzzy multilayer perceptron (RFMLP), fuzzy support vector machine (FSVM), fuzzy rough support vector machine (FRSVM) and fuzzy Markov random fields (FMRF). The experimental results demonstrate the superiority of the soft computing techniques.
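As an illustration of the first pre-processing step named above, the following is a minimal sketch of binarization. The abstract does not say which binarization method the chapter uses; Otsu's global threshold is assumed here purely as a common choice, and the function names `otsu_threshold` and `binarize` are hypothetical.

```python
from typing import List


def otsu_threshold(pixels: List[int]) -> int:
    """Return the gray level t (0-255) that maximizes between-class variance.

    Assumption: Otsu's method stands in for the chapter's binarization step,
    which the abstract does not specify.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels with value <= t
    sum_bg = 0  # sum of the values of those pixels
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue  # no pixels at or below t yet
        w_fg = total - w_bg
        if w_fg == 0:
            break  # every pixel is at or below t; nothing left to split
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image: List[List[int]]) -> List[List[int]]:
    """Map a grayscale image to 1 (dark, assumed ink) and 0 (background)."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    return [[1 if p <= t else 0 for p in row] for row in image]


# Tiny synthetic example: three dark pixels and three bright ones.
sample = [[10, 12, 200],
          [11, 210, 205]]
binary = binarize(sample)
```

The sketch treats dark pixels as ink, which suits dark text on a light page; a scan with inverted polarity would need the comparison flipped.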
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Chaudhuri, A., Mandaviya, K., Badelia, P., Ghosh, S.K. (2017). Optical Character Recognition Systems for Gujrati Language. In: Optical Character Recognition Systems for Different Languages with Soft Computing. Studies in Fuzziness and Soft Computing, vol 352. Springer, Cham. https://doi.org/10.1007/978-3-319-50252-6_9
DOI: https://doi.org/10.1007/978-3-319-50252-6_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50251-9
Online ISBN: 978-3-319-50252-6
eBook Packages: Engineering; Engineering (R0)