Skip to main content

A New Experiment on Bengali Character Recognition

  • Conference paper
Ubiquitous Computing and Multimedia Applications (UCMA 2010)

Abstract

This paper presents a method to use View based approach in Bangla Optical Character Recognition (OCR) system providing reduced data set to the ANN classification engine rather than the traditional OCR methods. It describes how Bangla characters are processed, trained and then recognized with the use of a Backpropagation Artificial neural network. This is the first published account of using a segmentation-free optical character recognition system for Bangla using a view based approach. The methodology presented here assumes that the OCR pre-processor has presented the input images to the classification engine described here. The size and the font face used to render the characters are also significant in both training and classification. The images are first converted into greyscale and then to binary images; these images are then scaled to a fit a pre-determined area with a fixed but significant number of pixels. The feature vectors are then formed extracting the characteristics points, which in this case is simply a series of 0s and 1s of fixed length. Finally, an artificial neural network is chosen for the training and classification process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kahan, S., Pavlidis, T.: Recognition of printed characters of any font and size. IEEE Trans. Pattern Anal. Arid Mach. lnteN. 9, 274–288 (1987)

    Article  Google Scholar 

  2. Milky Mahmud, S.M., Shahrier, N., Delowar Hossain, A.S.M., Tareque Mohmud Chowdhury, M., Abdus Sattar, M.: An Efficient Segmentation Scheme for the Recognition of Printed Bangla characters. In: Proceedings of ICCIT 2003, pp. 283–286 (2003)

    Google Scholar 

  3. Dutta, A., Chaudhury, S.: Bengali Alpha- Numeric Character Recognition Using Curvature Features. Pattern Recognition 26, 1707–1720 (1993)

    Article  Google Scholar 

  4. Chaudhuri, B.B., Pal, U.: A Complete Printed Bangla OCR System. Pattern Recognition 31, 531–549 (1997); Graphics and Image Processing, NCCIS (1997)

    Google Scholar 

  5. Mahmud, J.U., Raihan, M.F., Rahman, C.M.: A Complete OCR System for continuous Bengali Character. In: TENCON 2003, Conference on Convergent Technologies for Asia-Pacific Region, October 15-17 (2003)

    Google Scholar 

  6. Majumdar, A.: Bangla Basic Character Recognition using Digital Curvlet Transform. Journal of Pattern Recognition Research 1, 17–26 (2007)

    Google Scholar 

  7. Roy, A.K., Chatterjee, B.: Design of nearest neighbour classifier for Bengali character recognition. J.IEEE 30 (1984)

    Google Scholar 

  8. Abul Hasnat, M.: Research Report on Bangla OCR Training and Testing Methods, Working Papers (2004-2007)

    Google Scholar 

  9. Garain, U., Chaudhuri, B.B.: Segmentation of Touching Characters in Printed Devnagari and Bangla Scripts Using Fuzzy Multifactorial Analysis. IEEE Transactions on Systems, MAN, and Cybernetics—Part C: Applications and Reviews 32(4) (November 2002)

    Google Scholar 

  10. Rybnik, M., Chebira, A., Madani, K., Saeed, K., Tabedzki, M., Adamski, M.: A Hybrid Neural-Based Information-Processing Approach Combining a View-Based Feature Extractor and a Treelike Intelligent Classifier. In: CISIM – Compute Information Systems and Industrial Management Applications, pp. 66–73. WSFiZ Press, Bialystok (2003)

    Google Scholar 

  11. Saeed, K., Tabedzki, M.: A New Hybrid System for Recognition of Handwritten-Script. COMPUTING – International Scientific Journal of Computing 3(1), 50–57 (2004); 332 Advances in Information Processing and Protection, Ternopil

    Google Scholar 

  12. Saeed, K., Tabedzki, M.: New Experiments on Word Recognition Without Segmentation. In: Conference Proceedings of ACS-CISIM 2007 under the title, A Hybrid Word-Recognition System (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Barman, S., Bhattacharyya, D., Jeon, Sw., Kim, Th., Kim, HK. (2010). A New Experiment on Bengali Character Recognition. In: Tomar, G.S., Grosky, W.I., Kim, Th., Mohammed, S., Saha, S.K. (eds) Ubiquitous Computing and Multimedia Applications. UCMA 2010. Communications in Computer and Information Science, vol 75. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13467-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13467-8_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13466-1

  • Online ISBN: 978-3-642-13467-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics