Classification of Bangla Compound Characters Using a HOG-CNN Hybrid Model

  • Conference paper
Proceedings of the International Conference on Computing and Communication Systems

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 24)

Abstract

Automatic handwriting recognition is a challenging task because of the sheer variety of acceptable stylistic differences. This is especially true for scripts with large character sets. Bangla, the sixth most widely spoken language in the world, has a large, complex, and rich set of compound characters. In this study, a hybrid deep learning model is proposed that combines a manually designed feature, the Histogram of Oriented Gradients (HOG), with the adaptively learned features of a Convolutional Neural Network (CNN). The proposed hybrid model was trained on CMATERDB 3.1.3.3, a Bangla compound character data set that divides Bangla compound characters into 177 broad classes and 199 specific classes. The results demonstrate that CNN-only models achieve over 91% and 92% test accuracy on the two divisions, respectively. Furthermore, the proposed model, which incorporates HOG features with a CNN, achieves over 92.50% test accuracy on each division. While there is still room for improvement, these results are significantly better than the currently published state of the art on this data set.
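
The abstract does not say how the HOG descriptor and the CNN features are combined, so the following is only a minimal sketch under stated assumptions: a simplified HOG (per-cell unsigned orientation histograms, no block normalisation) and fusion by plain concatenation before a classifier. The names `hog_features`, `fuse`, `cell_size`, `n_bins`, and `cnn_vector` are hypothetical and do not come from the paper.

```python
import math

def hog_features(image, cell_size=4, n_bins=9):
    """Simplified HOG for a 2-D grayscale image given as a list of rows.

    Assumption: one unsigned-orientation histogram per cell, L2-normalised
    per cell. A full HOG pipeline would also normalise over blocks of cells.
    """
    height, width = len(image), len(image[0])
    bin_width = 180.0 / n_bins
    features = []
    for top in range(0, height - cell_size + 1, cell_size):
        for left in range(0, width - cell_size + 1, cell_size):
            hist = [0.0] * n_bins
            for y in range(top, top + cell_size):
                for x in range(left, left + cell_size):
                    # Central differences, clamped at the image border.
                    gx = image[y][min(x + 1, width - 1)] - image[y][max(x - 1, 0)]
                    gy = image[min(y + 1, height - 1)][x] - image[max(y - 1, 0)][x]
                    magnitude = math.hypot(gx, gy)
                    # Unsigned orientation in [0, 180) degrees.
                    angle = math.degrees(math.atan2(gy, gx)) % 180.0
                    bin_index = int(angle / bin_width) % n_bins
                    hist[bin_index] += magnitude
            norm = math.sqrt(sum(v * v for v in hist)) + 1e-6
            features.extend(v / norm for v in hist)
    return features

def fuse(hog_vector, cnn_vector):
    """Hypothetical fusion step: concatenate hand-crafted and learned features."""
    return list(hog_vector) + list(cnn_vector)
```

In such a setup, `cnn_vector` would stand for the activations of a CNN's penultimate layer for the same image, and the fused vector would feed the final classification layer. Whether the authors fuse features in this way is not stated here.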

Acknowledgements

This work was supported by the ICT Division of the Ministry of ICT, Bangladesh [Grant number 56.00.0000.028.33.066.16-731].

Author information

Corresponding author

Correspondence to S. M. A. Sharif.

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Sharif, S.M.A., Mohammed, N., Momen, S., Mansoor, N. (2018). Classification of Bangla Compound Characters Using a HOG-CNN Hybrid Model. In: Mandal, J., Saha, G., Kandar, D., Maji, A. (eds) Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 24. Springer, Singapore. https://doi.org/10.1007/978-981-10-6890-4_39

  • DOI: https://doi.org/10.1007/978-981-10-6890-4_39

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6889-8

  • Online ISBN: 978-981-10-6890-4

  • eBook Packages: Engineering, Engineering (R0)
