Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition

Wei, Hongxi; Wen, Ya; Wang, Weiyuan; Gao, Guanglai

doi:10.1007/978-3-030-04221-9_24

Hongxi Wei^16,17,
Ya Wen^16,17,
Weiyuan Wang^16,17 &
…
Guanglai Gao^16,17

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11305))

Included in the following conference series:

International Conference on Neural Information Processing

2450 Accesses
4 Citations

Abstract

Although font recognition is a fundamental issue in the field of document analysis and recognition, it was usually ignored in the past. With the development of optical character recognition (OCR), font recognition becomes more and more important. This paper proposed a well-designed convolutional neural network (CNN) architecture for traditional Mongolian font recognition by means of a single word. To be specific, the whole word image is regarded as input of CNN. Hence, the word images should be normalized into the same size before being inputted into CNN. By comparison, an appropriate aspect ratio for the traditional Mongolian word images has been determined. Experimental results demonstrate that the proposed CNN architecture outperforms three classic CNN architectures, including LeNet-5, AlexNet and GoogLeNet. Therefore, the proposed CNN is much more suitable for the task of the traditional Mongolian font recognition in the way of a single word.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zramdini, A., Ingold, R.: Optical font recognition using typographical features. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 877–882 (1998)
Article Google Scholar
Jung, M., Shin, Y., Srihari, N.: Multifont classification using typographical attributes. In: Proceedings of ICDAR 1999, pp. 353–356. IEEE Press, New York (1999)
Google Scholar
Moussa, B., Zahour, A., Benabdelhafid, A., Alimi, M.: New features using fractal multi-dimensions for generalized Arabic font recognition. Pattern Recogn. Lett. 31(5), 361–371 (2010)
Article Google Scholar
Lutf, M., You, X., Cheung, Y., Chen, P.: Arabic font recognition based on diacritics features. Pattern Recogn. 47(2), 672–684 (2014)
Article Google Scholar
Zhu, Y., Tan, T., Wang, Y.: Font recognition based on global texture analysis. IEEE Trans. Pattern Anal. Mach. Intell. 23(10), 1192–1200 (2001)
Article Google Scholar
Ding, Q., Li, C., Tao, W.: Character independent font recognition on a single Chinese character. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 195–204 (2007)
Article Google Scholar
Song, W., Lian, Z., Tang, Y., Xiao, J.: Content-independent font recognition on a single Chinese character using sparse representation. In: Proceedings of ICDAR 2015, pp. 376–380. IEEE Press, New York (2015)
Google Scholar
Joshi, G., Garg, S., Sivaswamy, J.: A generalized framework for script identification. Int. J. Doc. Anal. Recogn. 10(2), 55–68 (2007)
Article Google Scholar
Tao, D., Lin, X., Jin, L., Li, X.: Principal component 2-D long short-term memory for font recognition on single Chinese characters. IEEE Trans. Cybern. 46(3), 756–765 (2016)
Article Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. In: Proceedings of NIPS 2012, pp. 1097–1105. Curran Associates Inc. (2012)
Google Scholar
Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In: Proceedings of CVPR 2012, pp. 3642–3649. IEEE Press, New York (2012)
Google Scholar
Oquab, M., Bottou, L., Laptev, I., Sivic, J.: Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of CVPR 2014, pp. 1717–1724. IEEE Press, New York (2014)
Google Scholar
Gao, P., Gu, G., Wu, J., Wei, B.: Chinese calligraphic style representation for recognition. Int. J. Doc. Anal. Recogn. 20(1), 59–68 (2017)
Article Google Scholar
Tensmeyer, C., Saunders, D., Martinez, T.: Convolutional neural networks for font classification. In: Proceedings of ICDAR 2017, pp. 985–990. IEEE Press, New York (2017)
Google Scholar
Wei, H., Gao, G.: A keyword retrieval system for historical Mongolian document images. Int. J. Doc. Anal. Recogn. 17(1), 33–45 (2014)
Article Google Scholar
Wei, H., Gao, G.: Machine-printed traditional Mongolian characters recognition using BP neural networks. In: Proceedings of CiSE 2009, pp. 1–7. IEEE Press, New York (2009)
Google Scholar
Hu, H., Wei, H., Liu, Z.: The CNN based machine-printed traditional Mongolian characters recognition. In: Proceedings of CCC 2017, pp. 3937–3941. IEEE Press, New York (2017)
Google Scholar
Zhang, H., Wei, H., Bao, F., Gao, G.: Segmentation-free printed traditional Mongolian OCR using sequence to sequence with attention model. In: Proceedings of ICDAR 2017, pp. 585–590. IEEE Press, New York (2017)
Google Scholar
Ma, L., Liu, J., Wu, J.: A new database for online handwritten Mongolian word recognition. In: Proceedings of ICPR 2016, pp. 1131–1136. IEEE Press, New York (2016)
Google Scholar
Hinton, G., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.: Going deeper with convolutions. In: Proceedings of CVPR 2015, pp. 1–9. IEEE Press, New York (2015)
Google Scholar
Wei, H., Gao, G., Bao, Y.: A method for removing inflectional suffixes in word spotting of Mongolian Kanjur. In: Proceedings of ICDAR 2011, pp. 88–92. IEEE Press, New York (2011)
Google Scholar
Wei, H., Zhang, H., Gao, G.: Representing word image using visual word embeddings and RNN for keyword spotting on historical document images. In: Proceedings of ICME 2017, pp. 1368–1373. IEEE Press, New York (2017)
Google Scholar

Download references

Acknowledgements

This paper is supported by the National Natural Science Foundation of China under Grant 61463038.

Author information

Authors and Affiliations

School of Computer Science, Inner Mongolia University, Hohhot, 010021, China
Hongxi Wei, Ya Wen, Weiyuan Wang & Guanglai Gao
Provincial Key Laboratory of Mongolian Information Processing Technology, Hohhot, China
Hongxi Wei, Ya Wen, Weiyuan Wang & Guanglai Gao

Authors

Hongxi Wei
View author publications
You can also search for this author in PubMed Google Scholar
Ya Wen
View author publications
You can also search for this author in PubMed Google Scholar
Weiyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Guanglai Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongxi Wei .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wei, H., Wen, Y., Wang, W., Gao, G. (2018). Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11305. Springer, Cham. https://doi.org/10.1007/978-3-030-04221-9_24

Download citation

DOI: https://doi.org/10.1007/978-3-030-04221-9_24
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04220-2
Online ISBN: 978-3-030-04221-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics