Abstract
Handwritten digit recognition has long been a challenging problem in optical character recognition and is of great importance in industry. This paper develops a new approach to handwritten digit recognition that uses a small number of patterns in the training phase. To improve the performance of isolated Farsi/Arabic handwritten digit recognition, we use the Bag of Visual Words (BoVW) technique to construct image feature vectors. Each visual word is described by the Scale Invariant Feature Transform (SIFT) method. A Quantum Neural Network (QNN) classifier is used to learn the feature vectors. Experimental results on a very popular Farsi/Arabic handwritten digit dataset (the HODA dataset) show that the proposed method can achieve the highest recognition rate compared with other state-of-the-art methods.
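The BoVW step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a plain k-means codebook, hard assignment of each descriptor to its nearest visual word, and an L1-normalised histogram as the image feature vector. Random 128-dimensional vectors stand in for SIFT descriptors, and the function names `build_codebook` and `bovw_histogram`, the codebook size of 16, and the iteration count are hypothetical choices made only for illustration.

```python
import numpy as np


def build_codebook(descriptors, k, n_iter=20, seed=0):
    """Cluster local descriptors with plain k-means (Lloyd's algorithm).

    Returns a (k, d) array whose rows are the visual words.
    """
    rng = np.random.default_rng(seed)
    # Initialise the centres with k distinct descriptors picked at random.
    start = rng.choice(len(descriptors), size=k, replace=False)
    centers = descriptors[start].astype(float)
    for _ in range(n_iter):
        # Squared Euclidean distance from every descriptor to every centre.
        d2 = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members):  # leave an empty cluster's centre unchanged
                centers[j] = members.mean(axis=0)
    return centers


def bovw_histogram(descriptors, codebook):
    """Encode one image as an L1-normalised histogram of visual words."""
    k = len(codebook)
    if len(descriptors) == 0:
        return np.zeros(k)
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)  # index of the nearest visual word
    hist = np.bincount(words, minlength=k).astype(float)
    return hist / hist.sum()


# Stand-in data (assumption): random vectors in place of real SIFT descriptors.
rng = np.random.default_rng(42)
train_descriptors = rng.random((500, 128))  # pooled from all training images
image_descriptors = rng.random((30, 128))   # descriptors of one digit image

codebook = build_codebook(train_descriptors, k=16)
hist = bovw_histogram(image_descriptors, codebook)
print(hist.shape)
```

Under these assumptions, every image is mapped to a vector of the same length (the codebook size) no matter how many keypoints it contains, which is what allows a fixed-input classifier to be trained on the result.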
Cite this article
Montazer, G.A., Soltanshahi, M.A. & Giveki, D. Farsi/Arabic handwritten digit recognition using quantum neural networks and bag of visual words method. Opt. Mem. Neural Networks 26, 117–128 (2017). https://doi.org/10.3103/S1060992X17020060