Abstract
Recently the Convolutional Recurrent Neural Network (CRNN) architecture has shown success in many string recognition tasks and residual connections are applied to most network architectures. In this paper, we embrace these observations and present a new string recognition model named Residual Convolutional Recurrent Neural Network (Residual CRNN, or Res-CRNN) based on CRNN and residual connections. We add residual connections to convolutional layers as well as recurrent layers in CRNN. With residual connections, the proposed method extracts more efficient image features and make better predictions than ordinary CRNN. We apply this new model to handwritten digit string recognition task (HDSR) and obtain significant improvements on HDSR benchmarks ORAND-CAR-A and ORAND-CAR-B.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Diem, M., et al.: ICFHR 2014 competition on handwritten digit string recognition in challenging datasets (HDSRC 2014). In: 14th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 779–784 (2014)
Saabni, R.: Recognizing handwritten single digits and digit strings using deep architecture of neural networks. In: The 3th International Conference on Artificial Intelligence and Pattern Recognition, pp. 1–6 (2016)
Gattal, A., Chibani, Y., Hadjadji, B.: Segmentation and recognition system for unknown-length handwritten digit strings. Pattern Anal. Appl. 20(2), 307–323 (2017)
Graves, A., Fernández, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: The 23rd International Conference on Machine Learning, pp. 369–376 (2006)
Messina, R., Louradour, J.: Segmentation-free handwritten Chinese text recognition with LSTM-RNN. In: 13th IAPR International Conference on Document Analysis and Recognition, pp. 171–175 (2015)
Wang, Q.-F., Yin, F., Liu, C.-L.: Handwritten chinese text recognition by integrating multiple contexts. IEEE Trans. Pattern Anal. Mach. Intell. 34(8), 1469–1481 (2012)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:1409.1556 (2014)
Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: The 29th IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Huang, G., Liu, Z., Weinberger, K.Q., van der Maaten, L.: Densely connected convolutional networks, arXiv preprint arXiv:1608.06993 (2016)
Gao, H., Zhuang, L., Maaten, L.V.D., Weinberger, K.Q.: Densely connected convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition (2017)
Zhan, H., Wang, Q., Lu, Y.: Handwritten digit string recognition by combination of residual network and RNN-CTC. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.S. (eds.) ICONIP 2017. LNCS, vol. 10639, pp. 583–591. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-70136-3_62
Jia, Y., et al.: Caffe: convolutional architecture for fast feature embedding, arXiv preprint arXiv:1408.5093 (2014)
Wang, Q., Lu, Y.: A sequence labeling convolutional network and its application to handwritten string recognition. In: Twenty-Sixth International Joint Conference on Artificial Intelligence, pp. 2950–2956 (2017)
Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. (2016). https://doi.org/10.1109/TPAMI.2016.2646371
Acknowledgement
The work is supported by Shanghai Natural Science Foundation (No. 19ZR1415900).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhan, H., Lyu, S., Tu, X., Lu, Y. (2019). Residual CRNN and Its Application to Handwritten Digit String Recognition. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Communications in Computer and Information Science, vol 1143. Springer, Cham. https://doi.org/10.1007/978-3-030-36802-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-36802-9_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36801-2
Online ISBN: 978-3-030-36802-9
eBook Packages: Computer ScienceComputer Science (R0)