A Dataset and a Novel Neural Approach for Optical Gregg Shorthand Recognition

  • Fangzhou Zhai
  • Yue Fan
  • Tejaswani Verma
  • Rupali Sinha
  • Dietrich Klakow
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11107)


Gregg shorthand is the most popular form of pen stenography in the United States and has been adapted for many other languages. To explore the potential of optical recognition of Gregg shorthand, we develop and present Gregg-1916, a dataset comprising Gregg shorthand scripts for about 16 thousand common English words. In addition, we present a novel architecture for shorthand recognition that exhibits promising performance and opens the path to various further directions.


Optical Gregg shorthand recognition, Character recognition, Convolutional neural networks, Recurrent neural networks



Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Fangzhou Zhai (1), corresponding author
  • Yue Fan (1)
  • Tejaswani Verma (1)
  • Rupali Sinha (1)
  • Dietrich Klakow (1)
  1. Spoken Language Systems, Saarland Informatics Campus, Saarbrücken, Germany
