Touching Character Segmentation Method of Archaic Lanna Script

  • Sakkayaphop Pravesjit
  • Arit Thammano
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 314)

Abstract

In general, character recognition consists of four stages: image preprocessing, segmentation, feature extraction, and classification. Character segmentation is one of the most important and difficult of these tasks, because incorrectly segmented characters are unlikely to be recognized correctly. Touching characters, which always arise when handwritten characters are segmented, make the task even more difficult. This paper therefore focuses on the segmentation of touching and overlapping characters. It proposes two new techniques, which are shown to dramatically improve segmentation accuracy. The first concerns the conversion of a greyscale image to a binary image; the second concerns the character segmentation process itself. In the proposed segmentation process, bounding box analysis is first used to divide the document image into images of isolated characters and images of touching characters. A thinning algorithm is then applied to extract the skeleton of the touching characters. Next, the skeleton is separated into several pieces. Finally, the separated pieces are reassembled to reconstruct two isolated characters. The proposed algorithm achieves an accuracy of 89.26%.
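
The first steps of the pipeline described in the abstract (binarization followed by bounding box analysis) can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not specify the binarization technique or the criterion that separates isolated from touching characters. The fixed threshold, the 8-connected component labelling, the `width_ratio` rule, and all function names below are assumptions made purely for illustration.

```python
from collections import deque
from statistics import median


def binarize(gray, threshold=128):
    """Map a greyscale grid (0-255) to binary: 1 = ink (dark), 0 = background.

    A fixed global threshold is an assumption; the paper proposes its own
    binarization technique, which the abstract does not describe.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def bounding_boxes(binary):
    """Return (left, top, right, bottom) for each 8-connected ink component."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited ink pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            left, top, right, bottom = c, r, c, r
            while queue:
                y, x = queue.popleft()
                left, right = min(left, x), max(right, x)
                top, bottom = min(top, y), max(bottom, y)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((left, top, right, bottom))
    return boxes


def split_isolated_touching(boxes, width_ratio=1.5):
    """Hypothetical rule: a box much wider than the median box is 'touching'."""
    widths = [right - left + 1 for left, _, right, _ in boxes]
    typical = median(widths)
    isolated, touching = [], []
    for box, width in zip(boxes, widths):
        (touching if width > width_ratio * typical else isolated).append(box)
    return isolated, touching
```

In this sketch, the boxes returned in the `touching` list would be the ones passed on to the thinning and skeleton separation steps, while the `isolated` boxes would go directly to feature extraction.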

Keywords

Character segmentation; Touching character; Dissection method; Archaic script

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Sakkayaphop Pravesjit (1)
  • Arit Thammano (1)
  1. Computational Intelligence Laboratory, Faculty of Information Technology, King Mongkut’s Institute of Technology Ladkrabang, Bangkok, Thailand
