Touching Character Segmentation Method of Archaic Lanna Script
In general, character recognition consists of four stages: image preprocessing, segmentation, feature extraction, and classification. Character segmentation is one of the most important and difficult tasks in character recognition. Incorrectly segmented characters are not likely to be correctly recognized. Touching characters, which always arises when handwritten characters are being segmented, makes the task even more difficult. Therefore, this paper emphasizes the interest to the segmentation of touching and overlapping characters. This paper proposes two new techniques which are shown to dramatically improve the segmentation accuracy. The first proposed technique emphasizes on converting a greyscale image to a binary image while the second proposed technique emphasizes on the process of character segmentation itself. In the proposed character segmentation process, the bounding box analysis is initially employed to segment the document image into images of isolated characters and images of touching characters. The thinning algorithm is applied to extract the skeleton of the touching characters. Next, the skeleton of the touching characters is separated into several pieces. Finally, the separated pieces of the touching characters are put back to reconstruct two isolated characters. The proposed algorithm achieves an accuracy of 89.26%.
KeywordsCharacter segmentation Touching character Dissection method Archaic script
Unable to display preview. Download preview PDF.
- 1.Bhowmik, T.K., Roy, A., Roy, U.: Character Segmentation for Handwritten Bangla Words Using Artificial Neural Network. In: Proceedings of the International Workshop on Neural Networks and Learning in Document Analysis and Recognition (2005)Google Scholar
- 4.Hoang, T.V., Tabbone, S., Pham, N.: Recognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram. In: Proceedings of the 13th International Conference on Computer Analysis of Images and Patterns (2009)Google Scholar
- 6.Soba, T., Sulong, G., Rehman, A.: A Survey on Methods and Strategies on Touched Characters Segmentation. International Journal of Research and Reviews in Computer Science 1(2), 103–114 (2010)Google Scholar