Abstract
In this paper, we propose an approach for the separation of overlapping and touching lines within handwritten Arabic documents. Our approach is based on the morphology analysis of the terminal letters of Arabic words. Starting from 4 categories of possible endings, we use the angular variance to follow the connection and separate the endings. The proposed separation scheme has been evaluated on 100 documents contains 640 overlapping and touching occurrences reaching an accuracy of about 96.88%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chen, Y., Leedham, G.: Independent Component Analysis Segmentation Algorithm. In: 8th International Conference on Document Analysis and Recognition, pp. 680–684 (2005)
Louloudis, G., Gatos, B., Halatsis, C.: Line And Word Segmentation of Handwritten Documents. In: 11th International Conference on Frontiers in Handwriting Recognition, Canada, pp. 599–603 (2008)
Takru, K., Leedham, G.: Separation of touching and overlapping words in adjacent lines of handwritten text. In: International Workshop on Frontiers in Handwriting Recognition, pp. 496–501 (2002)
Hyvarinen, A.: Survey on Independent Component Analysis. Helsinki University of Technology, Finland (1999)
Lüthy, F., Varga, T., Bunke, H.: Using hidden Markov models as a tool for handwritten text line segmentation. In: 9th Int. Conf. on Document Analysis and Recognition, pp. 8–12 (2007)
Zahour, A., Likforman-Sulem, L., Boussellaa, W., Taconet, B.: Text Line Segmentation of Historical Arabic Documents. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition, Brazil, pp. 138–142 (2007)
Bukhari, S.S., Shafait, F., Breuel, T.M.: Segmentation of Curled Text Lines using Active Contours. In: Proceedings of Eight IAPR Workshop on Document Analysis Systems, pp. 270–277 (2008)
Shi, Z., Govindaraju, V.: Line Separation for Complex Document Images Using Fuzzy Run length. In: Proc. of the Int. Workshop on Document Image Analysis for Libraries, Palo, Alto, CA (2004)
Ouwayed, N., Belaïd, A.: Multi-oriented Text Line Extraction from Handwritten Arabic Documents. In: The Eighth IAPR International Workshop on Document Analysis Systems (DAS 2008), Japan, pp. 339–346 (2008)
Lam, L., Lee, S.-W., Suen, C.Y.: Thinning Methodologies-A Comprehensive Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 14(9), 869–885 (1992)
Dori, D., Liu, W.: Stepwise recovery of arc segmentation in complex line environments. International Journal on Document Analysis and Recognition 1(1), 62–71 (1998)
Ballard, D.H.: Generalizing the Hough Transform to detect arbitrary shapes. Pattern Recognition 13(2), 111–122 (1981)
Rosin, P.L., West, G.A.: Segmentation of Edges into Lines and Arcs. Image and Vision Computing 7(2), 109–114 (1989)
Stanford University, USA, http://www.stanford.edu/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ouwayed, N., Belaïd, A. (2009). Separation of Overlapping and Touching Lines within Handwritten Arabic Documents. In: Jiang, X., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2009. Lecture Notes in Computer Science, vol 5702. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03767-2_29
Download citation
DOI: https://doi.org/10.1007/978-3-642-03767-2_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03766-5
Online ISBN: 978-3-642-03767-2
eBook Packages: Computer ScienceComputer Science (R0)