Handwritten and Printed Text Separation: Linearity and Regularity Assessment

Hamrouni, Sameh; Cloppet, Florence; Vincent, Nicole

doi:10.1007/978-3-319-11758-4_42

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8814)

Included in the following conference series:

International Conference Image Analysis and Recognition

Abstract

In this paper, we address the problem of separating handwriting from machine-printed text in real documents. We present a reliable method based on a novel set of features, invariant to translation and scaling, that belong to two categories: linearity and regularity. Specifically, we introduce a novel linearity measure derived from the histogram of straight line segment lengths. The resulting framework is independent of the document layout and supports any language written in the Latin script. Its performance is assessed on a dataset of real documents comprising heterogeneous administrative images. Experimental results demonstrate its accuracy, with a recognition rate of up to 90%.

This work is funded by the PiXL project, supported by the “Fonds national pour la Société Numérique” of the French State (http://valconum.fr/index.php/les-projets/pixl).

The authors would like to thank ITESOFT for providing the dataset and for their help in carrying out the comparison with the method of Belaïd et al. [1].
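The abstract gives no formula for the linearity measure, so the following is only an illustrative sketch of how a feature could be built from a histogram of straight line segment lengths. It assumes that each stroke has already been approximated by a polyline and that the height of the text component is known. The function names, the bin width, the `min_ratio` threshold, and the score itself (the share of stroke length carried by long segments) are hypothetical choices, not the authors' definitions. Dividing lengths by the component height is one simple way to obtain the translation and scale invariance mentioned in the abstract.

```python
import math
from collections import Counter


def normalized_lengths(polyline, height):
    """Lengths of consecutive polyline segments, divided by the component height.

    Segment lengths do not depend on position (translation invariance), and
    dividing by the height cancels a uniform scaling (scale invariance).
    """
    return [math.dist(p, q) / height for p, q in zip(polyline, polyline[1:])]


def length_histogram(lengths, bin_width=0.25):
    """Histogram of normalized segment lengths: bin index -> segment count."""
    return Counter(int(length / bin_width) for length in lengths)


def linearity_score(lengths, min_ratio=0.5):
    """Hypothetical score: share of total stroke length in 'long' segments.

    A segment counts as long when its normalized length is at least min_ratio.
    Returns a value in [0, 1]; higher means more straight-line structure.
    """
    total = sum(lengths)
    if total == 0:
        return 0.0
    return sum(length for length in lengths if length >= min_ratio) / total


# A printed-like "L" shape: two long straight strokes.
printed = [(0, 0), (0, 10), (6, 10)]
# A cursive-like stroke: many short segments of varying direction.
cursive = [(0, 0), (1, 2), (2, 3), (3, 5), (4, 6), (5, 8), (6, 10)]

printed_score = linearity_score(normalized_lengths(printed, height=10))
cursive_score = linearity_score(normalized_lengths(cursive, height=10))
```

Under these assumptions the printed-like shape scores higher than the cursive-like one, which is the kind of contrast a linearity feature is meant to capture. A real system would feed such features, together with regularity features, to a classifier.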

References

1. Belaïd, A., Santosh, K.C., D’Andecy, V.P.: Handwritten and printed text separation in real document. CoRR, abs/1303.4614 (2013)
2. Zagoris, K., Pratikakis, I., Antonacopoulos, A., Gatos, B., Papamarkos, N.: Handwritten and machine printed text separation in document images using the bag of visual words. In: International Conference on Frontiers in Handwriting Recognition (2012)
3. Peng, X., Setlur, S., Govindaraju, V., Sitaram, R.: Handwritten text separation from annotated machine printed documents using Markov random fields. IJDAR 16(1), 1–16 (2013)
4. Wahl, R., Wong, K., Casey, R.: Block Segmentation and Text Extraction in Mixed Text/Image Documents. IBM Research Lab, San Jose, California, Research Report RJ3356 (40312) (December 1981)
5. Zheng, Y., Li, H., Doermann, D.: Machine printed text and handwriting identification in noisy document images. University of Maryland, College Park, Technical Report (September 2003)
6. Shirdhonkar, M., Kokare, M.B.: Discrimination between printed and handwritten text in documents. IJCA 3, 131–134 (2010). Special Issue on RTIPPR
7. Bilane, P., Bres, S., Emptoz, H.: Robust directional features for wordspotting in degraded Syriac manuscripts. In: International Workshop on Content-Based Multimedia Indexing, CBMI 2008, pp. 526–533 (June 2008)
8. Berlemont, S., Aaron, B., Cloppet, F., Olivo-Marin, J.-C.: Detection of linear structures in biological images. In: Conference Record of the Forty-First Asilomar, Signals, Systems and Computers 2007, pp. 1279–1283 (November 2007)
9. Siddiqi, I., Vincent, N.: Text independent writer recognition using redundant writing patterns with contour-based orientation and curvature features. Pattern Recognition 43(11), 3853–3865 (2010)
10. Wall, K., Danielsson, P.-E.: A fast sequential method for polygonal approximation of digitized curves. Computer Vision Graphics and Image Processing 28(3), 220–227 (1984)

Author information

Authors and Affiliations

LIPADE, University of Paris Descartes, 45 rue des Saint-Pères, 75006, Paris, France
Sameh Hamrouni, Florence Cloppet & Nicole Vincent

Corresponding author

Correspondence to Sameh Hamrouni.

Editor information

Editors and Affiliations

Faculty of Engineering, University of Porto, Porto, Portugal
Aurélio Campilho
Dept. of Electrical and Computer Eng., University of Waterloo, Waterloo, Ontario, Canada
Mohamed Kamel

About this paper

Cite this paper

Hamrouni, S., Cloppet, F., Vincent, N. (2014). Handwritten and Printed Text Separation: Linearity and Regularity Assessment. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2014. Lecture Notes in Computer Science, vol 8814. Springer, Cham. https://doi.org/10.1007/978-3-319-11758-4_42

DOI: https://doi.org/10.1007/978-3-319-11758-4_42
Published: 10 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11757-7
Online ISBN: 978-3-319-11758-4
eBook Packages: Computer Science, Computer Science (R0)
