Abstract
Handwriting recognition requires a prior segmentation of text lines which is a challenging task, especially for historical scripts. Exemplary for the date in entries of historical church registers, we present an approach which enables a segmentation by using additional knowledge about the word sequence. The algorithm is based on probability distribution curves and a neural network, which assesses local features of potential word boundaries. Our database consists of 298 different date entries from the 18th and 19th century which contain 674 word boundaries. The algorithm generates hypotheses for the expected date type, ordered by their probability. Tests resulted in an accuracy of 97% for the best four hypotheses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Feldbach and K. D. Tönnies. Line Detection and Segmentation in Historical Church Registers. In Sixth International Conference on Document Analysis and Recognition, pages 743–747, Seattle, USA, September 2001. IEEE Computer Society.
M. Feldbach and K. D. Tönnies. Robust Line Detection in Historical Church Registers. In Pattern Recognition, 23rd DAGM Symposium, pages 140–147, Munich, Germany, September 2001. Springer-Verlag.
D. Kazakov and S. Manandhar. A hybrid approach to word segmentation. In D. Page, editor, Proceedings of the 8th International Conference on Inductive Logic Programming, volume 1446, pages 125–134. Springer-Verlag, 1998.
G. Kim and V. Govindaraju. Handwritten Phrase Recognition as Applied to Street Name Images. Pattern Recognition, 31(1):41–51, January 1998.
S. H. Kim, S. Jeong, G.-S. Lee, and C.Y. Suen. Word Segmentation in Handwritten Korean Text Lines Based on Gap Clustering Techniques. In Sixth International Conference on Document Analysis and Recognition — ICDAR 2001, pages 189–193. IEEE Computer Society, September 2001.
H. Kruse, R. Mangold, B. Mechler, and O. Pengler. Programmierung Neuronaler Netze: Eine Turbo Pascal Toolbox. Addison-Wesley, 1991.
U. Mahadevan and R. C. Nagabushnam. Gap Metrics for Word Separation in Handwritten Lines. In International Conference on Document Analysis and Recognition, pages 124–127, Montreal, Canada, 1995.
R. Manmatha and N. Srimal. Scale space technique for word segmentation in handwritten documents. In Scale-Space Theories in Computer Vision, pages 22–33, 1999.
U. Marti and H. Bunke. Text line segmentation and word recognition in a system for general writer independent handwriting recognition. In Sixth International Conference on Document Analysis and Recognition, pages 159–163, Seattle, USA, September 2001. IEEE Computer Society.
G. Seni and E. Cohen. External word segmentation of off-line handwritten text lines. Pattern Recognition, 27(1):41–52, January 1994.
A. Vinciarelli and J. Luettin. A new normalization technique for cursive handwritten words. Pattern Recognition Letters, 22(9):1043–1050, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Feldbach, M., Tönnies, K.D. (2002). Segmentation of the Date in Entries of Historical Church Registers. In: Van Gool, L. (eds) Pattern Recognition. DAGM 2002. Lecture Notes in Computer Science, vol 2449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45783-6_49
Download citation
DOI: https://doi.org/10.1007/3-540-45783-6_49
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44209-7
Online ISBN: 978-3-540-45783-1
eBook Packages: Springer Book Archive