Unsupervised learning of character prototypes

Leroy, Annick

doi:10.1007/3-540-63791-5_18

Annick Leroy¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1339))

Included in the following conference series:

Brazilian Symposium on Document Image Analysis

109 Accesses

Abstract

In the framework of handwritten word recognition, the use of characters extracted from words instead of written in isolation, is essential to train recognizers. We propose a segmentation method which relies on anchor points such as the ascenders or the descenders, but also on certain kinds of loops. We do not use a manually segmented prototype set to initialize our incremental learning process, but instead we use an a priori knowledge about the alphabet characters. This knowledge is introduced as the encoding of the descending movements of the pen and of loops. From a set of words written by the same writer, we evaluate the different possible segmentations for each word and use the ones superior to a certain threshold. In the beginning this threshold is rather high. At each step of the segmentation of the words, its value decreases in order to register new prototypes. The characters already accepted are used in the next steps. The confidence rate is maximum for 3 steps.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

C.J.C. Burges and al. Shortest path segmentation: A method for training a neural network to recognize character strings. In IJCNN, pages 165–172, vol 3, Cambridge, Ma, 1992.
Google Scholar
L. Duneau and B. Dorizzi. Incremental building of an allograph lexicon. In Advances in handwriting & drawing: a multidisciplinary approach, pages 39–63. C. Faure. Reuss P., Lorette G. and Vinter A., 1994.
Google Scholar
Annick Leroy. Progressive lexicon reduction for on-line handwriting. In IWFHR 5, Colchester, UK, 1996.
Google Scholar
H.L. Teulings and L. Schomaker. Unsupervised learning of prototype allographs in cursive script recognition. In From Pixels to Features III, pages 61–73. S. Impedovo and J.C. Simon, 1992.
Google Scholar

Download references

Author information

Authors and Affiliations

IRISA, Campus de Beaulieu, 35042, Rennes cedex
Annick Leroy

Authors

Annick Leroy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Nabeel A. Murshed Flávio Bortolozzi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Leroy, A. (1997). Unsupervised learning of character prototypes. In: Murshed, N.A., Bortolozzi, F. (eds) Advances in Document Image Analysis. BSDIA 1997. Lecture Notes in Computer Science, vol 1339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63791-5_18

Download citation

DOI: https://doi.org/10.1007/3-540-63791-5_18
Published: 02 August 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63791-2
Online ISBN: 978-3-540-69646-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics