Evaluating a Hidden Markov Model Of Syntax In A Text Recognition System

Hanlon, Stephen; Boyle, Roger

doi:10.1007/978-1-4471-3201-1_48

Stephen Hanlon² &
Roger Boyle²

182 Accesses
1 Citations

Abstract

Recognition of text by whole word shapes generates a set of candidate words for each printed word. A Hidden Markov Model (HMM) of syntax may be used to find the most probable sequence of syntactic tags for a sentence given the sequence of candidate sets. Candidate sets are then reduced by removing all words which are not associated with the chosen tag. We show that the tagging performance of the HMM does not deteriorate despite an increasing proportion of mis-classified words. We also show that using the model significantly reduces the number of candidates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R D Boyle and R C Thomas. Interpretation of cursive script at the word level. Technical report, School of Computer Studies, University of Leeds, June 1990.
Google Scholar
S A Dudani, K J Breeding, and R B Mcghee. Aircraft identification by moment invariants. IEEE Transactions on Computers ,26:39–45, 1977.
Article Google Scholar
G D Forney. The Viterbi algorithm. Proceedings of the IEEE ,61:268–278, March 1973.
Article MathSciNet Google Scholar
S J Hanlon and R D Boyle. Syntactic knowledge in word level text recognition. In R Beale and J Finlay, editors, Neural Networks and Pattern Recognition in Human-Computer Interaction. Ellis Horwood, 1992.
Google Scholar
J Hull. Incorporation of a Markov Model of language syntax in a text recognition algorithm. In Symposium oh Document Analysis and Information Retrieval ,University of Nevada, Las Vegas. 16th -18th March, 1992.
Google Scholar
J Hull. A computational theory of visual word recognition. Report number 88–07, University of NY at Buffalo, February 1988.
Google Scholar
S Johansson, E Atwell, R Garside, and G Leech. The tagged LOB corpus. Norwegian Computing Centre for the Humanities, Bergen, 1986.
Google Scholar
F G Keenan, L J Evett, and R J Whitrow. A large vocabulary stochastic syntax analyser for handwriting recognition. In First International Conference on Document Analysis and Recognition ,Saint-Malo, France. September 30 -October 2, 1991.
Google Scholar
A Kundu, Y He, and P Bahl. Recognition of handwritten word : First and second order Hidden Markov Model based approach. Pattern Recognition ,22(3):283–297, 1989.
Article Google Scholar
M A O’Hair and M Kabrinsky. Beyond the OCR: reading whole words as single symbols based on the two dimensional, low frequency Fourier transform. In First International Conference on Document Analysis and Recognition ,Saint-Malo, France. September 30 -October 2, 1991.
Google Scholar
T G Rose, L J Evett, and R J Whitrow. The use of semantic information as an aid to handwriting recognition. In First International Conference on Document Analysis and Recognition ,Saint-Malo, France. September 30 October 2, 1991.
Google Scholar
M Shridhar and A Badreldin. High accuracy character recognition algorithm using Fourier and topological descriptors. Pattern Recognition ,17:515–524, 1984.
Article MATH Google Scholar
Q Zhang and R Boyle. A new clustering algorithm with multiple runs of iterative procedures. Pattern Recognition ,24(9):835–848, 1991.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Division of Artificial Intelligence, School of Computer Studies, The University of Leeds, UK
Stephen Hanlon & Roger Boyle

Authors

Stephen Hanlon
View author publications
You can also search for this author in PubMed Google Scholar
Roger Boyle
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Studies, University of Leeds, Leeds, LS2 9JT, UK
David Hogg BSc, MSc, DPhil & Roger Boyle BA, PhD &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hanlon, S., Boyle, R. (1992). Evaluating a Hidden Markov Model Of Syntax In A Text Recognition System. In: Hogg, D., Boyle, R. (eds) BMVC92. Springer, London. https://doi.org/10.1007/978-1-4471-3201-1_48

Download citation

DOI: https://doi.org/10.1007/978-1-4471-3201-1_48
Publisher Name: Springer, London
Print ISBN: 978-3-540-19777-5
Online ISBN: 978-1-4471-3201-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics