Efficient OCR Post-Processing Combining Language, Hypothesis and Error Models

Llobet, Rafael; Navarro-Cerdan, J. Ramon; Perez-Cortes, Juan-Carlos; Arlandis, Joaquim

doi:10.1007/978-3-642-14980-1_72

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6218)

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

Abstract

In this paper, an OCR post-processing method that combines a language model, OCR hypothesis information and an error model is proposed. The approach can be seen as a flexible and efficient way to perform Stochastic Error-Correcting Language Modeling. We use Weighted Finite-State Transducers (WFSTs) to represent the language model, the complete set of OCR hypotheses interpreted as a sequence of vectors of a posteriori class probabilities, and an error model with symbol substitutions, insertions and deletions. This approach combines the practical advantages of a decoupled (OCR + post-processor) model with the error-recovery power of an integrated model.
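To make the combination of the three models concrete, the following is a minimal sketch and not the authors' implementation. It assumes costs are negative log probabilities, replaces the WFST language model with a small weighted lexicon, and uses a dynamic program in place of transducer composition and shortest-path search. The function name `best_correction`, the cost parameters and all numeric values are hypothetical.

```python
import math


def best_correction(hypotheses, lexicon,
                    sub_cost=2.0, spurious_cost=3.0, missing_cost=3.0):
    """Return (word, cost) minimizing language + hypothesis + error costs.

    hypotheses: list of dicts, one per OCR position, mapping symbol -> posterior.
    lexicon: dict mapping word -> prior probability (stand-in language model).
    sub_cost / spurious_cost / missing_cost: hypothetical error-model costs for
    a substituted symbol, an extra OCR symbol, and a symbol the OCR missed.
    """
    best_word, best_cost = None, math.inf
    n = len(hypotheses)
    for word, prior in lexicon.items():
        m = len(word)
        # D[i][j]: cheapest way to explain the first i OCR positions
        # with the first j characters of the candidate word.
        D = [[math.inf] * (m + 1) for _ in range(n + 1)]
        D[0][0] = 0.0
        for i in range(n + 1):
            for j in range(m + 1):
                if i == 0 and j == 0:
                    continue
                candidates = []
                if i > 0:
                    vec = hypotheses[i - 1]
                    # The OCR emitted a symbol that has no counterpart in the word.
                    cheapest = min(-math.log(p) for p in vec.values())
                    candidates.append(D[i - 1][j] + cheapest + spurious_cost)
                    if j > 0:
                        target = word[j - 1]
                        # Read symbol a at position i, then match or substitute it.
                        step = min(
                            -math.log(p) + (0.0 if a == target else sub_cost)
                            for a, p in vec.items()
                        )
                        candidates.append(D[i - 1][j - 1] + step)
                if j > 0:
                    # The word has a character that the OCR missed.
                    candidates.append(D[i][j - 1] + missing_cost)
                D[i][j] = min(candidates)
        total = -math.log(prior) + D[n][m]
        if total < best_cost:
            best_word, best_cost = word, total
    return best_word, best_cost


# Toy input: the top-1 OCR reading is "cot", but "a" is a close second.
hypotheses = [
    {"c": 0.9, "e": 0.1},
    {"o": 0.6, "a": 0.4},
    {"t": 0.8, "l": 0.2},
]
lexicon = {"cat": 0.5, "cot": 0.05, "coat": 0.45}

word, cost = best_correction(hypotheses, lexicon)
print(word, round(cost, 3))
```

With these toy numbers the sketch selects "cat" even though the top-ranked OCR symbols spell "cot": the full vector of posteriors keeps the second-best symbol available, and the language prior makes it the cheaper overall explanation.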

Work partially supported by the Spanish MICINN grants TIN2009-14205-C04-02 and Consolider Ingenio 2010: MIPRCV (CSD2007-00018) and by IMPIVA and the E.U. by means of the ERDF in the context of the R+D Program for Technological Institutes of IMPIVA network for 2010 (IMIDIC_2009/204).

Author information

Authors and Affiliations

Instituto Tecnologico de Informatica, Universidad Politecnica de Valencia, Camino de Vera s/n, 46071, Valencia, Spain
Rafael Llobet, J. Ramon Navarro-Cerdan, Juan-Carlos Perez-Cortes & Joaquim Arlandis

Editor information

Editors and Affiliations

Computer Vision and Pattern Recognition Group, Computer Science, University of York, Heslington, YO10-5DD, York, United Kingdom
Edwin R. Hancock
Department of Computer Science, University of York, YO10 5DD, UK
Richard C. Wilson
Centre for Vision, Speech and Signal Proc (CVSSP), University of Surrey, Guildford, GU2 7XH, Surrey, United Kingdom
Terry Windeatt
Electrical and Electronics Engineering Department, Middle East Technical University, 06531, Ankara, Turkey
Ilkay Ulusoy
Department of Computer Science and Artificial Intelligence, University of Alicante, P.O.B. 99, E-03080, Alicante, Spain
Francisco Escolano

Cite this paper

Llobet, R., Navarro-Cerdan, J.R., Perez-Cortes, JC., Arlandis, J. (2010). Efficient OCR Post-Processing Combining Language, Hypothesis and Error Models. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2010. Lecture Notes in Computer Science, vol 6218. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14980-1_72

DOI: https://doi.org/10.1007/978-3-642-14980-1_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14979-5
Online ISBN: 978-3-642-14980-1
eBook Packages: Computer Science, Computer Science (R0)
