Document Image Understanding

Kreich, Joachim; Luhn, Achim; Maderlechner, Gerd

doi:10.1007/978-3-642-93464-3_15


Abstract

Despite the expansion of electronic data processing, paper remains the most popular medium for the display, storage and transmission of information for persons and organisations. With growing office automation, the paper-computer interface becomes increasingly important. To be useful, this interface must be able to handle documents containing text as well as graphics, and convert them into a standardized electronic representation.

In this paper we describe a prototype system for the analysis and interpretation of paper documents, SODA (System for Office Document Analysis), which applies knowledge-based image analysis to the scanned raster images of the documents. The system is able, for example, to extract the key elements of a business letter, such as its sender, date, and reference. The internal computer representation of the recognized document takes into account the standardized Office Document Architecture (ODA), so that a description of the layout and logical structure can be generated for a large variety of documents.
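To make the distinction between layout structure and logical structure concrete, the following minimal Python sketch may help. It is not SODA's method and does not follow the ISO 8613 schema: the class names `LayoutBlock` and `LogicalObject`, the function `label_blocks`, and the toy rules and patterns are all assumptions introduced here for illustration. Layout blocks carry position and text; logical objects attach a role such as sender, date, or reference to a block.

```python
import re
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class LayoutBlock:
    """A text block as layout analysis might deliver it (hypothetical)."""
    x: int
    y: int          # top edge; a smaller value is higher on the page
    width: int
    height: int
    text: str


@dataclass
class LogicalObject:
    """A logical element linked to the layout block it was found in."""
    role: str       # e.g. "sender", "date", "reference"
    block: LayoutBlock


# Illustrative patterns only; real letters need far richer knowledge.
DATE_RE = re.compile(r"\b\d{1,2}[./]\d{1,2}[./]\d{2,4}\b")
REF_RE = re.compile(r"^(ref|our ref|your ref)\b", re.IGNORECASE)


def label_blocks(blocks: List[LayoutBlock]) -> Dict[str, LogicalObject]:
    """Assign logical roles to layout blocks using toy rules."""
    logical: Dict[str, LogicalObject] = {}
    # Visit blocks from the top of the page to the bottom.
    for block in sorted(blocks, key=lambda b: b.y):
        text = block.text.strip()
        if "reference" not in logical and REF_RE.search(text):
            logical["reference"] = LogicalObject("reference", block)
        elif "date" not in logical and DATE_RE.search(text):
            logical["date"] = LogicalObject("date", block)
        elif "sender" not in logical:
            # Toy assumption: the topmost unlabelled block is the sender.
            logical["sender"] = LogicalObject("sender", block)
    return logical
```

Keeping a reference from each logical object back to its layout block is one simple way to hold both descriptions of the same document side by side; a knowledge-based system would replace the fixed rules with a model of the document class.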

Author information

Authors and Affiliations

Siemens AG, Germany
Joachim Kreich, Achim Luhn & Gerd Maderlechner

Editor information

Editors and Affiliations

Corporate Research and Development, Applied Computer Science and Software, Siemens AG, Munich, Germany
Heinz Schwärtzel
Institute of Informatics Problems, Academy of Sciences of the USSR, Moscow, USSR
Igor A. Mizin (Corresponding Member of the Academy of Sciences of the USSR)

About this paper

Cite this paper

Kreich, J., Luhn, A., Maderlechner, G. (1990). Document Image Understanding. In: Schwärtzel, H., Mizin, I.A. (eds) Advanced Information Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-93464-3_15

DOI: https://doi.org/10.1007/978-3-642-93464-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-52683-4
Online ISBN: 978-3-642-93464-3
eBook Packages: Springer Book Archive
