Abstract
Despite the expansion of electronic data processing, paper remains the most popular medium for the display, storage, and transmission of information for individuals and organisations. With growing office automation, the paper-computer interface becomes increasingly important. To be useful, this interface must be able to handle documents containing text as well as graphics and convert them into a standardized electronic representation.
In this paper we describe SODA (System for Office Document Analysis), a prototypical system for the analysis and interpretation of paper documents. It applies knowledge-based image analysis to the scanned raster images of the documents and is able, for example, to extract the key elements of a business letter, such as its sender, date, and reference. The internal computer representation of the recognized document takes into account the standardized Office Document Architecture (ODA), so that a description of the layout and logical structure can be generated for a large variety of documents.
Copyright information
© 1990 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kreich, J., Luhn, A., Maderlechner, G. (1990). Document Image Understanding. In: Schwärtzel, H., Mizin, I.A. (eds) Advanced Information Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-93464-3_15
DOI: https://doi.org/10.1007/978-3-642-93464-3_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-52683-4
Online ISBN: 978-3-642-93464-3