GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings
- 515 Downloads
Understanding of graphic objects has become a problem of pertinence in today’s context of digital documentation and document digitization, since graphic information in a document image may be present in several forms, such as engineering drawings, architectural plans, musical scores, tables, charts, extended objects, hand-drawn sketches, etc. There exist quite a few approaches for segmentation of graphics from text, and also a separate set of techniques for recognizing a graphics and its characteristic features. This paper introduces a novel geometric algorithm that performs the task of segmenting out all the graphic objects in a document image and subsequently also works as a high-level tool to classify various graphic types. Given a document image, it performs the text-graphics segmentation by analyzing the geometric features of the minimum-area isothetic polygonal covers of all the objects for varying grid spacing, g. As the shape and size of a polygonal cover depends on g, and each isothetic polygon is represented by an ordered sequence of its vertices, the spatial relationship of the polygons corresponding to a higher grid spacing with those corresponding to a lower spacing, is used for graphics segmentation and subsequent classification. Experimental results demonstrate its efficiency, elegance, and versatility.
KeywordsGrid Spacing Document Image Graphic Type Graphic Object Extended Object
Unable to display preview. Download preview PDF.
- 1.Antonacopoulos, A., Ritchings, R.T.: Representation and classification of complex-shaped printed regions using white tiles. In: Proc. ICDAR 1995, pp. 1132–1134 (1995)Google Scholar
- 2.Biswas, A., Bhowmick, P., Bhattacharya, B.B.: Construction of isothetic covers of a digital object: A combinatorial approach. JVCIR (in press, 2010)Google Scholar
- 4.Futrelle, R.P., et al.: Extraction, layout analysis and classification of diagrams in PDF documents. In: ICDAR 2003, pp. 1007–1014 (2003)Google Scholar
- 5.Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison-Wesley, California (1993)Google Scholar
- 6.Haralick, R.M.: Document image understanding: Geometric and logical layout. In: Proc. CVPR, pp. 385–390 (1994)Google Scholar
- 7.Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: Evaluating the performance of table processing algorithms 4(3), 140–153 (2002)Google Scholar
- 9.Kopec, G.E., Chou, P.A.: Document image decoding using Markov source models. IEEE TPAMI 16(6), 602–617 (1994)Google Scholar
- 13.Song, J., et al.: An object-oriented progresssive-simplification-based vectorization system for engineering drawings: Model, algorithm, and performance. IEEE TPAMI 24(8), 1048–1060 (2002)Google Scholar
- 16.Wenyin, L.: On-line graphics recognition: State-of-the-art. In: Lladós, J., Kwon, Y.-B. (eds.) GREC 2003. LNCS, vol. 3088, pp. 291–304. Springer, Heidelberg (2004)Google Scholar
- 18.Zanibbi, R., Blostein, D., Cordy, J.R.: Recognizing mathematical expressions using tree transformation. IEEE TPAMI 24(11), 1455–1467 (2002)Google Scholar