Agent-Based Text Extraction from Pyramid Images
A system using multiple agents working on a pyramid structure to do text extraction is described in this paper. The method is based on the observation that text strings appear as different groupings of connected components at appropriate resolutions. The pyramid structure, which is a multi-resolution image representation, is amenable to parallel processing for detection of text strings. Agents in the system individually and concurrently look for groups of connected components at appropriate levels. They may in turn spawn new agents when connected components become disjointed at finer resolution levels. The agent-based pyramidal operations do not require expensive feature analysis among different connected components to detect text strings as found in other existing works.
KeywordsMultiple Agent Full Resolution Pyramid Structure Text Character Graphic Object
Unable to display preview. Download preview PDF.
- Olivier D and Dominique B. A robust and multiscale document image segmentation for block line/text line structures extraction. Twelfth International Conference on Pattern Recognition, Jerusalem, 1994, pp 306–309.Google Scholar
- He S, Abe N and Tan CL. A clustering-based approach to the separation of text strings from mixed text/graphics documents. Thirteenth International Conference on Pattern Recognition, Austria, 25–29, August 1996, pp 706–710.Google Scholar
- Hase H, Shinokawa T, Yoneda M, Sakai M and Maruyama H. Character string extraction by multi-stage relaxation. Fourth International Conference on Document Analysis and Recognition, 18–20 August 1997, pp 298–302.Google Scholar