Data extraction; Screen scraping; Screen wrapper
A screen scraper is a program which extracts relevant data from the visual user interface of an application. Input data are commonly represented using text-only or graphically enhanced tables, lists, and forms, tailored to a human audience. Scraping is the task of collecting data from its presentation, not directly from its source for lack of access. The scraper output has a structured and machine-readable format, where extracted data are usually annotated with its semantics (metadata), suitable for automatic post-processing. The process can be thought of as reverse-engineering a data store from its presentation, abstracting content from layout. Using this approach, application data are taken from the human-oriented screen output rather than the application’s hidden proprietary data structures.
Traditionally, screen scrapers have been used to interface legacy systems residing on old mainframes, which often...
- 1.Hassan T, Baumgartner R. Intelligent text extraction from PDF documents. In: Proceedings of the International Conference on Intelligent Agents, Web Technologies and Internet Commerce; 2005. p. 2–6.Google Scholar