Automatic Digital Document Processing and Management

Problems, Algorithms and Techniques

  • Stefano¬†Ferilli

Part of the Advances in Pattern Recognition book series (ACVPR)

Table of contents

  1. Front Matter
    Pages I-XXVI
  2. Digital Documents

    1. Front Matter
      Pages 1-2
    2. Stefano Ferilli
      Pages 3-13
    3. Stefano Ferilli
      Pages 15-71
    4. Stefano Ferilli
      Pages 73-109
  3. Document Analysis

    1. Front Matter
      Pages 111-112
    2. Stefano Ferilli
      Pages 113-143
    3. Stefano Ferilli
      Pages 145-196
  4. Content Processing

    1. Front Matter
      Pages 197-198
    2. Stefano Ferilli
      Pages 199-222
    3. Stefano Ferilli
      Pages 223-255
  5. Back Matter
    Pages 257-297

About this book


Computer-readable documents have become ubiquitous in everyday life - from legacy documents that have been digitized, to new documents that have been created electronically. As the number of electronic documents continues to grow, so does the importance of digital methods for processing and managing these documents.

This comprehensive text/reference provides a broad review of the issues involved in handling and processing digital documents. Examining the full range of a document's lifetime, the book covers acquisition, representation, security, pre-processing, layout analysis, understanding, analysis of single components, information extraction, filing, indexing and retrieval. A background knowledge of the area is not required, beyond familiarity with basic concepts of computer science and mathematics; deeper technical content is provided in discrete subsections that are not essential for an understanding of other parts of the book.

Topics and features:

  • With a Foreword by Professor George Nagy of Rensselaer Polytechnic Institute, New York, USA
  • Provides a list of acronyms and a glossary of technical terms
  • Contains appendices covering key concepts in machine learning, and providing a case study on building an intelligent system for digital document and library management
  • Discusses issues of security, and legal aspects of digital documents
  • Examines core issues of document image analysis, and image processing techniques of particular relevance to digitized documents
  • Reviews the resources available for natural language processing, in addition to techniques of linguistic analysis for content handling
  • Investigates methods for extracting and retrieving data/information from a document, including representation at a semantic level

Undergraduate and graduate students will find the text a valuable general reference on the subject, and researchers will discover how their specific area of interest is interrelated with other disciplines involved in digital document processing. The book also supplies a repertoire of potential technological solutions for professionals working on digital documents.

Dr. Stefano Ferilli is an associate professor at the University of Bari, Italy, where he is Director of the Interdepartmental Center for Logic and Applications.

Authors and affiliations

  • Stefano¬†Ferilli
    • 1
  1. 1.Dipto. InformaticaUniversità BariBariItaly

Bibliographic information

  • DOI
  • Copyright Information Springer-Verlag London Limited 2011
  • Publisher Name Springer, London
  • eBook Packages Computer Science
  • Print ISBN 978-0-85729-197-4
  • Online ISBN 978-0-85729-198-1
  • Series Print ISSN 2191-6586
  • Buy this book on publisher's site
Industry Sectors
Chemical Manufacturing
Health & Hospitals
IT & Software
Consumer Packaged Goods