Advertisement

Case Study: Retrieving Scanned Library Cards

  • Peter Schäuble
Chapter
Part of the The Springer International Series in Engineering and Computer Science book series (SECS, volume 397)

Abstract

In this chapter we present a case study where the speech retrieval approach described in is applied to retrieve scanned library cards. Some concepts of are repeated in this chapter such that it is self-contained. The library catalogue of this case study is the Alphabetischer Zentralkatalog of the Zentralbibliothek Zürich. It contains many old, and sometimes even handwritten cards. As a consequence, the Optical Character Recognition (OCR) produces many recognition errors. Since library cards are rather short documents, we cannot expect that the effects of recognition errors cancel out (Section 5.6). Fortunately, we have an additional source of information, i.e. the alphabetic ordering of the cards. We will show how retrieval is improved by taking advantage of this ordering information.

Keywords

Optical Character Recognition Document Frequency Recognition Error Index Card Word Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Note

  1. 1.
    After this feasibility study the entire AZK was converted, i.e. all 2.2 million cards were scanned.Google Scholar

Copyright information

© Springer Science+Business Media New York 1997

Authors and Affiliations

  • Peter Schäuble
    • 1
  1. 1.Swiss Federal Institute of Technology (ETH)ZurichSwitzerland

Personalised recommendations