Quality Assurance for Document Image Collections in Digital Preservation

Huber-Mörk, Reinhold; Schindler, Alexander

doi:10.1007/978-3-642-33140-4_10

Reinhold Huber-Mörk²¹ &
Alexander Schindler^21,22

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7517))

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

1389 Accesses
7 Citations

Abstract

Maintenance of digital image libraries requires to frequently asses the quality of the images to engage preservation measures if necessary. We present an approach to image based quality assurance for digital image collections based on local descriptor matching. We use spatially distinctive local keypoints of contrast enhanced images and robust symmetric descriptor matching to calculate affine transformations for image registration. Structural similarity of aligned images is used for quality assessment. The results show, that our approach can efficiently asses the quality of digitized documents including images of blank paper.

This work was partially supported by the SCAPE Project. The SCAPE project is co-funded by the European Union under FP7 ICT-2009.4.1 (Grant Agreement number 270137).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: SURF: Speeded up robust features. Computer Vision and Image Understanding (CVIU) 110(3), 346–359 (2008)
Article Google Scholar
van Beusekom, J., Keysers, D., Shafait, F., Breuel, T.: Distance measures for layout-based document image retrieval. In: Second International Conference on Document Image Analysis for Libraries, DIAL 2006, pp. 231–242 (April 2006)
Google Scholar
van Beusekom, J., Shafait, F., Breuel, T.: Image-Matching for Revision Detection in Printed Historical Documents. In: Hamprecht, F.A., Schnörr, C., Jähne, B. (eds.) DAGM 2007. LNCS, vol. 4713, pp. 507–516. Springer, Heidelberg (2007)
Chapter Google Scholar
Breuel, T.: Fast recognition using adaptive subdivisions of transformation space. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Proceedings, CVPR 1992, pp. 445–451 (June 1992)
Google Scholar
Brown, M., Szeliski, R., Winder, S.: Multi-image matching using multi-scale oriented patches. In: Proc. of Conf. on Comput. Vis. and Pat. Rec., San Diego, pp. 510–517 (June 2005)
Google Scholar
Chaudhury, K., Jain, A., Thirthala, S., Sahasranaman, V., Saxena, S., Mahalingam, S.: Google newspaper search - image processing and analysis pipeline. In: 10th International Conference on Document Analysis and Recognition, ICDAR 2009, pp. 621–625 (July 2009)
Google Scholar
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV 2004, pp. 1–22 (2004)
Google Scholar
Doermann, D., Li, H., Kia, O.: The detection of duplicates in document image databases. Image and Vision Computing 16(12-13), 907–920 (1998)
Article Google Scholar
Ferrari, V., Tuytelaars, T., Gool, L.V.: Simultaneous object recognition and segmentation from single or multiple model views. Intl. J. of Comp. Vis. 67(2), 159–188 (2006)
Article Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)
Article MathSciNet Google Scholar
Gabarda, S., Cristóbal, G.: Blind image quality assessment through anisotropy. J. Opt. Soc. Am. A 24(12), B42–B51 (2007)
Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proc. of ALVEY Vision Conf., pp. 147–152 (1988)
Google Scholar
Ke, Y., Sukthankar, R., Huston, L.: An efficient parts-based near-duplicate and sub-image retrieval system. In: Proceedings of the 12th Annual ACM International Conference on Multimedia, MULTIMEDIA 2004, pp. 869–876. ACM, New York (2004)
Chapter Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. of Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. on Pat. Anal. and Mach. Intel. 27(10), 1615–1630 (2005)
Article Google Scholar
Moorthy, A., Bovik, A.: Blind image quality assessment: From natural scene statistics to perceptual quality. IEEE Transactions on Image Processing 20(12), 3350–3364 (2011)
Article MathSciNet Google Scholar
Pizer, S.M., Amburn, E.P., Austin, J.D., Cromartie, R., Geselowitz, A., Greer, T., Romeny, B.T.H., Zimmerman, J.B., Zuiderveld, K.: Adaptive histogram equalization and its variations. Computer Vision, Graphics, and Image Processing 39 (1987)
Google Scholar
Ramachandrula, S., Joshi, G., Noushath, S., Parikh, P., Gupta, V.: Paperdiff: A script independent automatic method for finding the text differences between two document images. In: The Eighth IAPR International Workshop on Document Analysis Systems, DAS 2008, pp. 585–590 (September 2008)
Google Scholar
Rosten, E., Drummond, T.W.: Machine Learning for High-Speed Corner Detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Chapter Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: An efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, Barcelona (November 2011)
Google Scholar
Wang, Z., Bovik, A.: A universal image quality index. IEEE Signal Processing Letters 9(3), 81–84 (2002)
Article Google Scholar
Wang, Z., Bovik, A.: Mean squared error: Love it or leave it? A new look at signal fidelity measures. IEEE Signal Processing Magazine 26(1), 98–117 (2009)
Article Google Scholar
Wang, Z., Bovik, A., Sheikh, H., Simoncelli, E.: Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing 13(4), 600–612 (2004)
Article Google Scholar
Wu, X., Zhao, W.L., Ngo, C.W.: Near-duplicate keyframe retrieval with visual keywords and semantic context. In: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007, pp. 162–169. ACM, New York (2007), http://doi.acm.org/10.1145/1282280.1282309
Google Scholar
Xu, D., Cham, T.J., Yan, S., Duan, L., Chang, S.F.: Near duplicate identification with spatially aligned pyramid matching. IEEE Transactions on Circuits and Systems for Video Technology 20(8), 1068–1079 (2010)
Article Google Scholar
Zhang, L., Zhang, L., Mou, X., Zhang, D.: FSIM: A feature similarity index for image quality assessment. IEEE Transactions on Image Processing 20(8), 2378–2386 (2011)
Article MathSciNet Google Scholar
Zhao, W.L., Ngo, C.W., Tan, H.K., Wu, X.: Near-duplicate keyframe identification with interest point matching and pattern learning. IEEE Transactions on Multimedia 9(5), 1037–1048 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Research Area Intelligent Vision Systems Department Safety & Security, Austrian Institute of Technology, Austria
Reinhold Huber-Mörk & Alexander Schindler
Department of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
Alexander Schindler

Authors

Reinhold Huber-Mörk
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Schindler
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

DGA, 7-9 rue des mathurins, 92 221, Bagneux, France
Jacques Blanc-Talon
Telecommunications and Information processing (TELIN), Ghent University, St.-Pietersnieuwstraat 41, 9000, Ghent, Belgium
Wilfried Philips
CSIRO ICT Centre, Epping, Po Box 76, 1710, Sydney, NSW, Australia
Dan Popescu
University of Antwerp, Universiteitsplein 1, Building N. 2610,, Wilrijk, Belgium
Paul Scheunders
Faculty of Information Technology, Brno University of Technology, 61266, Brno, Czech Republic
Pavel Zemčík

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huber-Mörk, R., Schindler, A. (2012). Quality Assurance for Document Image Collections in Digital Preservation. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P., Zemčík, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2012. Lecture Notes in Computer Science, vol 7517. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33140-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-33140-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33139-8
Online ISBN: 978-3-642-33140-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics