Abstract
In this chapter, we present a method for spotting symbols in document images by using a photometric description of symbols. As a running example we present an application of logo spotting. The presented method uses a bag-of-words model in order to perform a categorization of document images such as invoices or receipts. The hypotheses validation is done in terms of spatial coherence by the use of a Hough-like voting scheme. Experiments which demonstrate the effectiveness of this system on a large set of real data are presented at the end of the chapter.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bagdanov, A., Ballan, L., Bertini, M., Bimbo, A.D.: Trademark matching and retrieval in sports video databases. In: Proceedings of the International Workshop on Multimedia Information Retrieval, pp. 79–86. ACM, New York (2007)
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(4), 509–522 (2002)
Califano, A., Mohan, R.: Multidimensional indexing for recognizing visual shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 16(4), 373–392 (1994)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the Alvey Vision Conference, pp. 147–151 (1988)
Ke, Y., Sukthankar, R.: PCA-SIFT: A more distinctive representation for local image descriptor. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 506–513. IEEE Computer Society, Los Alamitos (2004)
Klein, B., Agne, S., Dengel, A.: Results of a study on invoice-reading systems in Germany. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 451–462. Springer, Berlin (2004)
Lowe, D.: Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, pp. 1150–1157. IEEE Computer Society, Los Alamitos (1999)
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: Proceedings of the Eighth IEEE International Conference on Computer Vision, pp. 525–531. IEEE Computer Society, Los Alamitos (2001)
Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. International Journal of Computer Vision 60(1), 63–86 (2004)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10), 1615–1630 (2005)
Nakai, T., Kise, K., Iwamura, M.: Camera-based document image retrieval as voting for partial signatures of projective invariants. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 379–383. IEEE Computer Society, Los Alamitos (2005)
Nakai, T., Kise, K., Iwamura, M.: Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval. In: Document Analysis Systems VII, Lecture Notes on Computer Science, vol. 3872, pp. 541–552. Springer, Berlin (2006)
Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their localization in images. In: Proceedings of the Tenth IEEE International Conference on Computer Vision, pp. 370–377. IEEE Computer Society, Los Alamitos (2005)
Valveny, E., Dosch, P., Fornés, A., Escalera, S.: Report on the third contest on symbol recognition. In: Graphics Recognition. Recent Advances and New Opportunities, Lecture Notes on Computer Science, vol. 5046, pp. 321–328. Springer, Berlin (2008)
Viola, P., Rinker, J., Law, M.: Automatic fax routing. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 484–495. Springer, Berlin (2004)
Wei, C., Li, Y., Chau, W., Li, C.: Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recognition 42(3), 386–394 (2009)
Zhu, G., Doerman, D.: Automatic document logo detection. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition, pp. 864–868. IEEE Computer Society, Los Alamitos (2007)
Zhu, G., Doerman, D.: Logo matching for document image retrieval. In: Proceedings of the Tenth International Conference on Document Analysis and Recognition, pp. 606–610. IEEE Computer Society, Los Alamitos (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer-Verlag London Limited
About this chapter
Cite this chapter
Rusiñol, M., Lladós, J. (2010). Symbol Spotting for Document Categorization. In: Symbol Spotting in Digital Libraries. Springer, London. https://doi.org/10.1007/978-1-84996-208-7_3
Download citation
DOI: https://doi.org/10.1007/978-1-84996-208-7_3
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-84996-207-0
Online ISBN: 978-1-84996-208-7
eBook Packages: Computer ScienceComputer Science (R0)