Abstract
Representing images by bag of visual codes (BoVC) features has been the cornerstone of state-of-the-art image classification system. Since the BoVC features depend on a precomputed codebook in use, when the codebook applied to test images differs from the codebook of an existing image classification system, the system becomes inapplicable. To resolve the codebook incompatibility problem, we propose in this paper cross-codebook image classification. This is achieved by transforming BoVC features derived from one codebook to make them compatible with another codebook. Two BoVC transform methods, i.e., code-reassignment and least squares, are studied. Experiments on a popular image classification benchmark set show that both methods are better than random guess when crossing the codebooks. In particular, when the BoVC features are transformed from a higher dimension to a relatively small dimension, cross-codebook image classification has a similar performance compared to within-codebook image classification, with a relative performance loss of 1.3% only. The results justify the feasibility of the proposed cross-codebook image classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Perronnin, F., Akata, Z., Harchaoui, Z., Schmid, C.: Towards good practice in large-scale learning for image classification. In: CVPR, pp. 3482–3489 (2012)
Jiang, Y.G., Yang, J., Ngo, C.W., Hauptmann, A.: Representations of keypoint-based semantic concept detection: A comprehensive study. IEEE Transactions on Multimedia 12(1), 42–53 (2010)
van de Sande, K., Gevers, T., Snoek, C.: Empowering visual categorization with the gpu. IEEE Transactions on Multimedia 13(1), 60–70 (2011)
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV Workshop on Statistical Learning in Computer Vision, vol. 1, p. 22 (2004)
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Chua, T.S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y., Zheng, Y.: Nus-wide: a real-world web image database from national university of singapore. In: CIVR, p. 48 (2009)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer (2001)
Nowak, S., Huiskes, M.: New strategies for image annotation: Overview of the photo annotation task at Image CLEF 2010. In: CLEF (2010)
Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)
Li, X., Snoek, C., Worring, M., Koelma, D., Smeulders, A.: Bootstrapping visual categorization with relevant negatives. IEEE Transactions on Multimedia 15(4), 933–945 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Liao, S., Li, X., Du, X. (2013). Cross-Codebook Image Classification. In: Huet, B., Ngo, CW., Tang, J., Zhou, ZH., Hauptmann, A.G., Yan, S. (eds) Advances in Multimedia Information Processing – PCM 2013. PCM 2013. Lecture Notes in Computer Science, vol 8294. Springer, Cham. https://doi.org/10.1007/978-3-319-03731-8_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-03731-8_46
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03730-1
Online ISBN: 978-3-319-03731-8
eBook Packages: Computer ScienceComputer Science (R0)