Vector Quantization Enhancement for Computer Vision Tasks

Trichet, Remi; O’Connor, Noel E.

doi:10.1007/978-3-319-48680-2_35

Conference paper
First Online: 21 October 2016

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 10016)

Abstract

This paper augments the Bag-of-Words scheme in several respects: we incorporate category labels into the clustering process, build classifier-tailored codebooks, and weight codewords according to their probability of occurrence. A size-adaptive feature clustering algorithm is also proposed as an alternative to k-means. Experiments on the PASCAL VOC 2007 challenge validate the approach for classical hard-assignment as well as VLAD encoding.
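
For readers unfamiliar with the two encodings named in the abstract, the following is a minimal NumPy sketch of standard hard-assignment Bag-of-Words and standard VLAD, given an already trained codebook. It is not the authors' method: the function names are hypothetical, and the optional `weights` argument is only a placeholder for some per-codeword weighting, since the abstract does not specify how the paper computes its weights.

```python
import numpy as np

def nearest_codeword(descriptors, codebook):
    """Index of the closest codeword (squared Euclidean) for each descriptor."""
    diff = descriptors[:, None, :] - codebook[None, :, :]   # shape (n, k, d)
    return (diff ** 2).sum(axis=2).argmin(axis=1)           # shape (n,)

def bow_histogram(descriptors, codebook, weights=None):
    """Hard-assignment Bag-of-Words: count descriptors per codeword.

    `weights` is a hypothetical per-codeword weight vector (assumption);
    the paper's actual weighting scheme is not given in the abstract.
    """
    k = codebook.shape[0]
    assign = nearest_codeword(descriptors, codebook)
    hist = np.bincount(assign, minlength=k).astype(float)
    if weights is not None:
        hist *= weights
    total = hist.sum()
    return hist / total if total > 0 else hist              # L1 normalization

def vlad(descriptors, codebook):
    """Standard VLAD: sum of residuals to the assigned codeword, L2-normalized."""
    k, d = codebook.shape
    assign = nearest_codeword(descriptors, codebook)
    v = np.zeros((k, d))
    for j in range(k):
        members = descriptors[assign == j]
        if len(members):
            v[j] = (members - codebook[j]).sum(axis=0)
    v = v.ravel()
    norm = np.linalg.norm(v)
    return v / norm if norm > 0 else v

# Toy usage: 3 two-dimensional descriptors, 2 codewords.
codebook = np.array([[0.0, 0.0], [10.0, 10.0]])
descriptors = np.array([[1.0, 0.0], [0.0, 1.0], [9.0, 10.0]])
hist = bow_histogram(descriptors, codebook)
code = vlad(descriptors, codebook)
```

The histogram has one bin per codeword, whereas the VLAD vector has one block of descriptor dimension per codeword, which is why VLAD typically works with much smaller codebooks.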

Acknowledgement

This publication has emanated from research conducted with the financial support of Science Foundation Ireland (SFI) under grant number SFI/12/RC/2289.

Author information

Authors and Affiliations

Insight Centre for Data Analytics, Dublin City University, Glasnevin, Ireland
Remi Trichet & Noel E. O’Connor

Authors

Remi Trichet
Noel E. O’Connor

Corresponding author

Correspondence to Remi Trichet.

Editor information

Editors and Affiliations

Université Paris-Sud 11, Orsay, France
Jacques Blanc-Talon
University of Salento, Lecce, Italy
Cosimo Distante
Ghent University, Gent, Belgium
Wilfried Philips
CSIRO ICT Centre, Sydney, New South Wales, Australia
Dan Popescu
University of Antwerp, Wilrijk, Belgium
Paul Scheunders

Cite this paper

Trichet, R., O’Connor, N.E. (2016). Vector Quantization Enhancement for Computer Vision Tasks. In: Blanc-Talon, J., Distante, C., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2016. Lecture Notes in Computer Science, vol 10016. Springer, Cham. https://doi.org/10.1007/978-3-319-48680-2_35

DOI: https://doi.org/10.1007/978-3-319-48680-2_35
Published: 21 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48679-6
Online ISBN: 978-3-319-48680-2
eBook Packages: Computer Science, Computer Science (R0)