Abstract
Visual vocabulary construction is an integral part of the popular Bag-of-Features (BOF) model. When visual data scale up (in terms of the dimensionality of features or/and the number of samples), most existing algorithms (e.g. k-means) become unfavorable due to the prohibitive time and space requirements. In this paper we propose the random locality sensitive vocabulary (RLSV) scheme towards efficient visual vocabulary construction in such scenarios. Integrating ideas from the Locality Sensitive Hashing (LSH) and the Random Forest (RF), RLSV generates and aggregates multiple visual vocabularies based on random projections, without taking clustering or training efforts. This simple scheme demonstrates superior time and space efficiency over prior methods, in both theory and practice, while often achieving comparable or even better performances. Besides, extensions to supervised and kernelized vocabulary constructions are also discussed and experimented with.
Support of IDMPO Grant R-705-000-018-279 Singapore and NRF/IDM Program under research Grant NRF2008IDMIDM004-029 are gratefully acknowledged.
Chapter PDF
References
Lowe, D.: Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 91–110 (2004)
Beyer, K.S., Goldstein, J., Ramakrishnan, R., Shaft, U.: When is nearest neigh- bor meaningful? In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 217–235. Springer, Heidelberg (1998)
Dasgupta, S.: Experiments with random projection. In: UAI, pp. 143–151 (2000)
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. ICCV, 604–610 (2005)
Nistér, D., Stewénius, H.: Scalable recognition with a vocabulary tree. In: CVPR, vol. (2), pp. 2161–2168 (2006)
Breiman, L.: Random forests. Machine Learning 45, 5–32 (2001)
Moosmann, F., Nowak, E., Jurie, F.: Randomized clustering forests for image classiffcation. IEEE Trans. Pattern Anal. 30, 1632–1646 (2008)
Yang, L., Jin, R., Sukthankar, R., Jurie, F.: Unifying discriminative visual code-book generation with classifier training for object category recognition. In: CVPR (2008)
Lee, H., Battle, A., Raina, R., Ng, A.: Efficient sparse coding algorithms. In: NIPS, vol. 19, p. 801 (2007)
Wright, J., Yang, A., Ganesh, A., Sastry, S., Ma, Y.: Robust face recognition via sparse representation. PAMI, 210–227 (2009)
Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to algorithms. The MIT press, Cambridge (2001)
Charikar, M.: Similarity estimation techniques from rounding algorithms. In: STOC, pp. 380–388 (2002)
Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. ACM Commun. 51, 117–122 (2008)
Kulis, B., Grauman, K.: Kernelized locality-sensitive hashing for scalable image search. In: ICCV (2009)
Bentley, J.L.: Multidimensional binary search trees used for associative searching. ACM Commun. 517, 509–517 (1975)
Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.: Locality-sensitive hashing scheme based on p-stable distributions. In: SCG, pp. 253–262 (2004)
Indyk, P., Motwani, R.: Approximate nearest neighbors: Towards removing the curse of dimensionality. In: STOC, pp. 604–613 (1998)
Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. ACM Commun. 51, 117–122 (2008)
Scholkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Machine Learning 63, 3–42 (2006)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: ICPR (2004)
Laptev, I.: Marsza lek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies (2008)
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 490–503. Springer, Heidelberg (2006)
Marée, R., Geurts, P., Piater, J.H., Wehenkel, L.: Random subwindows for robust image classification. In: CVPR, vol. (1), pp. 34–40 (2005)
Wu, X., Hauptmann, A.G., Ngo, C.W.: Practical elimination of near-duplicates from web video search. ACM Multimedia, 218–227 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mu, Y., Sun, J., Han, T.X., Cheong, LF., Yan, S. (2010). Randomized Locality Sensitive Vocabularies for Bag-of-Features Model. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6313. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15558-1_54
Download citation
DOI: https://doi.org/10.1007/978-3-642-15558-1_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15557-4
Online ISBN: 978-3-642-15558-1
eBook Packages: Computer ScienceComputer Science (R0)