Incremental visual objects clustering with the growing vocabulary tree

Fu, Zhenyong; Lu, Hongtao; Li, Wenbin

doi:10.1007/s11042-010-0616-x

Published: 06 October 2010

Volume 56, pages 535–552 (2012)

Multimedia Tools and Applications

Zhenyong Fu¹,
Hongtao Lu¹ &
Wenbin Li²

Abstract

With the bag-of-visual-words image representation, text analysis methods such as pLSA and LDA can be applied to visual object clustering and classification problems. However, previous work used only a fixed visual vocabulary, formed by vector-quantizing SIFT-like region descriptors, so the learned visual topic models are also tied to that fixed vocabulary. This paper presents a novel approach to clustering visual objects incrementally. Given a new batch of images, we first expand the visual vocabulary to include the new visual words, then adjust the object clustering model to absorb these new words, and finally output the clustering result. We achieve this by adapting to the visual domain the incremental pLSA model previously used for text analysis. Experimental results on images from seven categories in a dynamic environment demonstrate the feasibility and stability of the growing vocabulary tree and the resulting clustering performance.
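
The three steps named in the abstract (expand the vocabulary, adjust the model, report clusters) can be illustrated with a small sketch. It is not the authors' algorithm: it assumes bag-of-visual-words histograms are already available, uses a flat vocabulary rather than a vocabulary tree, and replaces the incremental pLSA update rules with a plain warm-started EM refit. All function names (`expand_vocabulary`, `absorb_batch`, and so on) and the toy counts are hypothetical.

```python
import random

def normalize(row):
    """Scale non-negative numbers to sum to 1 (uniform if the sum is zero)."""
    total = sum(row)
    return [v / total for v in row] if total > 0 else [1.0 / len(row)] * len(row)

def random_rows(n_rows, n_cols, rng):
    """Random probability rows, used to initialise P(z|d) and P(w|z)."""
    return [normalize([rng.random() + 0.1 for _ in range(n_cols)])
            for _ in range(n_rows)]

def em(counts, p_z_d, p_w_z, n_iter=50):
    """Standard pLSA EM on an (images x visual words) count matrix."""
    n_docs, n_words, n_topics = len(counts), len(counts[0]), len(p_w_z)
    for _ in range(n_iter):
        new_wz = [[1e-12] * n_words for _ in range(n_topics)]
        new_zd = [[1e-12] * n_topics for _ in range(n_docs)]
        for d in range(n_docs):
            for w in range(n_words):
                n = counts[d][w]
                if n == 0:
                    continue
                # E-step: P(z|d,w) is proportional to P(z|d) * P(w|z).
                post = normalize([p_z_d[d][z] * p_w_z[z][w]
                                  for z in range(n_topics)])
                # M-step accumulators, weighted by the word count n(d,w).
                for z in range(n_topics):
                    new_wz[z][w] += n * post[z]
                    new_zd[d][z] += n * post[z]
        p_w_z = [normalize(r) for r in new_wz]
        p_z_d = [normalize(r) for r in new_zd]
    return p_z_d, p_w_z

def expand_vocabulary(p_w_z, n_new_words, eps=0.01):
    """Step 1: append new visual words with a small mass, then renormalise."""
    return [normalize(row + [eps] * n_new_words) for row in p_w_z]

def absorb_batch(old_counts, new_counts, p_z_d, p_w_z, rng, n_iter=50):
    """Step 2 (simplified): warm-started refit over old and new images."""
    n_new_words = len(new_counts[0]) - len(p_w_z[0])
    p_w_z = expand_vocabulary(p_w_z, n_new_words)
    # Old histograms never saw the new words, so pad them with zeros.
    padded_old = [row + [0] * n_new_words for row in old_counts]
    new_p_z_d = random_rows(len(new_counts), len(p_w_z), rng)
    return em(padded_old + new_counts, p_z_d + new_p_z_d, p_w_z, n_iter)

rng = random.Random(0)
n_topics = 2

# First batch: four images over a vocabulary of four visual words.
batch1 = [[5, 4, 0, 0], [4, 5, 0, 0], [0, 0, 5, 4], [0, 0, 4, 5]]
p_z_d = random_rows(len(batch1), n_topics, rng)
p_w_z = random_rows(n_topics, len(batch1[0]), rng)
p_z_d, p_w_z = em(batch1, p_z_d, p_w_z)

# Second batch: two images that also use two previously unseen words.
batch2 = [[3, 2, 0, 0, 4, 0], [0, 0, 2, 3, 0, 4]]
p_z_d, p_w_z = absorb_batch(batch1, batch2, p_z_d, p_w_z, rng)

# Step 3: assign each image to its most probable topic.
clusters = [row.index(max(row)) for row in p_z_d]
print(clusters)
```

Refitting over all stored histograms keeps the sketch short, but its cost grows with every batch. Avoiding that full refit is the motivation for an incremental update of the kind the abstract refers to.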

Acknowledgements

This work was supported by the National High Technology Research and Development Program of China (No. 2008AA02Z310), the Shanghai Committee of Science and Technology (No. 08411951200, No. 08JG05002), the 973 Program (No. 2009CB320901), and NLPR (09-4-1).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
Zhenyong Fu & Hongtao Lu
Department of Diagnostic and Interventional Radiology, Affiliated Sixth People’s Hospital, Shanghai Jiao Tong University, Shanghai, China
Wenbin Li

Corresponding author

Correspondence to Zhenyong Fu.

About this article

Cite this article

Fu, Z., Lu, H. & Li, W. Incremental visual objects clustering with the growing vocabulary tree. Multimed Tools Appl 56, 535–552 (2012). https://doi.org/10.1007/s11042-010-0616-x

Published: 06 October 2010
Issue Date: February 2012
DOI: https://doi.org/10.1007/s11042-010-0616-x
