Skip to main content

Incremental Estimation of Visual Vocabulary Size for Image Retrieval

  • Conference paper
  • First Online:
Book cover Advances in Big Data (INNS 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 529))

Included in the following conference series:

  • 2227 Accesses

Abstract

The increasing amount of image databases over the last years has highlighted our need to represent an image collection efficiently and quickly. The majority of image retrieval and image clustering approaches has been based on the construction of a visual vocabulary in the so called Bag-of-Visual-words (BoV) model, analogous to the Bag-of-Words (BoW) model in the representation of a collection of text documents. A visual vocabulary (codebook) is constructed by clustering all available visual features in an image collection, using k-means or approximate k-means, requiring as input the number of visual words, i.e. the size of the visual vocabulary, which is hard to be tuned or directly estimated by the total amount of visual descriptors. In order to avoid tuning or guessing the number of visual words, we propose an incremental estimation of the optimal visual vocabulary size, based on the DBSCAN-Martingale, which has been introduced in the context of text clustering and is able to estimate the number of clusters efficiently, even for very noisy datasets. For a sample of images, our method estimates the potential number of very dense SIFT patterns for each image in the collection. The proposed approach is evaluated in an image retrieval and in an image clustering task, by means of Mean Average Precision and Normalized Mutual Information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.r-project.org/.

  2. 2.

    https://github.com/MKLab-ITI/topic-detection/blob/master/DBSCAN_Martingale.r.

  3. 3.

    https://cran.r-project.org/web/packages/dbscan/index.html.

  4. 4.

    http://wang.ist.psu.edu/docs/related/.

  5. 5.

    http://www.vision.caltech.edu/Image_Datasets/Caltech101/.

  6. 6.

    http://pami.xmu.edu.cn/~wlzhao/lip-vireo.htm.

  7. 7.

    https://github.com/MKLab-ITI/topic-detection/blob/master/DBSCAN_Martingale.r.

References

  1. Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: OPTICS: ordering points to identify the clustering structure. ACM Sigmod Rec. 28(2), 49–60 (1999)

    Google Scholar 

  2. Devroye, L.: Sample-based non-uniform random variate generation. In: Proceedings of the 18th Conference on Winter Simulation, pp. 260–265. ACM, December 1986

    Google Scholar 

  3. Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd 96(34), 226–231 (1996)

    Google Scholar 

  4. Gan, J., Tao, Y.: DBSCAN revisited: mis-claim, un-fixability, and approximation. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp. 519–530. ACM, May 2015

    Google Scholar 

  5. Gialampoukidis, I., Vrochidis, S., Kompatsiaris, I.: Fast visual vocabulary construction for image retrieval using skewed-split kd trees. In: MultiMedia Modeling, pp. 466–477. Springer International Publishing, January 2016

    Google Scholar 

  6. Gialampoukidis, I., Vrochidis, S., Kompatsiaris, I.: A Hybrid framework for news clustering based on the DBSCAN-Martingale and LDA. In: Machine Learning and Data Mining in Pattern Recognition, pp. 170–184. Springer International Publishing, July 2016

    Google Scholar 

  7. He, Y., Tan, H., Luo, W., Feng, S., Fan, J.: MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data. Front. Comput. Sci. 8(1), 83–99 (2014)

    Article  MathSciNet  Google Scholar 

  8. Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3304–3311. IEEE, June 2010

    Google Scholar 

  9. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Intl. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  10. Markatopoulou, F., Mezaris, V., Patras, I.: . Cascade of classifiers based on binary, non-binary and deep convolutional network descriptors for video concept detection. In: 2015 IEEE International Conference on Image Processing (ICIP), pp. 1786–1790. IEEE, September 2015

    Google Scholar 

  11. Mikolajczyk, K., Leibe, B., Schiele, B.: Multiple object class detection with a generative model. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 26–36. IEEE, June 2006

    Google Scholar 

  12. Mikulik, A., Chum, O., Matas, J.: Image retrieval for online browsing in large image collections. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds.) SISAP 2013. LNCS, vol. 8199, pp. 3–15. Springer, Heidelberg (2013). doi:10.1007/978-3-642-41062-8_2

    Chapter  Google Scholar 

  13. Perronnin, F., Sánchez, J., Mensink, T.: Improving the fisher kernel for large-scale image classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 143–156. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15561-1_11

    Chapter  Google Scholar 

  14. Philbin, J.: Scalable object retrieval in very large image collections. Doctoral dissertation, Oxford University (2010)

    Google Scholar 

  15. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Conference on Computer Vision and Pattern Recognition, 2007, CVPR 2007, pp. 1–8. IEEE, June 2007

    Google Scholar 

  16. Rawlings, J.O., Pantula, S.G., Dickey, D.A.: Applied regression analysis: a research tool. Springer Science & Business Media (1998)

    Google Scholar 

  17. Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, 2003, Proceedings, pp. 1470–1477. IEEE, October 2003

    Google Scholar 

  18. Van De Sande, K.E., Gevers, T., Snoek, C.G.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1582–1596 (2010)

    Article  Google Scholar 

  19. Wang, J., Wang, J., Ke, Q., Zeng, G., Li, S.: Fast approximate k-means via cluster closures. In: Multimedia Data Mining and Analytics, pp. 373–395. Springer International Publishing (2015)

    Google Scholar 

  20. Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. Intl. J. Comput. Vis. 73(2), 213–238 (2007)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the project MULTISENSOR (FP7-610411), funded by the European Commission.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ilias Gialampoukidis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Gialampoukidis, I., Vrochidis, S., Kompatsiaris, I. (2017). Incremental Estimation of Visual Vocabulary Size for Image Retrieval. In: Angelov, P., Manolopoulos, Y., Iliadis, L., Roy, A., Vellasco, M. (eds) Advances in Big Data. INNS 2016. Advances in Intelligent Systems and Computing, vol 529. Springer, Cham. https://doi.org/10.1007/978-3-319-47898-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47898-2_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47897-5

  • Online ISBN: 978-3-319-47898-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics