Latent Dirichlet Allocation Based Image Retrieval

  • Jing Hao
  • Hongxi Wei
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10390)


In recent years, the Bag-of-Visual-Words (BoVW) model has been widely used in computer vision. However, BoVW ignores not only the spatial information but also the semantic relations among visual words. In this study, a latent Dirichlet allocation (LDA) based model is proposed to capture the semantic relations among visual words. Because an LDA-based topic model used alone usually degrades performance, a visual language model (VLM) is linearly combined with the LDA-based topic model to represent each image. On our dataset, the proposed approach is compared with state-of-the-art approaches (BoVW, LLC, SPM and VLM). Experimental results indicate that the proposed approach outperforms the original BoVW, LLC, SPM and VLM.
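The linear combination mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes the common formulation in which a smoothed unigram model over visual words (standing in for the VLM) is interpolated with the LDA word distribution, and images are ranked by query likelihood. The function names, the Jelinek-Mercer smoothing, the weights `lam` and `alpha`, and the toy numbers are all illustrative assumptions.

```python
import numpy as np

def vlm_word_probs(counts, collection_probs, lam=0.5):
    """Smoothed unigram model over visual words (assumed Jelinek-Mercer smoothing).

    counts: (n_images, vocab) visual-word counts per image.
    collection_probs: (vocab,) visual-word distribution of the whole collection.
    """
    ml = counts / counts.sum(axis=1, keepdims=True)  # maximum-likelihood estimate
    return (1.0 - lam) * ml + lam * collection_probs

def lda_word_probs(theta, phi):
    """P_lda(w | image) = sum_z P(z | image) * P(w | z).

    theta: (n_images, n_topics) topic mixture per image.
    phi:   (n_topics, vocab) visual-word distribution per topic.
    """
    return theta @ phi

def combined_word_probs(p_vlm, p_lda, alpha=0.7):
    """Linear combination of the two models; alpha is a hypothetical weight."""
    return alpha * p_vlm + (1.0 - alpha) * p_lda

def query_log_likelihood(query_counts, word_probs):
    """Log-likelihood of the query's visual words under each image model."""
    return np.log(word_probs) @ query_counts

# Toy data (made up for illustration): 2 images, 4 visual words, 2 topics.
counts = np.array([[4.0, 1.0, 0.0, 0.0],
                   [0.0, 0.0, 3.0, 2.0]])
collection_probs = counts.sum(axis=0) / counts.sum()
theta = np.array([[0.9, 0.1],
                  [0.2, 0.8]])
phi = np.array([[0.60, 0.30, 0.05, 0.05],
                [0.05, 0.05, 0.50, 0.40]])

p = combined_word_probs(vlm_word_probs(counts, collection_probs),
                        lda_word_probs(theta, phi))

# A query dominated by visual words 0 and 1 should rank image 0 first.
query = np.array([2.0, 1.0, 0.0, 0.0])
scores = query_log_likelihood(query, p)
ranking = np.argsort(-scores)
print(ranking)
```

In practice `theta` and `phi` would come from a fitted LDA model, and the interpolation weights would be tuned on held-out queries. Because both components are proper distributions over visual words, their convex combination is one as well.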


Keywords: Image retrieval, Latent Dirichlet allocation, Visual language model, Query likelihood model, Smoothing



The paper is supported by the National Natural Science Foundation of China under Grant 61463038.



Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. School of Computer Science, Inner Mongolia University, Hohhot, China
