Skip to main content
Log in

Learning latent semantic model with visual consistency for image analysis

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Latent semantic models (e.g. PLSA and LDA) have been successfully used in document analysis. In recent years, many of the latent semantic models have also been proved to be promising for visual content analysis tasks, such as image clustering and classification. The topics and words which are two of the key components in latent semantic models have explicit semantic meaning in document analysis. However, these topics and words are difficult to be described or represented in visual content analysis tasks, which usually leads to failure in practice. In this paper, we consider simultaneously the topic consistency and word consistency in semantic space to adapt the traditional PLSA model to the visual content analysis tasks. In our model, the 1-graph is constructed to model the local neighborhood structure of images in feature space and the word co-occurrence is computed to capture the local word consistency. Then, the local information is incorporated into the model for topic discovering. Finally, the generalized EM algorithm is used to estimate the parameters. Extensive experiments on publicly available databases demonstrate the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. http://www.cs.nyu.edu/~roweis/data.html

References

  1. Belkin M, Niyogi P (2002) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15:1373–1396

    Article  Google Scholar 

  2. Blei D, Ng A, Jordan M (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  3. Bosch A, Zisserman A, Munoz X (2006) Scene classification via PLSA. In: Proceedings of ECCV. pp 517–530

  4. Cai D, Mei Q, Han J, Zhai C (2008) Modeling hidden topics on document manifold. In: Proceedings of CIKM. pp 911–920

  5. Cai D, Wang X, He X (2009) Probabilistic dyadic data analysis with local and global consistency. In: Proceedings of ICML

  6. Cao L, Li F (2007) Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In: Proceedings of ICCV. pp 1–8

  7. Cheng B, Yang J, Yan S, Fu Y, Huang T (2010) Learning with .1-graph for image analysis. IEEE Trans Image Process 19:858–866

    Article  MathSciNet  Google Scholar 

  8. Deerwester S, Dumais S, Furnas G, Landauer T, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41:391–407

    Article  Google Scholar 

  9. Dempster A, Laird N, Rubin D (1977) Maximum likelihood from in complete data via the EM algorithm. J Royal Stat Soc 39:1–38

    MATH  MathSciNet  Google Scholar 

  10. He X, Niyogi P (2003) Locality preserving projections. In: Proceedings of NIPS

  11. He X, Cai D, Yan S, Zhang H (2005) Neighborhood preserving embedding. In: Proceedings of ICCV. pp 1208–1213

  12. Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42:177–196

    Article  MATH  Google Scholar 

  13. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR. pp 2169–2178

  14. Li F, Rob F, Pietro P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: Proceedings of CVPR workshop on generative model based vision

  15. Li P, Cheng J, Lu H (2012) Modeling hidden topics with dual local consistency for image analysis. In: Proceedings of ACCV. pp 648–659

  16. Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110

    Article  Google Scholar 

  17. Meinshansen N, Buhlmann P (2006) High-dimensional graphs and variable selection with the lasso. Annals Stat 34:1436–1462

    Article  Google Scholar 

  18. Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. In: Proceedings of ACM multimedia. pp 348–351

  19. Neal R, Hinton G (1998) A view of the EM algorithm that justifies incremental, sparse, and other variants. Learn Graph Models

  20. Parikh D, Grauman K (2011) Relative attributes. In: Proceedings of IEEE international conference on computer vision

  21. Press W, Flannery B, Teukolsky S, Vetterling W (1992) Numerical recipes in C: the art of scientific computing. Cambridge University Press

  22. Roweis S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326

    Article  Google Scholar 

  23. Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380

    Article  Google Scholar 

  24. Tenenbaum J (1997) Mapping a manifold of perceptual observations. In: Proceedings of NIPS. pp 682–688

  25. Tenenbaum J, Silva V, Langford J (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323

    Article  Google Scholar 

  26. Wright J, Genesh A, Yang A, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31:210–227

    Article  Google Scholar 

  27. Xu W, Liu X, Gong Y (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of SIGIR. pp 267–273

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Cheng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cheng, J., Li, P., Rui, T. et al. Learning latent semantic model with visual consistency for image analysis. Multimed Tools Appl 74, 1341–1356 (2015). https://doi.org/10.1007/s11042-014-1916-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-1916-3

Keywords

Navigation