Learning latent semantic model with visual consistency for image analysis

Cheng, Jian; Li, Peng; Rui, Ting; Lu, Hanqing

doi:10.1007/s11042-014-1916-3

Learning latent semantic model with visual consistency for image analysis

Published: 29 March 2014

Volume 74, pages 1341–1356, (2015)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Jian Cheng¹,
Peng Li¹,
Ting Rui² &
…
Hanqing Lu¹

254 Accesses
Explore all metrics

Abstract

Latent semantic models (e.g. PLSA and LDA) have been successfully used in document analysis. In recent years, many of the latent semantic models have also been proved to be promising for visual content analysis tasks, such as image clustering and classification. The topics and words which are two of the key components in latent semantic models have explicit semantic meaning in document analysis. However, these topics and words are difficult to be described or represented in visual content analysis tasks, which usually leads to failure in practice. In this paper, we consider simultaneously the topic consistency and word consistency in semantic space to adapt the traditional PLSA model to the visual content analysis tasks. In our model, the ℓ ¹-graph is constructed to model the local neighborhood structure of images in feature space and the word co-occurrence is computed to capture the local word consistency. Then, the local information is incorporated into the model for topic discovering. Finally, the generalized EM algorithm is used to estimate the parameters. Extensive experiments on publicly available databases demonstrate the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

http://www.cs.nyu.edu/~roweis/data.html

References

Belkin M, Niyogi P (2002) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15:1373–1396
Article Google Scholar
Blei D, Ng A, Jordan M (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
MATH Google Scholar
Bosch A, Zisserman A, Munoz X (2006) Scene classification via PLSA. In: Proceedings of ECCV. pp 517–530
Cai D, Mei Q, Han J, Zhai C (2008) Modeling hidden topics on document manifold. In: Proceedings of CIKM. pp 911–920
Cai D, Wang X, He X (2009) Probabilistic dyadic data analysis with local and global consistency. In: Proceedings of ICML
Cao L, Li F (2007) Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In: Proceedings of ICCV. pp 1–8
Cheng B, Yang J, Yan S, Fu Y, Huang T (2010) Learning with .1-graph for image analysis. IEEE Trans Image Process 19:858–866
Article MathSciNet Google Scholar
Deerwester S, Dumais S, Furnas G, Landauer T, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41:391–407
Article Google Scholar
Dempster A, Laird N, Rubin D (1977) Maximum likelihood from in complete data via the EM algorithm. J Royal Stat Soc 39:1–38
MATH MathSciNet Google Scholar
He X, Niyogi P (2003) Locality preserving projections. In: Proceedings of NIPS
He X, Cai D, Yan S, Zhang H (2005) Neighborhood preserving embedding. In: Proceedings of ICCV. pp 1208–1213
Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42:177–196
Article MATH Google Scholar
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of CVPR. pp 2169–2178
Li F, Rob F, Pietro P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: Proceedings of CVPR workshop on generative model based vision
Li P, Cheng J, Lu H (2012) Modeling hidden topics with dual local consistency for image analysis. In: Proceedings of ACCV. pp 648–659
Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110
Article Google Scholar
Meinshansen N, Buhlmann P (2006) High-dimensional graphs and variable selection with the lasso. Annals Stat 34:1436–1462
Article Google Scholar
Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. In: Proceedings of ACM multimedia. pp 348–351
Neal R, Hinton G (1998) A view of the EM algorithm that justifies incremental, sparse, and other variants. Learn Graph Models
Parikh D, Grauman K (2011) Relative attributes. In: Proceedings of IEEE international conference on computer vision
Press W, Flannery B, Teukolsky S, Vetterling W (1992) Numerical recipes in C: the art of scientific computing. Cambridge University Press
Roweis S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326
Article Google Scholar
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Article Google Scholar
Tenenbaum J (1997) Mapping a manifold of perceptual observations. In: Proceedings of NIPS. pp 682–688
Tenenbaum J, Silva V, Langford J (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290:2319–2323
Article Google Scholar
Wright J, Genesh A, Yang A, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31:210–227
Article Google Scholar
Xu W, Liu X, Gong Y (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of SIGIR. pp 267–273

Download references

Author information

Authors and Affiliations

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
Jian Cheng, Peng Li & Hanqing Lu
PLA University of Sciences and Technology, Nanjing, 210007, China
Ting Rui

Authors

Jian Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Peng Li
View author publications
You can also search for this author in PubMed Google Scholar
Ting Rui
View author publications
You can also search for this author in PubMed Google Scholar
Hanqing Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jian Cheng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cheng, J., Li, P., Rui, T. et al. Learning latent semantic model with visual consistency for image analysis. Multimed Tools Appl 74, 1341–1356 (2015). https://doi.org/10.1007/s11042-014-1916-3

Download citation

Published: 29 March 2014
Issue Date: February 2015
DOI: https://doi.org/10.1007/s11042-014-1916-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning latent semantic model with visual consistency for image analysis

Abstract

Access this article

Similar content being viewed by others

Modeling Hidden Topics with Dual Local Consistency for Image Analysis

Multilayer Semantic Analysis in Image Databases

Visual and semantic context modeling for scene-centric image annotation

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning latent semantic model with visual consistency for image analysis

Abstract

Access this article

Similar content being viewed by others

Modeling Hidden Topics with Dual Local Consistency for Image Analysis

Multilayer Semantic Analysis in Image Databases

Visual and semantic context modeling for scene-centric image annotation

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation