A Novel Model for Semantic Learning and Retrieval of Images

Li, Zhixin; Shi, ZhiPing; Tang, ZhengJun; Zhao, Weizhong

doi:10.1007/978-3-642-32891-6_42

Part of the book series: IFIP Advances in Information and Communication Technology (IFIPAICT, volume 385)

Included in the following conference series:

International Conference on Intelligent Information Processing

Abstract

We first propose an extension of probabilistic latent semantic analysis (PLSA) that models continuous quantities, and we derive the corresponding EM algorithm for estimating its parameters. We then apply this model to automatic image annotation. To handle each modality according to its characteristics, we present a semantic annotation model that uses continuous PLSA for visual features and traditional PLSA for textual words. The two models are linked by sharing the same distribution over all aspects, and an asymmetric learning approach is adopted to estimate the model parameters. Because it associates the visual and textual modalities more precisely and effectively, the model predicts semantic annotations well for unseen images. We evaluate our approach on the Corel5k and Corel30k datasets. The experimental results show that it outperforms several state-of-the-art approaches.
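
For orientation, the traditional (discrete) PLSA component named in the abstract can be fit by EM roughly as sketched below. This is a minimal sketch of standard PLSA on a count matrix, not the continuous extension or the asymmetric learning procedure proposed in the chapter, whose details are not given on this page. The function name `plsa_em`, its parameters, and the small smoothing constant `eps` are illustrative assumptions.

```python
import numpy as np


def plsa_em(counts, n_aspects, n_iter=50, seed=0):
    """Fit standard discrete PLSA by EM (illustrative sketch only).

    counts: (n_docs, n_words) array of non-negative co-occurrence counts n(d, w).
    Returns P(z|d) with shape (n_docs, n_aspects), P(w|z) with shape
    (n_aspects, n_words), and the log-likelihood recorded at each iteration.
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    eps = 1e-12  # assumption: tiny constant to avoid division by zero

    # Random initialisation, normalised so each row is a distribution.
    p_z_d = rng.random((n_docs, n_aspects))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_aspects, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)

    log_liks = []
    for _ in range(n_iter):
        # E-step: posterior P(z|d,w) proportional to P(z|d) * P(w|z).
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]   # (docs, aspects, words)
        p_w_d = joint.sum(axis=1)                       # (docs, words)
        log_liks.append(float((counts * np.log(p_w_d + eps)).sum()))
        posterior = joint / (p_w_d[:, None, :] + eps)

        # M-step: re-estimate both distributions from expected counts.
        expected = counts[:, None, :] * posterior       # (docs, aspects, words)
        p_w_z = expected.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = expected.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps

    return p_z_d, p_w_z, log_liks
```

In an annotation setting, rows of `counts` would presumably correspond to images and columns to annotation words, with `p_z_d` playing the role of the per-image distribution over aspects that the abstract describes as shared between the visual and textual models.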

Author information

Authors and Affiliations

College of Computer Science and Information Technology, Guangxi Normal University, Guilin, 541004, China
Zhixin Li & ZhengJun Tang
College of Information Engineering, Capital Normal University, Beijing, 100048, China
ZhiPing Shi
College of Information Engineering, Xiangtan University, Xiangtan, 411105, China
Weizhong Zhao

Editor information

Editors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, China
Zhongzhi Shi
Computer Science Department, Indiana University, 47405, Bloomington, IN, USA
David Leake
School of Computing Science and Engineering, University of Salford, M5 4WT, Salford, UK
Sunil Vadera

About this paper

Cite this paper

Li, Z., Shi, Z., Tang, Z., Zhao, W. (2012). A Novel Model for Semantic Learning and Retrieval of Images. In: Shi, Z., Leake, D., Vadera, S. (eds) Intelligent Information Processing VI. IIP 2012. IFIP Advances in Information and Communication Technology, vol 385. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32891-6_42

DOI: https://doi.org/10.1007/978-3-642-32891-6_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32890-9
Online ISBN: 978-3-642-32891-6
eBook Packages: Computer Science, Computer Science (R0)
