Probabilistic semantic component descriptor

Chiang, Cheng-Chieh; Wu, Jia-Wei; Lee, Greg C.

doi:10.1007/s11042-011-0726-0

Probabilistic semantic component descriptor

Published: 13 January 2011

Volume 59, pages 629–643, (2012)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Cheng-Chieh Chiang¹,
Jia-Wei Wu² &
Greg C. Lee²

132 Accesses
2 Citations
Explore all metrics

Abstract

This paper proposes the probabilistic semantic component descriptor (PSCD) for automatically extracting semantic information in a set of images. The basic idea of the PSCD is first to identify what kinds of hidden semantic concepts associated with regions in a set of images and then to construct an image-based descriptor by integrating hidden concepts of regions in an image. First, low-level features of regions in images are quantized into a set of visual words. Visual words for representing region features and high-level concepts hidden in images are linked together using the unsupervised method probabilistic latent semantic analysis. The linkage of visual words and images is built on the entire set of images, and hence a set of hidden concepts to describe each of the regions is extracted. Next, regions with unreliable concepts are eliminated, and then a PSCD for each image is constructed by propagating the probabilities of hidden concepts in the remaining regions of an image. We also present quantitative experiments to demonstrate the performance of our proposed PSCD.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

Article 11 April 2015

Olga Russakovsky, Jia Deng, … Li Fei-Fei

The Pascal Visual Object Classes Challenge: A Retrospective

Article 25 June 2014

Mark Everingham, S. M. Ali Eslami, … Andrew Zisserman

References

Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
MATH Google Scholar
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60
Article Google Scholar
Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retrieval 11(2):77–107
Article Google Scholar
Duda RO, Hart PE, Stork DG (2001) Pattern classification. In second ed., John Wiley and Sons, Inc
Duygulu P, Barnard K, de Freitas N, Forsyth D (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proceedings of ECCV
Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In Proceedings of Workshop on Generative-Model Based Vision, in conjunction with CVPR
Hofmann T (1999) Probabilistic latent semantic indexing. In Proceedings of ACM SIGIR
Lavrenko V, Croft W (2001) Relevance-based language models. In Proceedings of ACM SIGIR
Lew MS, Sebe N, Djeraba C, Jain R (2006) Content-based multimedia information retrieval: state of the art and challenges. ACM Trans Multimed Comput Commun Appl 2(1):1–19
Article Google Scholar
Li J, Wang JZ (2008) Real-time computerized annotation of pictures. IEEE Trans Pattern Anal Mach Intell 30(6):985–1002
Article Google Scholar
Liu D, Chen T (2006) Semantic-shift for unsupervised object detection. In Proceedings of Workshop on Beyond Patches, in conjunction with CVPR
Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Article Google Scholar
Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
Article Google Scholar
Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vision 60(1):63–86
Article Google Scholar
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Article Google Scholar
Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. In Proceedings of ACM International Conference on Multimedia
Mori Y, Takahashi H, Oka R (1999) Image-to-word transformation based on dividing and vector quantizing images with words. In Proceedings of First International Workshop on Multimedia Intelligent Storage and Retrieval Management
Quelhas P, Monay F, Odobez J-M, Gatica-Perez D, Tuytelaars T (2007) A thousand words in a scene. IEEE Trans Pattern Anal Mach Intell 29(9):1575–1589
Article Google Scholar
Rabinovich A, Vedaldi A, Galleguillos C, Wiewiora E, Belongie S (2007) Objects in context. In Proceedings of International Conference on Computer Vision
Schreer O, Feldmann I, Mediavilla IA, Concejero P, Sadka AH, Swash MR, Benini S, Leonardi R, Janjusevic T, Izquierdo E (2010) Rushes—an annotation and retrieval engine for multimedia semantic units. Multimed Tools Appl Spec Issue Content Based Multimed Indexing 48(1):23–49
Google Scholar
Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In Proceedings of International Conference on Computer Vision
Smeulders AW, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Article Google Scholar
Smith J, Naphade M, Natsev A (2003) Multimedia semantic indexing using model vectors. In Proceedings of International Conference on Multimedia and Expo
Wolf L, Bileschi S (2006) A critical view of context. Int J Comput Vision 69(2):251–261
Article Google Scholar

Download references

Acknowledgement

This work was in part supported by National Science Council, Taiwan, under Grant No. NSC 98-2631-S-003-003 and by Ministry of Economic Affairs, Taiwan, under Grant No. 99-EC-17-A-02-S1-032.

Author information

Authors and Affiliations

Department of Information Technology, Takming University of Science and Technology, Taipei, Taiwan
Cheng-Chieh Chiang
Department of Computer Science and Information Engineering, National Taiwan Normal University, Taipei, Taiwan
Jia-Wei Wu & Greg C. Lee

Authors

Cheng-Chieh Chiang
View author publications
You can also search for this author in PubMed Google Scholar
Jia-Wei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Greg C. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Greg C. Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chiang, CC., Wu, JW. & Lee, G.C. Probabilistic semantic component descriptor. Multimed Tools Appl 59, 629–643 (2012). https://doi.org/10.1007/s11042-011-0726-0

Download citation

Published: 13 January 2011
Issue Date: July 2012
DOI: https://doi.org/10.1007/s11042-011-0726-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Probabilistic semantic component descriptor

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

The Pascal Visual Object Classes Challenge: A Retrospective

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Probabilistic semantic component descriptor

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

The Pascal Visual Object Classes Challenge: A Retrospective

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation