Abstract
This paper proposes the probabilistic semantic component descriptor (PSCD) for automatically extracting semantic information in a set of images. The basic idea of the PSCD is first to identify what kinds of hidden semantic concepts associated with regions in a set of images and then to construct an image-based descriptor by integrating hidden concepts of regions in an image. First, low-level features of regions in images are quantized into a set of visual words. Visual words for representing region features and high-level concepts hidden in images are linked together using the unsupervised method probabilistic latent semantic analysis. The linkage of visual words and images is built on the entire set of images, and hence a set of hidden concepts to describe each of the regions is extracted. Next, regions with unreliable concepts are eliminated, and then a PSCD for each image is constructed by propagating the probabilities of hidden concepts in the remaining regions of an image. We also present quantitative experiments to demonstrate the performance of our proposed PSCD.
Similar content being viewed by others
References
Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60
Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retrieval 11(2):77–107
Duda RO, Hart PE, Stork DG (2001) Pattern classification. In second ed., John Wiley and Sons, Inc
Duygulu P, Barnard K, de Freitas N, Forsyth D (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proceedings of ECCV
Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In Proceedings of Workshop on Generative-Model Based Vision, in conjunction with CVPR
Hofmann T (1999) Probabilistic latent semantic indexing. In Proceedings of ACM SIGIR
Lavrenko V, Croft W (2001) Relevance-based language models. In Proceedings of ACM SIGIR
Lew MS, Sebe N, Djeraba C, Jain R (2006) Content-based multimedia information retrieval: state of the art and challenges. ACM Trans Multimed Comput Commun Appl 2(1):1–19
Li J, Wang JZ (2008) Real-time computerized annotation of pictures. IEEE Trans Pattern Anal Mach Intell 30(6):985–1002
Liu D, Chen T (2006) Semantic-shift for unsupervised object detection. In Proceedings of Workshop on Beyond Patches, in conjunction with CVPR
Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110
Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vision 60(1):63–86
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. In Proceedings of ACM International Conference on Multimedia
Mori Y, Takahashi H, Oka R (1999) Image-to-word transformation based on dividing and vector quantizing images with words. In Proceedings of First International Workshop on Multimedia Intelligent Storage and Retrieval Management
Quelhas P, Monay F, Odobez J-M, Gatica-Perez D, Tuytelaars T (2007) A thousand words in a scene. IEEE Trans Pattern Anal Mach Intell 29(9):1575–1589
Rabinovich A, Vedaldi A, Galleguillos C, Wiewiora E, Belongie S (2007) Objects in context. In Proceedings of International Conference on Computer Vision
Schreer O, Feldmann I, Mediavilla IA, Concejero P, Sadka AH, Swash MR, Benini S, Leonardi R, Janjusevic T, Izquierdo E (2010) Rushes—an annotation and retrieval engine for multimedia semantic units. Multimed Tools Appl Spec Issue Content Based Multimed Indexing 48(1):23–49
Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In Proceedings of International Conference on Computer Vision
Smeulders AW, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Smith J, Naphade M, Natsev A (2003) Multimedia semantic indexing using model vectors. In Proceedings of International Conference on Multimedia and Expo
Wolf L, Bileschi S (2006) A critical view of context. Int J Comput Vision 69(2):251–261
Acknowledgement
This work was in part supported by National Science Council, Taiwan, under Grant No. NSC 98-2631-S-003-003 and by Ministry of Economic Affairs, Taiwan, under Grant No. 99-EC-17-A-02-S1-032.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chiang, CC., Wu, JW. & Lee, G.C. Probabilistic semantic component descriptor. Multimed Tools Appl 59, 629–643 (2012). https://doi.org/10.1007/s11042-011-0726-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-011-0726-0