Skip to main content
Log in

Probabilistic semantic component descriptor

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper proposes the probabilistic semantic component descriptor (PSCD) for automatically extracting semantic information in a set of images. The basic idea of the PSCD is first to identify what kinds of hidden semantic concepts associated with regions in a set of images and then to construct an image-based descriptor by integrating hidden concepts of regions in an image. First, low-level features of regions in images are quantized into a set of visual words. Visual words for representing region features and high-level concepts hidden in images are linked together using the unsupervised method probabilistic latent semantic analysis. The linkage of visual words and images is built on the entire set of images, and hence a set of hidden concepts to describe each of the regions is extracted. Next, regions with unreliable concepts are eliminated, and then a PSCD for each image is constructed by propagating the probabilities of hidden concepts in the remaining regions of an image. We also present quantitative experiments to demonstrate the performance of our proposed PSCD.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Blei DM, Ng AY, Jordan MI (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  2. Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60

    Article  Google Scholar 

  3. Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retrieval 11(2):77–107

    Article  Google Scholar 

  4. Duda RO, Hart PE, Stork DG (2001) Pattern classification. In second ed., John Wiley and Sons, Inc

  5. Duygulu P, Barnard K, de Freitas N, Forsyth D (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proceedings of ECCV

  6. Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In Proceedings of Workshop on Generative-Model Based Vision, in conjunction with CVPR

  7. Hofmann T (1999) Probabilistic latent semantic indexing. In Proceedings of ACM SIGIR

  8. Lavrenko V, Croft W (2001) Relevance-based language models. In Proceedings of ACM SIGIR

  9. Lew MS, Sebe N, Djeraba C, Jain R (2006) Content-based multimedia information retrieval: state of the art and challenges. ACM Trans Multimed Comput Commun Appl 2(1):1–19

    Article  Google Scholar 

  10. Li J, Wang JZ (2008) Real-time computerized annotation of pictures. IEEE Trans Pattern Anal Mach Intell 30(6):985–1002

    Article  Google Scholar 

  11. Liu D, Chen T (2006) Semantic-shift for unsupervised object detection. In Proceedings of Workshop on Beyond Patches, in conjunction with CVPR

  12. Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60(2):91–110

    Article  Google Scholar 

  13. Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767

    Article  Google Scholar 

  14. Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vision 60(1):63–86

    Article  Google Scholar 

  15. Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630

    Article  Google Scholar 

  16. Monay F, Gatica-Perez D (2004) PLSA-based image auto-annotation: constraining the latent space. In Proceedings of ACM International Conference on Multimedia

  17. Mori Y, Takahashi H, Oka R (1999) Image-to-word transformation based on dividing and vector quantizing images with words. In Proceedings of First International Workshop on Multimedia Intelligent Storage and Retrieval Management

  18. Quelhas P, Monay F, Odobez J-M, Gatica-Perez D, Tuytelaars T (2007) A thousand words in a scene. IEEE Trans Pattern Anal Mach Intell 29(9):1575–1589

    Article  Google Scholar 

  19. Rabinovich A, Vedaldi A, Galleguillos C, Wiewiora E, Belongie S (2007) Objects in context. In Proceedings of International Conference on Computer Vision

  20. Schreer O, Feldmann I, Mediavilla IA, Concejero P, Sadka AH, Swash MR, Benini S, Leonardi R, Janjusevic T, Izquierdo E (2010) Rushes—an annotation and retrieval engine for multimedia semantic units. Multimed Tools Appl Spec Issue Content Based Multimed Indexing 48(1):23–49

    Google Scholar 

  21. Sivic J, Russell BC, Efros AA, Zisserman A, Freeman WT (2005) Discovering objects and their location in images. In Proceedings of International Conference on Computer Vision

  22. Smeulders AW, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380

    Article  Google Scholar 

  23. Smith J, Naphade M, Natsev A (2003) Multimedia semantic indexing using model vectors. In Proceedings of International Conference on Multimedia and Expo

  24. Wolf L, Bileschi S (2006) A critical view of context. Int J Comput Vision 69(2):251–261

    Article  Google Scholar 

Download references

Acknowledgement

This work was in part supported by National Science Council, Taiwan, under Grant No. NSC 98-2631-S-003-003 and by Ministry of Economic Affairs, Taiwan, under Grant No. 99-EC-17-A-02-S1-032.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Greg C. Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chiang, CC., Wu, JW. & Lee, G.C. Probabilistic semantic component descriptor. Multimed Tools Appl 59, 629–643 (2012). https://doi.org/10.1007/s11042-011-0726-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-011-0726-0

Keywords

Navigation