Skip to main content
Log in

Multiple Hypergraph Clustering of Web Images by MiningWord2Image Correlations

  • Regular Paper
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

In this paper, we consider the problem of clustering Web images by mining correlations between images and their corresponding words. Since Web images always come with associated text, the corresponding textual tags of Web images are used as a source to enhance the description of Web images. However, each word has different contribution for the interpretation of image semantics. Therefore, in order to evaluate the importance of each corresponding word of Web images, we propose a novel visibility model to compute the extent to which a word can be perceived visually in images, and then infer the correlation of word to image by the integration of visibility with tf-idf. Furthermore, Latent Dirichlet Allocation (LDA) is used to discover topic information inherent in surrounding text and topic correlations of images could be defined for image clustering. For integrating visibility and latent topic information into an image clustering framework, we first represent textual correlated and latent-topic correlated images by two hypergraph views, and then the proposed Spectral Multiple Hypergraph Clustering (SMHC) algorithm is used to cluster images into categories. The SMHC could be regarded as a new unsupervised learning process with two hypergraphs to classify Web images. Experimental results show that the SMHC algorithm has better clustering performance and the proposed SMHC-based image clustering framework is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Smeulders A W M et al. Content-based image retrieval: The end of the early years. IEEE Trans. PAMI, 2000, 22(12): 1349-1380.

    Google Scholar 

  2. Rege M, Dong M, Hua J. Graph theoretical framework for simultaneously integrating visual and textual features for efficient Web image clustering. In Proc. the 17th Int. Conf. World Wide Web 2008, Beijing, China, April 21-25, 2008, pp.317-326.

  3. Jing F, Wang C et al. IGroup: A Web image search engine with semantic clustering of search results. In Proc. the 14th ACM Int. Conf. Multimedia, Singapore, Nov. 6-11, 2005, pp.377-384.

  4. Cai D, He X et al. Hierarchical clustering of WWW image search results using visual, textual and link information. In Proc. the 13th ACM Int. Conf. Multimedia, New York, USA, Oct. 10-16, 2004, pp.952-959.

  5. Saenko K, Darrell T. Unsupervised learning of visual sense models for polysemous words. In Proc. NIPS 2008, Vancouver, Canada, Dec. 8-11, 2008, pp.1393-1400.

  6. Blei D, Ng A, Jordan M. Latent Dirichlet allocation. Journal of Machine Learning Research, Mar. 2003, 3: 993-1022.

    Article  MATH  Google Scholar 

  7. Wu F, Liu Y, Zhuang Y. Tensor-based transductive learning for multi-modality video semantic concept detection. IEEE Transactions on Multimedia, 2009, 11(5): 868-878.

    Article  Google Scholar 

  8. Yang Y, Zhuang Y, Wu F, Pan Y. Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. IEEE Transactions on Multimedia, 2008, 10(3): 437-446.

    Article  Google Scholar 

  9. Wu F, Han Y, Zhuang Y, Shao J. Clustering web images by correlation mining of image-text. Journal of Software, http://www.jos.org.cn/1000-9825/3704.htm, 2010. (in Chinese)

  10. Zhou D, Burges C J C. Spectral clustering and transductive learning with multiple views. In Proc. the 24th Int. Conf. Machine Learning, Corvallis, OR, 2007, pp.1159-1166.

  11. Barnard K, Duygulu P et al. Matching words and pictures. Journal of Machine Learning Research, 2003, 3: 1107-1135.

    Article  MATH  Google Scholar 

  12. Wang X J, Zhang L et al. Annotating images by mining image search results. IEEE Trans. PAMI, 2008, 30(11): 1919-1932.

    Google Scholar 

  13. Zhu X, Goldberg A et al. A text-to-picture synthesis system for augmenting communication. In Proc. the 22nd Conf. AAAI, Vancouver, Canada, July 22-26, 2007, pp.1590-1595.

  14. Li H, Tang J et al. Word2Image: Towards visual interpretation of words. In Proc. the 16th ACM Int. Conf. Multimedia, Vancouver, Canada, Oct. 26-31, 2008, pp.813-816.

  15. Xia D, Wu F, Zhuang Y. Search-based automatic Web image annotation using latent visual and semantic analysis. In Proc. the 9th Pacific Rim Conf. Multimedia, Tainan, China, Dec. 9-13, 2008.

  16. Berg T, Forsyth D. Animals on the Web. In Proc. CVPR 2006, Washington, DC, USA, June 17-22, 2006, pp.1463-1470.

  17. Zhou D, Huang J, Schölkopf B. Learning with hypergraphs clustering, classification, and embedding. Advances in Neural Information Processing Systems, Vancouver/Whistler, Canada, Dec. 4-9, 2007, pp. 1601-1608.

  18. Golub G, Loan C. Matrix Computations. 3rd Edition, The Johns Hopkins University Press, 1996.

  19. Griffiths T L, Steyvers M. Finding scientific topics. Proc. National Academy of Science, 2004, 101(Supp.1): 5228-5235.

    Article  Google Scholar 

  20. Dhillon I S. Co-clustering documents and words using bipartite spectral graph partitioning. In Proc. KDD, San Francisco, USA, Aug. 26-29, 2001, 269-274.

  21. LSCOM lexicon definitions and annotations version 1.0. In DTO Challenge Workshop on Large Scale Concept Ontology for Multimedia, Columbia University ADVENT Technical Report 117-2006-3, 2006.

  22. Grubinger M, Clough P et al. The IAPR TC-12 Benchmark: A new evaluation resource for visual information systems. In Proc. Int. Workshop OntoImage'2006 Language Resources for Content-Based Image Retrieval, Genoa, Italy, May 22, 2006, pp.13-23.

  23. Alexander S, Joydeep G. Cluster ensembles – A knowledge reuse framework for combining multiple partitions. Journal Machine Learning Research, December 2002, 3: 583-617.

    Google Scholar 

  24. Long B, Zhang Z, Xu T. Clustering on complex graphs. In Proc. the 23rd Conf. AAAI 2008, Chicago, USA, July, 2008.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fei Wu.

Additional information

Supported by the National Natural Science Foundation of China under Grant Nos. 90920303, 60833006; the National Basic Research 973 Program of China under Grant No. 2010CB327905; the Program for Changjiang Scholars and Innovative Research Team in University of China under Grant Nos. IRT0652, PCSIRT.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, F., Han, YH. & Zhuang, YT. Multiple Hypergraph Clustering of Web Images by MiningWord2Image Correlations. J. Comput. Sci. Technol. 25, 750–760 (2010). https://doi.org/10.1007/s11390-010-9362-9

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-010-9362-9

Keywords

Navigation