Skip to main content

Image Clustering Using Multimodal Keywords

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4306))

Abstract

Extending our previous work on visual keywords, we use the concept of template-based visual keywords using MPEG-7 color descriptors. MPEG-7, also called the Multimedia Content Description Interface, has been a standard for many years. These color descriptors have the ability to characterize perceptual color similarity and need relatively low complexity operations to extract them, besides being scalable and interoperable. We then demonstrate the power of these visual keywords for image clustering, when used in tandem with textual keyword annotations, in the context of latent semantic analysis, a popular technique in classical information retrieval which has been used to reveal the underlying semantic structure of document collections.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Vailaya, A., Figueiredo, M., Jain, A., Zhang, H.: Image Classification for Content-Based Indexing. IEEE Transaction on Image Processing 10(1) (2001)

    Google Scholar 

  2. Carson, C., Belonge, S., Greenspan, H., Malik, J.: Blobworld: A System for Region-Based Image Indexing and Retrieval. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 509–517. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  3. Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  4. Bhattacharya, A., Ljosa, V., Pan, J., Verardo, M.R., Yang, H., Faloutsos, C., Singh, A.K.: ViVo: Visual Vocabulary Construction for Mining Biomedical Images. In: ICDM (2005)

    Google Scholar 

  5. Sreenath, D.V., Grosky, W.I., Fotouhi, F.: Using Coherent Semantic Subpaths to Derive Emergent Semantics. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds.) KES 2004. LNCS (LNAI), vol. 3215, pp. 173–179. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  6. Dhillon, I.S., Modha, d.S.: Concept Decompositions for Large Sparse Text Data Using Clustering. Machine Learning 42(1), 143–175 (2001)

    Article  MATH  Google Scholar 

  7. Salton, G., McGill, M.J.: Introduction to Modern retrieval. McGraw Hill Book Company, New York (1983)

    MATH  Google Scholar 

  8. Berry, M.W., Dumais, S.T., O’Brien, G.W.: Using Linear Algebra for Intelligent Information Retrieval. SIAM Review 37(4), 573–595 (1995)

    Article  MATH  MathSciNet  Google Scholar 

  9. Manjunath, B.S., Salembier, P., Sikora, T. (eds.): Introduction to MPEG-7- Multimedia Content Description Interface. John Wiley & Sons, Chichester (2002)

    Google Scholar 

  10. van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval. British Library Research and Development Report, no. 5587 (1980)

    Google Scholar 

  11. Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D., Blei, D., Jordan, M.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)

    Article  MATH  Google Scholar 

  12. http://www.chiariglione.org/MPEG/standards/mpeg-7/mpeg-7.htm

  13. MPEG-7: Visual experimentation model (xm) version 10.0. ISO/IEC/JTC1/SC29/WG11, Doc. N4062 (2001)

    Google Scholar 

  14. Turk, M.A., Pentland, A.P.: Eigenfaces for recognition. Journal of Cognitive Neuroscience 3(1), 71–96 (1991)

    Article  Google Scholar 

  15. Draper, B.A., Baek, K., Barlett, M.S., Beveridge, J.R.: Recognizing faces with PCA and ICA. Comp. Vis. And Image Understanding (91), 115–137 (2003)

    Google Scholar 

  16. Kasutani, E., Yamada, A.: The MPEG-7 Color Layout Descriptor: A Compact Image Feature Description for High-Speed Image/Video Segment Retrieval. In: ICIP 2001, October 2001, vol. I, pp. 674–677 (2001)

    Google Scholar 

  17. Manjunath, B.S., Ohm, J.R., Vasudevan, V.V., Yamada, A.: Color and Texture Descriptors. IEEE Transactions on Circuits and Systems for Video Technology 11(6) (2001)

    Google Scholar 

  18. Deerwester, A., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  19. Eckart, C., Young, G.: The Approximation of One Matrix by another of Lower Rank. Psychometrika 1, 211–218 (1936)

    Article  Google Scholar 

  20. Karypis, G.: CLUTO: A Clustering Toolkit Release 2.1.1, University of Minnesota, Department of Computer Science, Minneapolis, MN 55455, Technical Report: #02-017 (November 28, 2003)

    Google Scholar 

  21. Text retrieval Conference, http://trec.nist.gov

  22. Markkula, M., Sormunen, E.: Searching for photos — journalists’ practices in pictorial IR. In: The Challenge of Image Retrieval. Electronic Workshops in computing (1988)

    Google Scholar 

  23. Smeaton, A.F., Quigley, I.: Experiments on Using Semantic Distances Between Words in Image Caption Retrieval. In: Proceedings of SIGIR 1996, pp. 174–180 (1996)

    Google Scholar 

  24. http://wordnet.princeton.edu/

  25. Zeimpekis, D., Gallopoulos, E.: TMG: A MATLAB toolbox for generating term-document matrices from text collections. Technical Report HPCLAB-SCG 1/01-05, Computer Engineering & Informatics Dept., University of Patras, Greece, January (2005); Kogan, J., Nicholas, C., Teboulle, M. (eds.): Grouping Multidimensional Data: Recent Advances in Clustering. Springer, Heidelberg (2005) (to appear)

    Google Scholar 

  26. Carson, C., Belonge, S., Greenspan, H., Malik, J.: Blobworld: Image Segmentation using Expectation-Maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(8), 1026–1038 (2002)

    Article  Google Scholar 

  27. Frankel, C., Swain, M.J., Athios, V.: Webseer: An Image Search Engine for the World Wide Web. U. Chicago TR-96-14 (1996)

    Google Scholar 

  28. Hubert, L., Arabie, P.: Comparing partitions. Journal of Classification, 193–218 (1985)

    Google Scholar 

  29. Kuncheva, L.I., Hadjitodorov, S.T.: Using Diversity in Cluster Ensembles. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 2, pp. 1214–1219 (2004)

    Google Scholar 

  30. Russell, B.C., Torralba, A., Murphy, K.P., Freeman, W.T.: LabelMe: a database and web based tool for image annotation. MIT AI Lab Memo AIM-2005-025 (2005)

    Google Scholar 

  31. MILOS, http://milos.isti.cnr.it

  32. Tang, J., Hare, J.S., Lewis, P.H.: Image Auto-annotation using a Statistical Model with Salient Regions (Speech). In: IEEE International Conference on Multimedia & Expo (ICME 2006) (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Agrawal, R., Grosky, W., Fotouhi, F. (2006). Image Clustering Using Multimodal Keywords. In: Avrithis, Y., Kompatsiaris, Y., Staab, S., O’Connor, N.E. (eds) Semantic Multimedia. SAMT 2006. Lecture Notes in Computer Science, vol 4306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11930334_9

Download citation

  • DOI: https://doi.org/10.1007/11930334_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49335-8

  • Online ISBN: 978-3-540-49337-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics