Image Clustering Using Multimodal Keywords

Agrawal, Rajeev; Grosky, William; Fotouhi, Farshad

doi:10.1007/11930334_9

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4306)

Abstract

Extending our previous work on visual keywords, we introduce template-based visual keywords built from MPEG-7 color descriptors. MPEG-7, also called the Multimedia Content Description Interface, has been a standard for many years. Its color descriptors characterize perceptual color similarity, can be extracted with relatively low-complexity operations, and are scalable and interoperable. We then demonstrate the power of these visual keywords for image clustering when they are used in tandem with textual keyword annotations in the context of latent semantic analysis, a popular technique in classical information retrieval that has been used to reveal the underlying semantic structure of document collections.
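
The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The toy counts, the keyword names, the raw-count weighting, the rank of 2, and the small spherical k-means routine (used here in place of a dedicated clustering toolkit) are all assumptions made for illustration. Visual-keyword rows and textual-keyword rows are stacked into one keyword-by-image matrix, a truncated SVD projects the images into a latent space, and the projected images are clustered.

```python
import numpy as np

def lsa_embed(term_image, rank):
    """Project each image (one column of term_image) into a rank-dimensional latent space."""
    _, s, vt = np.linalg.svd(term_image, full_matrices=False)
    return (s[:rank, None] * vt[:rank]).T  # shape: (n_images, rank)

def unit_rows(x):
    """Scale every row to unit length, leaving all-zero rows unchanged."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.where(norms == 0.0, 1.0, norms)

def spherical_kmeans(points, n_clusters, n_iter=20):
    """Cluster rows by cosine similarity, with a deterministic farthest-first start."""
    x = unit_rows(points)
    centers = [x[0]]
    while len(centers) < n_clusters:
        # Pick the point least similar to every center chosen so far.
        best_sim = np.max(x @ np.array(centers).T, axis=1)
        centers.append(x[int(np.argmin(best_sim))])
    centers = np.array(centers)
    labels = np.zeros(len(x), dtype=int)
    for _ in range(n_iter):
        labels = np.argmax(x @ centers.T, axis=1)
        for c in range(n_clusters):
            members = x[labels == c]
            if len(members):  # keep the old center if a cluster empties
                centers[c] = unit_rows(members.mean(axis=0, keepdims=True))[0]
    return labels

# Hypothetical counts: rows are keywords, columns are four images.
visual = np.array([
    [2, 1, 0, 0],  # visual keyword "blue template"
    [1, 2, 0, 0],  # visual keyword "sand template"
    [0, 0, 2, 1],  # visual keyword "green template"
], dtype=float)
textual = np.array([
    [1, 1, 0, 0],  # annotation "beach"
    [1, 0, 0, 0],  # annotation "sky"
    [0, 0, 1, 1],  # annotation "forest"
    [0, 0, 0, 1],  # annotation "tree"
], dtype=float)

# Multimodal keyword-by-image matrix: both modalities share the image columns.
term_image = np.vstack([visual, textual])
embedding = lsa_embed(term_image, rank=2)
labels = spherical_kmeans(embedding, n_clusters=2)
print(labels)
```

In this toy setup the first two images share only beach-like keywords and the last two share only forest-like keywords, so the two pairs end up in different clusters. A real collection would typically call for a weighting scheme such as tf-idf and a rank chosen by experiment.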

Author information

Authors and Affiliations

Kettering University, 1700 West Third Av., Flint, MI, 48504, USA
Rajeev Agrawal
Wayne State University, 5143 Cass Avenue, 431 State Hall, Detroit, MI, 48202, USA
Rajeev Agrawal & Farshad Fotouhi
The University of Michigan – Dearborn, 4901 Evergreen Road, Dearborn, MI, 48128, USA
William Grosky

Editor information

Editors and Affiliations

Image, Video and Multimedia Systems Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., 157 80, Athens, Greece
Yannis Avrithis
Informatics and Telematics Institute, Centre for Research and Technology-Hellas, 57001, Thessaloniki, Greece
Yiannis Kompatsiaris
Fachbereich Informatik, Universität Koblenz-Landau, Universitätsstraße 1, 56070, Koblenz, Germany
Steffen Staab
Centre for Digital Video Processing, Adaptive Information Cluster, Dublin City University, Ireland
Noel E. O’Connor

About this paper

Cite this paper

Agrawal, R., Grosky, W., Fotouhi, F. (2006). Image Clustering Using Multimodal Keywords. In: Avrithis, Y., Kompatsiaris, Y., Staab, S., O’Connor, N.E. (eds) Semantic Multimedia. SAMT 2006. Lecture Notes in Computer Science, vol 4306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11930334_9

DOI: https://doi.org/10.1007/11930334_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49335-8
Online ISBN: 978-3-540-49337-2
eBook Packages: Computer Science, Computer Science (R0)