The State of the Art in Image and Video Retrieval

Sebe, Nicu; Lew, Michael S.; Zhou, Xiang; Huang, Thomas S.; Bakker, Erwin M.

doi:10.1007/3-540-45113-7_1

The State of the Art in Image and Video Retrieval

Nicu Sebe⁸,
Michael S. Lew⁹,
Xiang Zhou¹⁰,
Thomas S. Huang¹¹ &
…
Erwin M. Bakker⁹

Conference paper
First Online: 01 January 2003

1344 Accesses
31 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2728))

Abstract

Image and video retrieval continues to be one of the most exciting and fastest-growing research areas in the field of multimedia technology. What are the main challenges in image and video retrieval? Despite the sustained efforts in the last years, we think that the paramount challenge remains bridging the semantic gap. By this we mean that low level features are easily measured and computed, but the starting point of the retrieval process is typically the high level query from a human. Translating or converting the question posed by a human to the low level features seen by the computer illustrates the problem in bridging the semantic gap. However, the semantic gap is not merely translating high level features to low level features. The essence of a semantic query is understanding the meaning behind the query. This can involve understanding both the intellectual and emotional sides of the human, not merely the distilled logical portion of the query but also the personal preferences and emotional subtons of the query and the preferential form of the results.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Addis, M. Boniface, S. Goodall, P. Grimwood, S. Kim, P. Lewis, K. Martinez, and A. Stevenson. Integrated image content and metadata search and retrieval across multiple databases. In International Conference on Image and Video Retrieval, pages 88–97. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
N. Arica and F. Yarman-Vural. A compact shape descriptor based on the beam angle statistics. In International Conference on Image and Video Retrieval, pages 148–157. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
M. Baillie and J.M. Jose. Audio-based event detection for sports video. In International Conference on Image and Video Retrieval, pages 288–297. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
L. Barcelo, X. Oriols, and X. Binefa. Spatio-temporal decomposition of sport events for video indexing. In International Conference on Image and Video Retrieval, pages 418–427. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
Y. Cao, W. Tavanapong, and K. Kim. Audio-assisted scene segmentation for story browsing. In International Conference on Image and Video Retrieval, pages 428–437. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
Z. Chen, J. Ding, M. Zhang, and W. Tavanapong. Hierarchical clustering-merging in multidimensional index structures. In International Conference on Image and Video Retrieval, pages 78–87. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
I. Cohen, N. Sebe, Y. Sun, M.S. Lew, and T.S. Huang. Evaluation of expression recognition techniques. In International Conference on Image and Video Retrieval, pages 178–187. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
N. Dimitrova. Multimedia content analysis: The next wave. In International Conference on Image and Video Retrieval, pages 8–17. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
J.P. Eakins, K. Jonathan Riley, and J.D. Edwards. Shape feature matching for trademark image retrieval. In International Conference on Image and Video Retrieval, pages 28–37. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
P.G.B. Enser and C.J. Sandom. Towards a comprehensive survey of the semantic gap in visual image retrieval. In International Conference on Image and Video Retrieval, pages 279–287. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
A.A. Goodrum, M.M. Bejune, and A.C. Siochi. A state transition analysis of image search patterns on the web. In International Conference on Image and Video Retrieval, pages 269–278. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
D. Heesch, A. Yavlinski, and S. Rüger. Performance comparison of different similarity models for CBIR with relevance feedback. In International Conference on Image and Video Retrieval, pages 438–447. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
C-H. Hoi, W. Wang, and M. Lyu. A novel scheme for video similarity detection. In International Conference on Image and Video Retrieval, pages 358–367. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
N.R. Howe. A closer look at boosted image retrieval. In International Conference on Image and Video Retrieval, pages 58–67. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
A. Hughes, T. Wilkens, B.M. Wildemuth, and G. Marchionini. Text or pictures? An eyetracking study of how people view digital video surrogates. In International Conference on Image and Video Retrieval, pages 259–268. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
A. Jaimes, B.L. Tseng, and J.R. Smith. Modal keywords, ontologies, and reasoning for video understanding. In International Conference on Image and Video Retrieval, pages 239–248. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
F. Jing, M. Li, L. Zhang, H-J. Zhang, and B. Zhang. Learning in region-based image retrieval. In International Conference on Image and Video Retrieval, pages 198–207. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
A. Joly, C. Frelicot, and O. Buisson. Robust content-based video copy identification in a large reference database. In International Conference on Image and Video Retrieval, pages 398–407. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
S. Kim, S. Park, and M. Kim. Central object extraction for object-based image retrieval. In International Conference on Image and Video Retrieval, pages 38–47. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
J. Lay and L. Guan. Concept-based retrieval of art documents. In International Conference on Image and Video Retrieval, pages 368–377. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
T. Liu and J.R. Kender. Spatial-temporal semantic grouping of instructional video content. In International Conference on Image and Video Retrieval, pages 348–357. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
X. Liu, A. Srivastva, and D. Sun. Learning optimal representations for image retrieval applications. In International Conference on Image and Video Retrieval, pages 48–57. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
Y. Liu and J.R. Kender. Fast video retrieval under sparse training data. In International Conference on Image and Video Retrieval, pages 388–397. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
K. Miura, R. Hamada, I. Ide, S. Sakai, and H. Tanaka. Associating cooking video segments with preparation steps. In International Conference on Image and Video Retrieval, pages 168–177. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
H. Miyamori. Automatic annotation of tennis action for content-based retrieval by integrated audio and visual information. In International Conference on Image and Video Retrieval, pages 318–327. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
P. Mulhem and J-H. Lim. Home photo retrieval: Time matters. In International Conference on Image and Video Retrieval, pages 308–317. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
M. Naphade and J.R. Smith. A hybrid framework for detecting the semantics of concepts and context. In International Conference on Image and Video Retrieval, pages 188–197. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
H.J. Nock, G. Iyengar, and C. Neti. Speaker localisation using audio-visual synchrony: An empirical study. In International Conference on Image and Video Retrieval, pages 468–477. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
J-M. Odobez, D. Gatica-Perez, and M. Guillemot. Spectral structuring of home videos. In International Conference on Image and Video Retrieval, pages 298–307. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
G. Park, Y. Baek, and H-K. Lee. Majority based ranking approach in web image retrieval. In International Conference on Image and Video Retrieval, pages 108–117. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
S. Park, J. Park, and J.K. Aggarwal. Video retrieval of human interactions using model-based motion tracking and multi-layer finite state automata. In International Conference on Image and Video Retrieval, pages 378–387. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
M.H. Pi, C.S. Tong, and A. Basu. Improving fractal codes-based image retrieval using histogram of collage errors. In International Conference on Image and Video Retrieval, pages 118–127. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
M.J. Pickering, L. Wong, and S.M. Rüger. ANSES: Summarisation of news video. In International Conference on Image and Video Retrieval, pages 408–417. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
F. Qian, B. Zhang, and F. Lin. Constructive learning algorithm-based RBF network for relevance feedback in image retrieval. In International Conference on Image and Video Retrieval, pages 338–347. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
M. Rautianen, T. Seppanen, J. Pentilla, and J. Peltola. Detecting semantic concepts from video using temporal gradients and audio classification. In International Conference on Image and Video Retrieval, pages 249–258. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
K. Jonathan Riley, J.D. Edwards, and J.P. Eakins. Content-based retrieval of historical watermark images: II-electron radiographs. In International Conference on Image and Video Retrieval, pages 128–137. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
M. Rummukainen, J. Laaksonen, and M. Koskela. An efficiency comparison of two content-based image retrieval systems, GIFT and PicSOM. In International Conference on Image and Video Retrieval, pages 478–487. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
J.M. Sanchez, X. Binefa, and J.R. Kender. Combining multiple features in temporal models for the representation of visual contents in video. In International Conference on Image and Video Retrieval, pages 208–217. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
Y. Sawahata and K. Aizawa. Indexing of personal video captured by a wearable imaging system. In International Conference on Image and Video Retrieval, pages 328–337. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
G.J. Scott and C-R. Shyu. EBS k-d tree: An entropy balanced statistical k-d tree for image databases with ground-truth labels. In International Conference on Image and Video Retrieval, pages 448–457. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
H. Shao, T. Svoboda, T. Tuytelaars, and L. van Gool. HPAT indexing for fast object/scene recognition based on local appearance. In International Conference on Image and Video Retrieval, pages 68–77. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
C-B. Shim and J-W. Chang. Efficient similar trajectory-based retrieval for moving objects in video databases. In International Conference on Image and Video Retrieval, pages 158–167. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
A. Smeaton and P. Over. TRECVID: Benchmarking the effectiveness of information retrieval tasks in video. In International Conference on Image and Video Retrieval, pages 18–27. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
M. Uysal and F. Yarman-Vural. Selection of the best representative feature and membership assignment for content-based fuzzy image database. In International Conference on Image and Video Retrieval, pages 138–147. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Google Scholar
A. Velivelli, C-W. Ngo, and T.S. Huang. Detection of documentary scene changes by audio-visual fusion. In International Conference on Image and Video Retrieval, pages 218–228. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
H. Wu, H. Lu, and S. Ma. Multilevel relevance judgment, loss function, and performance measure in image retrieval. In International Conference on Image and Video Retrieval, pages 98–107. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
R. Yan, A. Hauptmann, and R. Jin. Multimedia search with pseudo-relevance feedback. In International Conference on Image and Video Retrieval, pages 229–238. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar
H. Ye and G. Xu. Fast search in large-scale image database using vector quantization. In International Conference on Image and Video Retrieval, pages 458–467. Lecture Notes in Computer Science, vol. 2728, Springer, 2003.
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

University of Amsterdam, The Netherlands
Nicu Sebe
Leiden University, The Netherlands
Michael S. Lew & Erwin M. Bakker
Siemens Corporate Research, USA
Xiang Zhou
University of Illinois at Urbana-Champaign, USA
Thomas S. Huang

Authors

Nicu Sebe
View author publications
You can also search for this author in PubMed Google Scholar
Michael S. Lew
View author publications
You can also search for this author in PubMed Google Scholar
Xiang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Thomas S. Huang
View author publications
You can also search for this author in PubMed Google Scholar
Erwin M. Bakker
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LIACS Media Lab, Leiden University, Niels Bohrweg 1, 2333 CA, Leiden, The Netherlands
Erwin M. Bakker & Michael S. Lew &
Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, 405 N. Mathews Avenue, Urbana, IL, 61801, USA
Thomas S. Huang
University of Amsterdam, Kruislaan 403, 1098 SJ, Amsterdam, The Netherlands
Nicu Sebe
Siemens Corporate Research, 755 College Road East, Princeton, NJ, 08540, USA
Xiang Sean Zhou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sebe, N., Lew, M.S., Zhou, X., Huang, T.S., Bakker, E.M. (2003). The State of the Art in Image and Video Retrieval. In: Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S. (eds) Image and Video Retrieval. CIVR 2003. Lecture Notes in Computer Science, vol 2728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45113-7_1

Download citation

DOI: https://doi.org/10.1007/3-540-45113-7_1
Published: 24 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40634-1
Online ISBN: 978-3-540-45113-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics