Filling the Semantic Gap in Video Retrieval: An Exploration

Hauptmann, Alexander; Yan, Rong; Lin, Wei-Hao; Christel, Michael; Wactlar, Howard

doi:10.1007/978-1-84800-076-6_10

Alexander Hauptmann³,
Rong Yan,
Wei-Hao Lin,
Michael Christel &
…
Howard Wactlar

504 Accesses
5 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alexander, A. and Meehleib, T. (2001) , ‘The thesaurus for graphic materials: Its history, use and future’, Cataloging and Classification Quarterly 31(3/4), 189–212.
Google Scholar
Amir, A., Hsu, W., Iyengar, G., Lin. C. -Y., Naphade, M., Natsev, A., Neti, C., Nock, H. J., Smith, J. R., Tseng, B. L., Wu, Y. and Zhang, D. (2003) , IBM research TRECVID-2003 video retrieval system, in ‘NIST TRECVID-2003’.
Google Scholar
Barnard, K., Duygulu, P., Forsyth, D., de Freitas, N., Blei, D. and Jordan, M. (2002), ‘Matching words and pictures’, Journal of Machine Learning Research 3, 1107–1135.
Article Google Scholar
Beitzel, S., Jensen, E. C., Frieder, O., Chowdhury, A. and Pass, G. (2005) , Surrogate scoring for improved metasearch precision, in ‘SIGIR ’05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval’, ACM Press, New York, NY, USA, pp. 583–584.
Google Scholar
Chang, S.-F., Hsu, W., Kennedy, L., Xie, L., Yanagawa, A., Zavesky, E. and Zhang, D.-Q. (2005) , Columbia university TRECVID-2005 video search and high-level feature extraction, in ‘NIST TRECVID’.
Google Scholar
Chang, S. F., Manmatha, R. and Chua., T. S. (2005), Combining text and audio-visual features in video indexing, in ‘IEEE ICASSP 2005’, Vol. 5, pp. 1005–1008.
Google Scholar
Christel, M. and Hauptmann, A. G. (2005) , The use and utility of high-level semantic features, in ‘International Conference on Image and Video Retrieval (CIVR’05)’, Vol. 3568 of Lecture Notes in Computer Science, Singapore, pp. 134–144.
Google Scholar
Chua, T. S., Neo, S. Y., Li, K., Wang, G. H., Shi, R., Zhao, M., Xu, H., Gao, S. and Nwe, T. L. (2004) , Trecvid 2004 search and feature extraction task by NUS PRIS, in ‘NIST TRECVID’.
Google Scholar
Cover, T. and Thomas, J. (1991) , Elements of Information Theory, Wiley-Interscience, New York, NY, USA.
MATH Google Scholar
Fellbaum, C. (1998) , WordNet: An Electronic Lexical Database, MIT Press, Cambridge, MA, USA.
MATH Google Scholar
Hauptmann, A. (2004) , Towards a large scale concept ontology for broadcast video, in ‘Third International Conference on Image and Video Retrieval (CIVR)’, pp. 674–675.
Google Scholar
Hauptmann, A. G., Baron, R., Chen, M.-Y., Christel, M., Duygulu, P., Huang, C., Jin, R., Lin, W.-H., Ng, T., Moraveji, N., Papernick, N., Snoek, C., Tzanetakis, G., Yang, J., Yan, R., and Wactlar, H. (2003) , Informedia at TRECVID 2003: Analyzing and searching broadcast news video, in ‘Proceedings of TRECVID’.
Google Scholar
Hauptmann, A. G. and Christel, M. G. (2004) , Successful approaches in the TREC video retrieval evaluations, in ‘Proceedings of the 12th annual ACM international conference on Multimedia’, ACM Press, New York, NY, USA, pp. 668–675.
Google Scholar
Hauptmann, A., Yan, R., Lin, W.-H., Christel, M. and Wactlar, H. (2007) , ‘Can high-level concepts fill the semantic gap in video retrieval? a case study with broadcast news’, IEEE Transactions on Multimedia 9(5), 958–966.
Article Google Scholar
Jeon, J., Lavrenko, V. and Manmatha, R. (2003) , Automatic image annotation and retrieval using cross-media relevance models, in ‘Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval’, ACM Press, New York, NY, USA, pp. 119–126.
Google Scholar
Kender, J. and Naphade, M. (2005) , Visual concepts for news story tracking: Analyzing and exploiting the NIST TRECVID video annotation experiment, in ‘Conference on Computer Vision and Pattern Recognition’, pp. 1174–1181.
Google Scholar
Lew, M., ed. (2002) , Intl. Conf. on Image and Video Retrieval, Vol. 2383 of Lecture Notes in Computer Science, Springer, The Brunei Gallery, SOAS, Russell Square, London, UK.
Google Scholar
Lin, W.-H. and Hauptmann, A. G. (2002) , News video classification using SVM-based multimodal classifiers and combination strategies, in ‘MULTIMEDIA ’02: Tenth ACM international conference on Multimedia’, ACM Press, New York, NY, USA, pp. 323–326.
Google Scholar
Markkula, M. and Sormunen, E. (2000) , ‘End-user searching challenges indexing practices in the digital newspaper photo archive’, Information Retrieval 1(4), 259–285.
Article MATH Google Scholar
Naphade, M. R., Kristjansson, T., Frey, B. and Huang, T. (1998) , Probabilistic multimedia objects (multijects): A novel approach to video indexing and retrieval in multimedia systems, in ‘Proceedings of ICIP’, Vol. 3, pp. 536–540.
Google Scholar
Naphade, M. R. and Smith, J. R. (2004) , On the detection of semantic concepts at TRECVID, in ‘12th annual ACM international conference on Multimedia’, ACM Press, New York, NY, USA, pp. 660–667.
Google Scholar
Naphade, M., Smith, J., Tesic, J., Chang, S.-F., Hsu, W., Kennedy, L., Hauptmann, A. and Curtis, J. (2006) , ‘Large-scale concept ontology for multimedia’, IEEE MultiMedia 13(3), 86–91.
Article Google Scholar
Natsev, A., Naphade, M. and Teši’c, J. (2005) , Learning the semantics of multimedia queries and concepts from a small number of examples, in ‘13th ACM International Conference on Multimedia’, pp. 598–607.
Google Scholar
Neo, S.-Y., Zhao, J., Kan, M.-Y. and Chua, T.-S. (2006) , Video retrieval using high level features: Exploiting query matching and confidence-based weighting, in ‘Proceedings of the Conference on Image and Video Retrieval (CIVR)’, Vol. 4071 of Lecture Notes in Computer Science, pp. 143–152.
Google Scholar
Over, P., Ianeva, T., Kraaij, W. and Smeaton, A. (2005) , Trecvid 2005 - an overview, in ‘Proceedings of TRECVID 2005’, NIST, USA.
Google Scholar
Petersen, T. (1994) , Art & Architecture Thesaurus, second edn, Oxford University Press, UK.
Google Scholar
Qiu, Y. and Frei, H.-P. (1993) , Concept based query expansion, in ‘Proceedings of the 16th annual international ACM SIGIR conference’, ACM Press, New York, NY, USA, pp. 160–169.
Google Scholar
Reed, S. and Lenat, D. (2002) , Mapping ontologies into CYC, in ‘AAAI Conference, Workshop on Ontologies For The Semantic Web’.
Google Scholar
Rodden, K., Basalaj, W., Sinclair, D. and Wood, K. (2001) , Does organisation by similarity assist image browsing?, in ‘CHI ’01: Proceedings of the SIGCHI conference on Human factors in computing systems’, ACM Press, New York, NY, USA, pp. 190–197.
Chapter Google Scholar
Shatford, S. (1986) , ‘Analyzing the subject of a picture: A theoretical aproach.’, Cataloging and Classification Quarterly 6, 39–62.
Article Google Scholar
Smeaton, A. F., Over, P. and Kraaij, W. (2006) , Evaluation campaigns and TRECVid, in ‘MIR ’06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval’, ACM Press, New York, NY, USA, pp. 321–330.
Chapter Google Scholar
Smeaton, A. and Over, P. (2003) , TRECVID: Benchmarking the effectiveness of information retrieval tasks on digital video., in ‘Proceedings of the International Conference on Image and Video Retrieval’.
Google Scholar
Smeulders, A., Worring, M., Santini, S., Gupta, A. and Jain, R. (2000) , ‘Content-based image retrieval: the end of the early years’, IEEE Transactions Pattern Analysis and Machine Intelligence 22(12), 1349–1380.
Article Google Scholar
Smith, J. R., Lin, C. Y., Naphade, M. R., Natsev, P. and Tseng, B. (2002) , Learning concepts from video using multi-modal features, in ‘International Thyrrhenian Workshop for Digital Communications IWDC’, Capri, Italy.
Google Scholar
Snoek, C., Worring, M., Geusebroek, J.-M., Seinstra, F. and Smeulders, A. (2006) , ‘The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing’, IEEE Transactions Pattern Analysis Machine Intelligence 28(10), 1678–1689.
Article Google Scholar
Snoek, C., Worring, M. and Smeulders, A. (2005) , Early versus late fusion in semantic video analysis, in ‘Proceedings of ACM Multimedia’, pp. 399–402.
Google Scholar
Snoek, C., Worring, M., van Gemert, J. C., Geusebroek, J.-M. and Smeulders, A. (2006) , The challenge problem for automated detection of 101 semantic concepts in multimedia, in ‘ACM Multimedia’, pp. 421–430.
Google Scholar
Srikanth, M., Varner, J., Bowden, M. and Moldovan, D. (2005) , Exploiting ontologies for automatic image annotation, in ‘Proceedings of the 28th Annual International ACM SIGIR (SIGIR 2005)’, pp. 552–558.
Google Scholar
Volkmer, T. and Natsev, A. (2006) , Exploring automatic query refinement for text-based video retrieval, in ‘IEEE International Conference on Multimedia and Expo (ICME)’, pp. 765 – 768.
Google Scholar
Wang, H., Liu, S. and Chia, L.-T. (2006) , Does ontology help in image retrieval?: a comparison between keyword, text ontology and multi-modality ontology approaches, in ‘MULTIMEDIA ’06: Proceedings of the 14th annual ACM international conference on Multimedia’, ACM Press, New York, NY, USA, pp. 109–112.
Chapter Google Scholar
Wu, Y., Chang, E. Y., Chang, K. C.-C. and Smith, J. (2004) , Optimal multimodal fusion for multimedia data analysis, in ‘Proceedings of the 12th annual ACM international conference on Multimedia’, pp. 572–579.
Google Scholar
Yan, R. (2006) , Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval, PhD thesis, Carnegie Mellon University.
Google Scholar
Yan, R. and Hauptmann, A. G. (2003) , The combination limit in multimedia retrieval, in ‘Proceedings of the eleventh ACM international conference on Multimedia’, pp. 339–342.
Google Scholar
Yan, R., Yang, J. and Hauptmann, A. G. (2004) , Learning query-class dependent weights in automatic video retrieval, in ‘Proceedings of the 12th annual ACM international conference on Multimedia’, pp. 548–555.
Google Scholar
Yang, J., Chen, M. Y. and Hauptmann, A. G. (2004) , Finding person x: Correlating names with visual appearances, in ‘International Conference on Image and Video Retrieval (CIVR’04)’, Ireland.
Google Scholar
Yang, Y. and Pedersen, J. (1997) , A comparative study on feature selection in text categorization, in ‘Proceedings of the 14th International Conference on Machine Learning (ICML)’, pp. 412–420.
Google Scholar
Yuan, J., Xiao, L., Wang, D., Ding, D., Zuo, Y., Tong, Z., Liu, X., Xu, S., Zheng, W., Li, X., Si, Z., Li, J., Lin, F. and Zhang, B. (2005) , Tsinghua university at TRECVID 2005, in ‘NIST TRECVID 2005’.
Google Scholar
Zhao, W., Chellappa, R., Phillips, P. J. and Rosenfeld, A. (2003) , ‘Face recognition: A literature survey’, ACM Computing Surveys 35(4), 399–458.
Article Google Scholar
Zipf, G. (1972) , Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology, Hafner Publishing Company, New York.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
Alexander Hauptmann

Authors

Alexander Hauptmann
View author publications
You can also search for this author in PubMed Google Scholar
Rong Yan
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Hao Lin
View author publications
You can also search for this author in PubMed Google Scholar
Michael Christel
View author publications
You can also search for this author in PubMed Google Scholar
Howard Wactlar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Informatics and Telematics Institute, Thermi-Thessaloniki, Greece
Yiannis Kompatsiaris PhD
Chartered Engineer Motorola Labs, Basingstoke, UK
Paola Hobson PhD, MBA

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Hauptmann, A., Yan, R., Lin, WH., Christel, M., Wactlar, H. (2008). Filling the Semantic Gap in Video Retrieval: An Exploration. In: Kompatsiaris, Y., Hobson, P. (eds) Semantic Multimedia and Ontologies. Springer, London. https://doi.org/10.1007/978-1-84800-076-6_10

Download citation

DOI: https://doi.org/10.1007/978-1-84800-076-6_10
Publisher Name: Springer, London
Print ISBN: 978-1-84800-075-9
Online ISBN: 978-1-84800-076-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics