Filling the Semantic Gap in Video Retrieval: An Exploration

  • Chapter
Semantic Multimedia and Ontologies




Copyright information

© 2008 Springer-Verlag London Limited

About this chapter

Cite this chapter

Hauptmann, A., Yan, R., Lin, WH., Christel, M., Wactlar, H. (2008). Filling the Semantic Gap in Video Retrieval: An Exploration. In: Kompatsiaris, Y., Hobson, P. (eds) Semantic Multimedia and Ontologies. Springer, London. https://doi.org/10.1007/978-1-84800-076-6_10

  • DOI: https://doi.org/10.1007/978-1-84800-076-6_10

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84800-075-9

  • Online ISBN: 978-1-84800-076-6

  • eBook Packages: Computer Science, Computer Science (R0)
