Copyright information
© 2008 Springer-Verlag London Limited
About this chapter
Cite this chapter
Hauptmann, A., Yan, R., Lin, WH., Christel, M., Wactlar, H. (2008). Filling the Semantic Gap in Video Retrieval: An Exploration. In: Kompatsiaris, Y., Hobson, P. (eds) Semantic Multimedia and Ontologies. Springer, London. https://doi.org/10.1007/978-1-84800-076-6_10
DOI: https://doi.org/10.1007/978-1-84800-076-6_10
Publisher Name: Springer, London
Print ISBN: 978-1-84800-075-9
Online ISBN: 978-1-84800-076-6
eBook Packages: Computer Science (R0)