Abstract
All video will eventually become fully digital — there seems to be no way around it. Consequently, digital video databases will become more and more pervasive and finding video in large digital video databases will become a problem just like it is a problem today to find video in analog video databases. The digital form of the video, however, opens up tremendous possibilities. Just like it is possible today to retrieve text documents from large text document databases by querying document content represented by indices, it will become possible to index digital video databases based (semi-automatically derived indices.
In this paper, we address the problem of automatic video annotation — associating semantic meaning with video segments to aid in content-based video retrieval. We present a novel framework of structural video analysis which focuses on the processing of low-level visual data cues to obtain high-level (structural and semantic) video interpretations. Additionally, we propose a flexible framework for video query formulation to aid rapid retrieval of video. This framework is meant to accommodate the “depth-first searcher” i.e., the power user, the “breath-first searcher,” and the casual browser.
Preview
Unable to display preview. Download preview PDF.
References
D. Anastassiou Digital television. Proc. IEEE, 82(4):510–519, April 1994.
R.M. Bolle, M.M. Yeung, and B.L. Yeo. Video query: Beyond the keyframes. Technical Report RC 20586, IBM T.J. Watson Research Center, October 1996.
J. Brinkley. Defining Vision. Harcourt Brace & Company, New York, NY, 1997.
E. Chan, S. Garcia, and S. Roukos. KNN nearest neighbor information retrieval, 1997.
W. Niblack et al. The QBIC project: Querying images by content using color, texture and shape. In Storage and Retrieval for Image and Video Databases, volume SPIE 1908, pages 13–25, 1993.
S. Mann and R.W. Picard. Virtual bellows: Constructing high quality stills from video. In Int. Conf. Image Processing, volume 1, pages 363–367, 1994.
J. Miller. Moving pictures. In H. Arlow, C. Blakemore, and M. Weston-Smith, editors, Images and Understanding, pages 180–194. Cambridge University Press, October 1986.
R. Mohan. Text based indexing of TV news stories. In Proceedings, SPIE Multimedia Storage and Archiving Systems, volume SPIE 2916, pages 2–13, November 1996.
L. R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257–286, February 1989.
L.A. Rowe, J.S. Boreczky, and C.A. Eads. Indices for user access to large video database. In Storage and Retrieval for Image and Video Database II, IS&T/SPIE, Symposium on Elec. Imaging Sci. & Tech., pages 150–161, February 1994.
B. Shahraray and D. Gibbon. Automatic generation of pictorial transcripts of video programs. In Multimedia Computing and Networking 1995, volume SPIE 2417, pages 512–528, February 1995.
D. Swanberg, C. F. Shu, and R. Jain. Knowledge guided parsing in video databases. In Storage and Retrieval for Image and Video Databases, volume SPIE 1908, pages 13–25, 1993.
Y-P Tan and R.M. Bolle. Binary video classification. Technical Report TBD, IBM T.J. Watson Research Center, 1997.
I. Witten,A. Moffat, and T. Bell. Managing gigabytes. Compressing and indexing documents and images. Van Nostrand Reinhold, New York, NY, 1994.
B. L. Yeo and B. Liu. Rapid scene analysis on compressed videos. IEEE Trans. on Circuits and Sys. For Video Techn., 5(6):533–544, December 1995.
M. M. Yeung and B. Liu. Efficient matching and clustering of video shots. In International Conference on Image Processing, volume I, pages 338–341, 1995.
M. M. Yeung and B. L. Yeo. Time-constrained clustering for segmentation of video into story units. In Int. Conf. on Pattern Recog., pages 375–380, August 1996.
M. M. Yeung, B. L. Yeo, W. Wolf, and B. Liu. Video browsing using clustering and scene transitions on compressed sequences. In Multimedia Computing and Networking 1995, volume SPIE 2417, pages 399–413, February 1995.
M.M. Yeung and B. L. Yeo. Video visualization for compact presentation and fast browsing of pictorial content. to appear in IEEE Transactions on Circuits and Systems For Video Technology, August 1997 (also IBM Research Report RC 20615, 1996).
M.M. Yeung and B.L. Yeo. Video content characterization and compaction for digital library applications. In SPIE Storage and Retrieval for Image & Video Databases, volume SPIE 3022, pages 45–58, February 1997.
H. J. Zhang, Y. H. Gong, S. W. Smoliar, and S. Y. Yan. Automatic parsing of news video. In Int. Conf. Multimedia Computing and Sys., pages 45–54, 1994.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolle, R.M., Yeo, BL., Yeung, M.M. (1997). Video query and retrieval. In: Sattar, A. (eds) Advanced Topics in Artificial Intelligence. AI 1997. Lecture Notes in Computer Science, vol 1342. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63797-4_54
Download citation
DOI: https://doi.org/10.1007/3-540-63797-4_54
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63797-4
Online ISBN: 978-3-540-69649-0
eBook Packages: Springer Book Archive