A Case Study in Video Parsing: Television News

Furht, Borko; Smoliar, Stephen W.; Zhang, HongJiang

doi:10.1007/978-1-4615-2277-5_14

Borko Furht⁴,
Stephen W. Smoliar⁵ &
HongJiang Zhang⁵

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 326))

121 Accesses
1 Citations

Abstract

Automatic extraction of “semantic” information of general video programs is outside the capability of current machine vision and audio signal analysis technologies. On the other hand “content parsing” may be possible when one has an a priori model of a video’s structure based on domain knowledge. Such a model may represent a strong spatial order within the individual images and/or a strong temporal order across a sequence of shots. A television news program is a good example of a video which follows such a structural model: there tends to be spatial structure within the anchorperson shots and temporal structure in the order of shots and episodes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. Akutsu et al. Video indexing using motion vectors. In Vi-sual Communications and Image Processing ‘82, pages 1522–1530, Boston, MA, November 1992. SPIE.
Google Scholar
F. Arman et al. Content-based browsing of video sequences. In Pro-ceedings: ACM Multimedia 94, San Francisco, CA, October 1994. ACM.
Google Scholar
E. H. Adelson and J. R. Bergen. Spatiotemporal energy models for the perception of motion. Journal of the Optical Society of America A, 2(2):284–299, February 1985.
Article Google Scholar
E. Adelson. Mechanisms for motion perception. Optics and Pho-tonics News, 2(8):24–30, August 1991.
Article Google Scholar
S. Al-Hawamdeh et al. Nearest neighbour searching in a picture archive system. In International Conference on Multimedia Information Systems ‘81, pages 17–33, Singapore, January 1991. ACM, McGraw Hill.
Google Scholar
F. Arman, A. Hsu, and M.-Y. Chiu. Feature management for large video databases. In W. Niblack, editor, Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases, pages 2–12, San Jose, CA, February 1993. IS&T/SPIE.
Book Google Scholar
F. Arman, A. Hsu, and M.-Y. Chiu. Image processing on compressed data for large video databases. In Proceedings: ACM Multimedia 93, pages 267–272, Anaheim, CA, August 1993. ACM.
Google Scholar
Y.-H. Ang, A. D. Narasimhalu, and S. Al-Hawamdeh. Image information retrieval systems. In C. H. Chen, L. F. Pau, and P. S. P. Wang, editors, Handbook of Pattern Recognition and Computer Vision, chapter 4.2, pages 719–739. World Scientific, SINGAPORE, 1993.
Google Scholar
T. G. Aguierre Smith. If you could see what I mean…descriptions of video in an anthropologist’s video notebook. Master’s thesis, Massachusetts Institute of Technology, Cambridge, MA, September 1992.
Google Scholar
E. H. Adelson and J. Y. A. Wang. Representing moving images with layers. Technical Report 228, MIT Media Lab Perceptual Computing Group, Cambridge, MA, April 1993.
Google Scholar
N. J. Belkin and W. B. Croft. Information filtering and information retrieval: Two sides of the same coin? Communications of the ACM, 35(12):29–38, December 1992.
Article Google Scholar
A. Barr and E. A. Feigenbaum, editors. The Handbook of Artificial Intelligence, volume 1, chapter 3, pages 141–222. William Kaufmann, Los Altos, CA, 1981.
Google Scholar
P. Brodatz. Textures: A Photographic Album for Artists and De-signers. Dover, New York, 1966.
Google Scholar
N. I. Badler and S. W. Smoliar. Digital representations of human movement. Computing Surveys, 11(1):19–38, March 1979.
Article Google Scholar
D. Bordwell and K. Thompson. Film Art: An Introduction. Mc-Graw Hill, New York, NY, fourth edition, 1993.
Google Scholar
A. E. Cawkill. The British Library’s picture research projects: Im-age, word, and retrieval. Advanced Imaging, 8(10):38–40, October 1993.
Google Scholar
P. R. Cohen and E. A. Feigenbaum, editors. The Handbook of Artificial Intelligence, volume 3, chapter 13, pages 125–321. William Kaufmann, Los Altos, CA, 1982.
Google Scholar
S. K. Chang and A. Hsu. Image information systems: Where do we go from here? IEEE Transactions on Knowledge and Data Engineering, 4(5):431–442, October 1992.
Article Google Scholar
S. K. Chang. Principles of Pictorial Information Systems Design. Prentice-Hall, Englewood Cliffs, NJ, 1989.
Google Scholar
C. J. Date. An Introduction to Database Systems. The Systems Programming Series. Addison-Wesley, Reading, MA, second edition, 1977.
Google Scholar
M. Davis. Media streams: An iconic visual language for video an-notation. In Proceedings: Symposium on Visual Languages, pages 196–202, Bergen, NORWAY, 1993. IEEE.
Google Scholar
N. Dimitrova and F. Golshani. a for semantic video database retrieval. In Proceedings: ACM Multimedia 94, San Francisco, CA, October 1994. ACM.
Google Scholar
R. Duda and P. Hart. Pattern Classification and Scene Analysis. Wiley, New York, NY, 1973.
Google Scholar
D. L. Drucker and M. D. Murie. QuickTime Handbook. Hayden, Carmel, IN, 1992.
Google Scholar
G. M. Edelman. Neural Darwinism: The Theory of Neuronal Group Selection. Basic Books, New York, NY, 1987.
Google Scholar
G. M. Edelman. The Remembered Present: A Biological Theory of Consciousness. Basic Books, New York, NY, 1989.
Google Scholar
E. L. Elliott. Watch • grab • arrange • see. Master’s thesis, Mas-sachusetts Institute of Technology, Cambridge, MA, February 1993.
Google Scholar
C. Faloutsos et al. Efficient and effective querying by image content. Journal of Intelligent Information Systems, 3:231–262, 1994.
Article Google Scholar
W. T. Freeman and E. H. Adelson. The design and use of steer-able filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(2):587–607, 1992.
MathSciNet Google Scholar
Y. Gong et al. An image database system with content capturing and fast image indexing abilities. In Proceedings of the International Conference on Multimedia Computing and Systems, pages 121–130, Boston, MA, May 1994. IEEE.
Google Scholar
E. Gidney, A. Chandler, and G. McFarlane. CSCW for film and TV preproduction. IEEE MultiMedia, 1(2):16–26, Summer 1994.
Article Google Scholar
J. J. Gibson. The Ecological Approach to Visual Perception. Erl-baum, Hillsdale, NJ, 1986.
Google Scholar
A. Gupta, T. Weymouth, and R. Jain. Semantic queries with pictures: The VIMSYS model. In Proceedings of the 17th International Conference on Very Large Databases, pages 69–79, Barcelona, SPAIN, September 1991.
Google Scholar
Y. H. Gong and H. J. Zhang. An effective method for detecting regions of given colors and the features of the region surfaces. In S. A. Rajala and R. L. Stevenson, editors, Symposium on Electronic Imaging Science and Technology: Image and Video Processing II, pages 274–285, San Jose, CA, February 1994. IS&T/SPIE.
Chapter Google Scholar
M. J. Hawley. Structure out of Sound. PhD thesis, Massachusetts Institute of Technology, Cambridge, MA, September 1993.
Google Scholar
M. Heidegger. Being and time: Introduction. In D. F. Krell, editor, Basic Writings from Being and Time (1927) to The Task of Thinking (1964), chapter 1, pages 37–89. HarperCollins, New York, NY, 1977. Translated from the German by J. Stambaugh in collaboration with J. G. Gray and D. F. Krell.
Google Scholar
A. Hampapur, R. Jain, and T. Weymouth. Digital video segmentation. In Proceedings: ACM Multimedia 94, San Francisco, CA, October 1994. ACM.
Google Scholar
B. K. P. Horn and B. G. Schunck. Determining optical flow. Arti-ficial Intelligence, 17:185–203,1981.
Article Google Scholar
M. K. Hu. Visual pattern recognition by moment invariants. In J. K. Aggarwal, R. O. Duda, and A. Rosenfeld, editors, Computer Methods in Image Analysis. IEEE Computer Society, Los Angeles, CA, 1977.
Google Scholar
L. E. Hunter. Knowledge acquisition planning: Gaining expertise through experience. Technical Report YALEU/DCS/TR-678, Yale University, New Haven, CT, January 1989.
Google Scholar
E. Husserl. The Crisis of European Sciences and Transcenden-tal Phenomenology. Northwestern University Press, Evanston, IL, 1970. Translated from the German, with an Introduction, by D. Carr.
Google Scholar
S. S. Intille. Tracking using a local closed-world assumption: Track-ing in the football domain. Technical Report 296, MIT Media Lab Perceptual Computing Group, Cambridge, MA, August 1994.
Google Scholar
M. Ioka. A method of defining the similarity of images on the basis of color information. Technical Report RT-0030, IBM Tokyo Research Laboratory, Tokyo, JAPAN, November 1989.
Google Scholar
A. K. Jain. Fundamentals of Digital Image Processing. Prentice-Hall, Englewood Cliffs, NJ, 1989.
Google Scholar
T. Kato et al. A sketch retrieval method for full color image database: Query by visual example. In Proceedings: 11th International Conference on Pattern Recognition, pages 530–533, Amsterdam, HOLLAND, September 1992. IAPR, IEEE.
Google Scholar
R. Kasturi and R. Jain. Dynamic vision. In R. Kasturi and R. Jain, editors, Computer Vision: Principles, pages 469–480. IEEE Computer Society Press, Washington, DC, 1991.
Google Scholar
A. Khotanzad and R. L. Kashyap. Feature selection for texture recognition based on image synthesis. IEEE Transactions on Systems, Man, and Cybernetics, 17(6):1087–1095, November 1987.
Google Scholar
T. Kohonen. The Self-Organizing Map. Proceedings of the IEEE, 78(9):1464–1480, September 1990.
Article Google Scholar
A. Kankanhalli, H. J. Zhang, and C. Y. Low. Using texture for image retrieval. In Third International Conference on Automation, Robotics and Computer Vision, pages 935–939, SINGAPORE, November 1994.
Google Scholar
D. B. Lenat and R. V. Guha. Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project. Addison-Wesley, Reading, MA, 1989.
Google Scholar
F. Liu and R. W. Picard. Periodicity, directionality, and random-ness: Wold features for perceptual pattern recognition. In Proceed-ings: 12th International Conference on Pattern Recognition, pages 184–189, Jerusalem, ISRAEL, October 1994. IAPR, IEEE. VolumeII.
Google Scholar
M. Mills, J. Cohen, and Y. Y. Wong. A magnifier tool for video data. In Proceedings: CHI’92, pages 93–98, Monterey, CA, May 1992. ACM.
Google Scholar
J. Mao and A. K. Jain. Texture classification and segmentation using multiresolution simultaneous autoregressive models. Pattern Recognition, 25(2):173–188, 1992.
Article Google Scholar
D. McLeod and J. M. Smith. Abstraction in databases. SIG-PLAN Notices, 16(1):19–23, January 1981. Also SIGART Newsletter, Number 74, and SIGMOD Record, Volume 11, Number 2.
Article Google Scholar
W. Niblack et al. The QBIC project: Querying images by content using color, texture and shape. In Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases, San Jose, CA, February 1993. IS&T/SPIE.
Google Scholar
R. M. Nowak and J. L. Paradiso. Walker’s Mammals of the World. The Johns Hopkins University Press, Baltimore, MD, fourth edition, 1983.
Google Scholar
A. Nagasaka and Y. Tanaka. Automatic video indexing and full-video search for object appearances. In E. Knuth and L. M. Wegner, editors, Visual Database Systems, II, volume A-7 of IFIP Transactions A: Computer Science and Technology, pages 113–127. North-Holland, Amsterdam, THE NETHERLANDS, 1992.
Google Scholar
B. C. O’Connor. Selecting key frames of moving image documents: A digital environment for analysis and navigation. Microcomputers for Information Management, 8(2):119–133, June 1991.
MathSciNet Google Scholar
R. W. Picard and M. Gorkani. Finding perceptually dominant ori-entations in natural textures. Technical Report 229, MIT Media Laboratory Perceptual Computing Group, Cambridge, MA, 1993.
Google Scholar
R. W. Picard, T. Kabir, and F. Liu. Real-time recognition with the entire Brodatz texture database. In Proceedings: IEEE Conference on Computer Vision and Image Processing, pages 638–639, New York, NY, June 1993. IEEE.
Google Scholar
R. W. Picard and T. P. Minka. Vision texture for annotation. Tech-nical Report 302, MIT Media Laboratory Perceptual Computing Group, Cambridge, MA, 1994.
Google Scholar
A. Pentland, R. W. Picard, and S. Sclaroff. Photobook: Tools for content-based manipulation of image databases. In W. Niblack and R. Jain, editors, Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases II, pages 34–47, San Jose, CA, February 1994. IS&T/SPIE.
Book Google Scholar
W. K. Pratt. Digital Image Processing. Wiley, New York, NY, second edition, 1991.
Google Scholar
L. A. Rowe, J. S. Boreczky, and C. A. Eads. Indexes for user access to large video databases. In W. Niblack and R. C. Jain, editors, Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases II, pages 150–161, San Jose, CA, February 1994. IS&T/SPIE.
Book Google Scholar
D. E. Rumelhart, G. E. Hinton, and J. L. McClelland. A general framework for parallel distributed processing. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, volume 1, chapter 2, pages 45–76. The MIT Press, Cambridge, MA, 1986.
Google Scholar
M. J. Swain and D. H. Ballard. Color indexing. International Journal of Computer Vision, 7(1):11–32, 1991.
Article Google Scholar
A. N. Seeley. User Guide: Aldus Fetch Version.1.0. Aldus Corpo-ration, Seattle, WA, first edition, November 1992.
Google Scholar
IBM unleashes QBIC image-content search. Seybold Report on Desktop Publishing, 9(1), September 1994.
Google Scholar
R. Sriram, J. M. Francos, and W. A. Pearlman. Texture coding using a Wold decomposition model. In Proceedings: 12th International Conference on Pattern Recognition, pages 35–39, Jerusalem, ISRAEL, October 1994. IAPR, IEEE. Volume III.
Google Scholar
G. Salton and M. McGill. Introduction to Modern Information Re-trieval. McGraw-Hill, New York, NY, 1983.
Google Scholar
S. W. Smoliar. Classifying everyday sounds in video annotation. In T.-S. Chua and T. L. Kunii, editors, Multimedia Modeling, pages 309–313, SINGAPORE, November 1993.
Google Scholar
S. W. Smoliar. On the promises of multimedia authoring. Informa-tion and Software Technology, 36(4):243–245, April 1994.
Article Google Scholar
D. Swanberg, C.-F. Shu, and R. Jain. Knowledge guided parsing in video databases. In Symposium on Electronic Imaging: Science and Technology, San Jose, CA, 1993. IS&T/SPIE.
Google Scholar
S. W. Smoliar and H. J. Zhang. Content-based video indexing and retrieval. IEEE MultiMedia, 1(2):62–72, Summer 1994.
Article Google Scholar
S. W. Smoliar, H. J. Zhang, and J. H. Wu. Using frame technology to manage video. In Second Singapore International Conference on Intelligent Systems, pages B189—B194, SINGAPORE, November 1994.
Google Scholar
Y. Tonomura et al. VideoMAP and VideoSpacelcon: Tools for anatomizing video content. In Proceedings: INTERCHI ‘83, pages 131–136, 544, Amsterdam, NETHERLANDS, April 1993. ACM
Google Scholar
L. Teodosio and W. Bender. Salient video stills: Content and con-text preserved. In Proceedings: ACM Multimedia 93, pages 39–46, Anaheim, CA, August 1993. ACM.
Google Scholar
D. C. Tseng and C. H. Chang. Color segmentation using perceptual attributes. In Proceedings: 11th International Conference on Pattern Recognition, pages 228–231, Amsterdam, HOLLAND, September 1992. IAPR, IEEE.
Google Scholar
M. Tuceryan and A. K. Jain. Texture analysis. In C. H. Chen, L. F. Pau, and P. S. P. Wang, editors, Handbook of Pattern Recognition and Computer Vision, chapter 4.2, pages 235–276. World Scientific, SINGAPORE, 1993.
Chapter Google Scholar
H. Tamura, S. Mori, and T. Yamawaki. Texture features corresponding to visual perception. IEEE Transactions on Systems, Man, and Cybernetics, 6(4):460–473, April 1976.
Google Scholar
Y. Tonomura. Video handling based on structured information for hypermedia systems. In International Conference on Multimedia Information Systems ‘81, pages 333–344, SINGAPORE, January 1991. ACM, McGraw Hill.
Google Scholar
J. K. Wu et al. Inference and retrieval of facial images. Multimedia Systems, 2(1):1–14, 1994.
Article Google Scholar
L. Wittgenstein. Philosophical Investigations. Basil Blackwell, Ox-ford, England, 1974. Translated by G. E. M. Anscombe.
Google Scholar
H. J. Zhang et al. Automatic parsing of news video. In Proceed-ings of the International Conference on Multimedia Computing and Systems, pages 45–54, Boston, MA, May 1994. IEEE.
Book Google Scholar
H. J. Zhang et al. Video parsing using compressed data. In W. Niblack and R. Jain, editors, Symposium on Electronic Imaging Science and Technology: Image and Video Processing II, pages 142–149, San Jose, CA, February 1994. IS&T/SPIS.
Chapter Google Scholar
H. J. Zhang et al. A video database system for digital libraries. InN. R. Adam, B. Bhargava, and Y. Yesha, editors, Advances in Digital Libraries, Lecture Notes in Computer Science. Springer Verlag, Berlin, GERMANY, 1995. To appear.
Google Scholar
H. J. Zhang, A. Kankanhalli, and S. W. Smoliar. Automatic parti-tioning of full-motion video. Multimedia Systems, 1(1):10–28,1993.
Article Google Scholar
H. J. Zhang, C. Y. Low, and S. W. Smoliar. Video parsing and browsing using compressed data. Multimedia Tools and Applications, 1(1):91–113, February 1995.
Article Google Scholar
H. J. Zhang and S. W. Smoliar. Developing power tools for video indexing and retrieval. In W. Niblack and R. Jain, editors, Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases II, pages 140–149, San Jose, CA, February 1994. IS&T/SPIE.
Chapter Google Scholar
H. J. Zhang and D. Zhong. Scheme for visual feature-based im-age indexing. In W. Niblack and R. Jain, editors, Symposium on Electronic Imaging Science and Technology: Storage and Retrieval for Image Video Databases III, San Jose, CA, February 1995. IS&T/SPIE.
Google Scholar

Download references

Author information

Authors and Affiliations

Florida Atlantic University, Boca Raton, Florida, USA
Borko Furht
Institute of Systems Science, National University of Singapore, Singapore
Stephen W. Smoliar & HongJiang Zhang

Authors

Borko Furht
View author publications
You can also search for this author in PubMed Google Scholar
Stephen W. Smoliar
View author publications
You can also search for this author in PubMed Google Scholar
HongJiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Furht, B., Smoliar, S.W., Zhang, H. (1995). A Case Study in Video Parsing: Television News. In: Video and Image Processing in Multimedia Systems. The Springer International Series in Engineering and Computer Science, vol 326. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-2277-5_14

Download citation

DOI: https://doi.org/10.1007/978-1-4615-2277-5_14
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5960-9
Online ISBN: 978-1-4615-2277-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics