Abstract
We describe the Photobook system, which is a set of interactive tools for browsing and searching images and image sequences. These query tools differ from those used in standard image databases in that they make direct use of the image content rather than relying on text annotations. Direct search on image content is made possible by use of semantics-preserving image compression, which reduces images to a small set of perceptually-significant coefficients. We discuss three types of Photobook descriptions in detail: one that allows search based on appearance, one that uses 2-D shape, and a third that allows search based on textural properties. These image content descriptions can be combined with each other and with text-based descriptions to provide a sophisticated browsing and search capability. In this paper we demonstrate Photobook on databases containing images of people, video keyframes, hand tools, fish, texture swatches, and 3-D medical data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
E. Adelson and J. Bergen, “The Plenoptic Function and the Elements of Early Vision,” in: M. Landy and J. A. Movshon, (eds) Computational Models of Visual Processing, MIT Press (1991).
ACM SIGIR. Proceedings of International Conference on Multimedia Information Systems, Singapore, 1991.
D. Ballard and C. Brown. Computer Vision. Prentice Hall, 1982
E. Binaghi, I. Gagliardi, and R. Schettini. “Indexing and fuzzy logic-based retrieval of color images.” In Visual Database Systems, II, IFIP Transactions A-7, pages 79–92.
W. E. Blanz, D. Petkovic, and J. L. Sanz. Algorithms and Architectures for Machine Vision, ed. C.H. Chen, Marcel Decker Inc., 1989.
T. Breuel, Indexing for Recognition from a Large Model Base, M.I.T. Artificial Intelligence Laboratory Memo 1108, August 1990
P. Brodatz. Textures: A Photographic Album for Artists and Designers. Dover, New York, 1966.
C. C. Chang and S. Y. Lee. “Retrieval of similar pictures on pictorial databases.” Pattern recognition, 24(7):675 – 680, 1991.
C-C. Chang and T-C. Wu. “Retrieving the most similar symbolic pictures from pictorial databases.” Information Processing and Management, 28(5):581–588, 1992.
Z. Chen and S-Y. Ho. “Computer vision for robust 3D aircraft recognition with fast library search.” Pattern Recognition, 24(5): 375–390, 1991.
T. Darrell and A. Pentland, “Robust Estimation of a Multi-Layer Motion Representation”, in Proceedings IEEE Workshop on Visual Motion, pp. 173–177, 1991. longer version available as M.I.T. Media Laboratory Perceptual Computing Technical Report No. 163
T. Darrell, P. Maes, B. Blumberg, and A. Pentland, “A Novel Environment for Situated Vision and Behavior,” IEEE Workshop on Visual Behaviors pp. 68–72, Seattle, WA., June 19, 1994.
S. Smoliar, and H. Zhang, “Content-Based Video Indexing and Retrieval,” IEEE Multimedia Magazine, Vol. 1, No. 2, pp. 62–72, 1994.
R. Duda and P. Hart Pattern Classification and Scene Analysis. Wiley, New York, 1973.
J. Francos “Orthogonal Decompositions of 2-D Random Fields and their Applications for 2-D Spectral Estimation”, Signal Processing and its Applications, pp. 287–327, N.K. Bose and C.R. Rao (eds.), North-Holland., 1993.
P. Gast, “Integrating Eigenpicture Analysis with an Image Database,” M.I.T. Bachelors Thesis, Computer Science and Electrical Engineering Deptartment, Advisor: Alex Pentland, 1993.
W. I. Grosky, P. Neo, and R. Mehrotra. “A pictorial index mechanism for model-based matching.” Data and Knowledge Engineering, 8:309–327, 1992
K. Haase, “FRAMER: A Portable Persistent Representation Library,” Proceedings of the AAAI Workshop on AI in Systems and Support, Am. Asso. for AI, 1993.
K. Haase, “AI in Service and Support: Bridging the Gap”, Haase, Proceedings of Am. Asso. AI, 1993.
H. Helson and D. Lowdenslager, “Prediction Theory and Fourier Series in Several Variables.II”, Acta Mathmatica, Vol. 196, pp. 175–213, 1962.
K. Hirata and T. Kato. “Query by visual example,” In Advances in Database Technology EDBT ′92, Third International Conference on Extending Database Technology, Vienna, Austria, March 1992. Springer-Verlag.
M. Ioka. “A method of defining the similarity of images on the basis of color information,” Technical Report RT-003 0, IBM Tokyo Research Lab, 1989.
M. A. Ireton and C. S. Xydeas. “Classification of shape for content retrieval of images in a multimedia database,” In Sixth International Conference on Digital Processing of Signals in Communications, pages 111–116, Loughborough, UK, 2–6 Sept., 1990. IEE.
H. V. Jagadish. “A retrieval technique for similar shapes,” In International Conference on Management of Data, SIGMOD 91, pages 208–217, Denver CO, May 1991. ACM.
R. Jain and W. Niblack. NSF workshop on visual information management, February 1992.
T. Kato, T. Kurita, H. Shimogaki, T. Mizutori, and K. Fujimura. “A cognitive approach to visual interaction. In International Conference of Multimedia Information Systems,” MIS ′91, pages 109–120. ACM and National University of Singapore, January 1991.
Y. Lamdan and H. J. Wolfson. “Geometric hashing: A general and efficient model-based recognition scheme,” In 2nd International Conference on Computer Vision (ICCV), pages 238–249, Tampa, Florida, 1988. IEEE.
S-Y. Lee and F-J. Hsu. “2D C-string: A new spatial knowledge representation for image database systems,” Pattern Recognition, 23(10):1077–1087, 1990.
S-Y. Lee and F-J. Hsu. “Spatial reasoning and similarity retrieval of images using 2D c-string knowledge representation,” Pattern Recognition, 25(3):305–318, 1992.
A Lippman. “Semantic bandwidth compression,” Picture Coding Symposium, 1981.
P. McLean, “Structured Video Coding,” M.I.T. Masters Thesis, Advisor: Andrew Lippman, 1989.
J. Mao and A. Jain, “Texture Classification and Segmentation using Mul-tiresolution Simultaneous Autoregressive Models”, Pattern Recognition, Vol. 25, No. 2, pp 173–188, 1992.
R. Mehrotra and W. I. Grosky. “Shape matching utilizing indexed hypotheses generation and testing,” IEEE Transactions of Robotics and Automation, 5(1):70–77, 1989.
B. Moghaddam and A. Pentland, “Face recognition using view-based and modular eigenspaces for Identification And Inspection of Humans,” SPIE Conf. on Automatic Systems, San Diego, July 1994
W. Niblack, R. Barber, W. Equitz, M. Flickner, E. Glasman, D. Petkovic, and P. Yanker. “The QBIC project: Querying image s by content using color, texture, and shape,” In IS & T/SPIE 1993 International Symposium on Electronic Imaging: Science & Technology,, Conference 1908, Storage and Retrieval for Image and Video Databases, February 1993.
J. Martin, A. Pentland, and R. Kikinis “Shape Analysis of Brain Structures using Physical and Experimental Modes,” IEEE Conference on Computer Vision and Pattern Recognition, pp. 752–755, Seattle, WA., June 1994.
A. Pentland and S. Sclaroff “Closed-Form Solutions For Physically Based Shape Modeling and Recognition.” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 13, No. 7, pp. 715–730.
A. Pentland, R. Picard, G. Davenport, R. Welsh, “The BT/MIT Project on Advanced Image Tools for Telecommunications: An Overview,” ImageCom ′93, 2nd International Conference on Image Communications, Bordeaux, France, 23–25 March, 1993.
A. Pentland, B. Moggadam, and T. Starner, “View-Based and Modular Eigenspaces for Face Recognition,” IEEE Conf Computer Vision and Pattern Recognition, pp. 84–90, Seattle, WA, June 1994
R. W. Picard “Random Field Texture Coding,” Society for Information Display International Symposium Digest, Vol XXIII, May 1992, pages 685–688.
R. W. Picard and M. Gorkani. “Finding perceptually dominant orientations in natural textures.” Spatial Vision Vol. 8, No. 2, pp. 221–253, 1994.
R. W. Picard and T. Kabir. “Finding similar patterns in large image databases.” Proc. ICASSP, Minneapolis, MN, Vol. V, pp. 161–164, 1993.
R. W. Picard and F. Liu, “A new Wold ordering for image similarity,” IEEE Conf on ASSP, Adelaide, Australia, April, 1994.
R. W. Picard and T. P. Minka, “Vision Texture for Annotation” ACM/Springer-Verlag Journal of Multimedia Systems, to appear.
A. R. Rao and G. L. Lohse, “Towards a Texture Naming System: Identifying Relevant Dimensions of Texture,” IEEE Conf on Visualization 1993, San Jose, CA.
L. Sirovich, and M. Kirby, “Low-dimensional procedure for the characterization of human faces,” J. Opt. Soc. Am. A, Vol. 4, No. 3, March 1987, 519–524.
S. Sclaroff and A. Pentland, “A finite-element framework for correspondence and matching,” 4th International Conference on Computer Vision, pp. 308–313, May 11–14, 1993, Berlin, Germany.
S. Sclaroff and A. Pentland, “Modal Matching for Correspondence and Recognition,” IEEE Trans. Pattern Analysis and Machine Intelligence, to appear. Also available as: M.I.T. Media Laboratory Perceptual Computing Technical Note No. 304.
R. Sriram, J. M. Francos and W. A. Pearlman, “Texture coding Using a Wold Decomposition Model,” sl Proc. 12th IAPR Int. Conf. Pat. Rec, Jerusalem, Israel, Oct. 1994.
M. Swain and D. Ballard, “Color indexing”. Int. J. of Computer Vision, 7(1):11–32, 1991.
S. Tanaka, M. Shima, J. Shibayama, and A. Maeda. “Retrieval method for an image database based on topographical structure.” In Applic. of Digital Image Processing, Vol. 1153, pages 318–327. SPIE, 1989.
Discrete Random Signals and Statistical Signal Processing, C. W. Therrien, Prentice-Hall, Englewood Cliffs, NJ 1992.
M. Turk and A. Pentland, “Eigenfaces for Recognition”, Journal of Cognitive Neuroscience, May 1991.
K. Wakimoto, M. Shima, S. Tanaka, and A. Maeda. “An intelligent user interface to an image database using a figure interpretation method.” In 9th Int. Conference on Pattern Recognition, volume 2, pages 516–991, 1990.
J. Y. A. Wang and E. H. Adelson, “Layered Representation for Motion Analysis” IEEE CVPR ′93. Longer version available as: M.I.T. Media Laboratory Perceptual Computing Technical Report No. 228.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1996 Kluwer Academic Publishers
About this chapter
Cite this chapter
Pentland, A., Picard, R.W., Sclaroff, S. (1996). Photobook: Content-Based Manipulation of Image Databases. In: Furht, B. (eds) Multimedia Tools and Applications. The Kluwer International Series in Engineering and Computer Science, vol 359. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1387-8_2
Download citation
DOI: https://doi.org/10.1007/978-1-4613-1387-8_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8600-4
Online ISBN: 978-1-4613-1387-8
eBook Packages: Springer Book Archive