Photobook: Content-Based Manipulation of Image Databases

Pentland, A.; Picard, R. W.; Sclaroff, S.

doi:10.1007/978-1-4613-1387-8_2

Part of the book series: The Kluwer International Series in Engineering and Computer Science (SECS, volume 359)

Abstract

We describe the Photobook system, a set of interactive tools for browsing and searching images and image sequences. These query tools differ from those used in standard image databases in that they make direct use of the image content rather than relying on text annotations. Direct search on image content is made possible by semantics-preserving image compression, which reduces images to a small set of perceptually significant coefficients. We discuss three types of Photobook descriptions in detail: one that allows search based on appearance, one that uses 2-D shape, and a third that allows search based on textural properties. These image content descriptions can be combined with each other and with text-based descriptions to provide a sophisticated browsing and search capability. In this paper we demonstrate Photobook on databases containing images of people, video keyframes, hand tools, fish, texture swatches, and 3-D medical data.
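
To make the idea of searching on a small set of coefficients concrete, here is a minimal sketch of an appearance-style search. It assumes a linear principal-component basis as a stand-in for the compression step; the abstract does not specify the encoding, so this is an illustration rather than the authors' method. The function names (fit_basis, encode, search) and the toy 4x4 "images" are hypothetical.

```python
import numpy as np


def fit_basis(images, k):
    """Learn a k-dimensional linear basis (PCA) from flattened images.

    images has shape (n_images, n_pixels).
    Returns (mean, basis), where basis has shape (k, n_pixels).
    """
    X = np.asarray(images, dtype=float)
    mean = X.mean(axis=0)
    # Rows of vt are orthonormal directions ordered by decreasing variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:k]


def encode(images, mean, basis):
    """Reduce each image to k coefficients by projecting onto the basis."""
    return (np.asarray(images, dtype=float) - mean) @ basis.T


def search(query_coeffs, db_coeffs, top_n=3):
    """Return indices of the top_n database entries nearest to the query."""
    dists = np.linalg.norm(db_coeffs - query_coeffs, axis=1)
    return np.argsort(dists, kind="stable")[:top_n]


# Toy data (an assumption for illustration): each 4x4 image is a blend of a
# horizontal and a vertical intensity ramp, flattened to 16 pixels.
ramp = np.array([-3.0, -1.0, 1.0, 3.0])
t1 = np.tile(ramp, 4)    # varies along each row
t2 = np.repeat(ramp, 4)  # varies along each column

weights = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0),
           (0.1, 0.9), (-1.0, 0.0), (0.0, -1.0)]
database = np.array([a * t1 + b * t2 for a, b in weights])

# Compress the database once: 16 pixels per image become 2 coefficients.
mean, basis = fit_basis(database, k=2)
db_coeffs = encode(database, mean, basis)

# Query by example with a mostly vertical ramp; matching uses coefficients only.
query = 0.02 * t1 + 0.95 * t2
hits = search(encode(query[None, :], mean, basis)[0], db_coeffs)
print(hits.tolist())  # [2, 3, 1]
```

The two nearest hits are the other mostly vertical ramps (entries 2 and 3). In this sketch, combining descriptions, as the abstract mentions, would amount to pre-filtering the candidate indices by a text annotation or summing weighted distances from several coefficient sets; that extension is not shown.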

Author information

Authors and Affiliations

Perceptual Computing Section, The Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
A. Pentland, R. W. Picard & S. Sclaroff
Computer Science Department, Boston University, USA
S. Sclaroff

Editor information

Editors and Affiliations

Florida Atlantic University, Boca Raton, Florida, USA
Borko Furht

About this chapter

Cite this chapter

Pentland, A., Picard, R.W., Sclaroff, S. (1996). Photobook: Content-Based Manipulation of Image Databases. In: Furht, B. (eds) Multimedia Tools and Applications. The Kluwer International Series in Engineering and Computer Science, vol 359. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1387-8_2

DOI: https://doi.org/10.1007/978-1-4613-1387-8_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8600-4
Online ISBN: 978-1-4613-1387-8
eBook Packages: Springer Book Archive
