Non-Euclidean object representations for calibration-free video overlay

Kutulakos, Kiriakos N.; Vallino, James R.

doi:10.1007/3-540-61750-7_38

Kiriakos N. Kutulakos¹ &
James R. Vallino¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1144))

Included in the following conference series:

International Workshop on Object Representation in Computer Vision

148 Accesses
2 Citations

Abstract

We show that the overlay of 3D graphical objects onto live video taken by a mobile camera can be considerably simplified when the camera, the camera's environment and the graphical objects are represented in an affine frame of reference. The key feature of the approach is that it does not use any metric information about the calibration parameters of the camera, the position of the user interacting with the system, or the 3D locations and dimensions of the environment's objects. The only requirement is the ability to track across frames at least four features (points or lines) that are specified by the user at system initialization time and whose world coordinates are unknown. Our approach is based on the following observation: Given a set of four or more non-coplanar 3D points, the projection of all points in the set can be computed as a linear combination of the projections of just four of the points. We exploit this observation by (1) tracking lines and feature points at frame rate, and (2) representing graphical objects in an affine frame of reference that allows the projection of virtual objects to be computed as a linear combination of the projection of the feature points.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

W. Grimson et al., “An automatic registration method for frameless stereotaxy, image guided surgery, and enhanced reality visualization,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 430–436, 1994.
Google Scholar
M. Uenohara and T. Kanade, “Vision-based object registration for real-time image overlay,” in Proc. CVRMED'95, pp. 14–22, 1995.
Google Scholar
M. Bajura, H. Fuchs, and R. Ohbuchi, “Merging virtual objects with the real world: Seeing ultrasound imagery within the patient,” in Proc. SIGGRAPH'92, pp. 203–210, 1992.
Google Scholar
S. Feiner, B. MacIntyre, and D. Soligmann, “Knowledge-based augmented reality,” Comm. of the ACM, vol. 36, no. 7, pp. 53–62, 1993.
Google Scholar
T. Darrell, P. Maes, B. Blumberg, and A. P. Pentland, “A novel environment for situated vision and action,” in IEEE Workshop on Visual Behaviors, pp. 68–72, 1994.
Google Scholar
M. M. Wloka and B. G. Anderson, “Resolving occlusion in augmented reality,” in Proc. Symposium on Interactive 3D Graphics, pp. 5–12, 1995.
Google Scholar
M. Tuceyran et al., “Calibration requirements and procedures for a monitor-based augmented reality system,” IEEE Trans. Visualization and Computer Graphics, vol. 1, no. 3, pp. 255–273, 1995.
Google Scholar
J. Mellor, “Enhanced reality visualization in a surgical environment,” Master's thesis, Massachusetts Institute of Technology, 1995.
Google Scholar
M. Bajura and U. Neumann, “Dynamic registration correction in video-based augmented reality systems,” IEEE Computer Graphics and Applications, vol. 15, no. 5, pp. 52–60, 1995.
Google Scholar
D. G. Lowe, “Robust model-based tracking through the integration of search and estimation,” Int. J. Computer Vision, vol. 8, no. 2, pp. 113–122, 1992.
Google Scholar
S. Ravela, B. Draper, et al., “Adaptive tracking and model registration across distinct aspects,” in Proc. 1995 IEEE/RSJ Int. Conf. Intelligent Robotics and Systems, pp. 174–180, 1995.
Google Scholar
L. S. Shapiro, A. Zisserman, and M. Brady, “3D motion recovery via affine epipolar geometry,” Int. J. Computer Vision, vol. 16, no. 2, pp. 147–182, 1995.
Google Scholar
J. J. Koenderink and A. J. van Doorn, “Affine structure from motion,” J. Opt. Soc. Am., vol. A, no. 2, pp. 377–385, 1991.
Google Scholar
G. D. Hager, “Calibration-free visual control using projective invariance,” in Proc. 5th Int. Conf. Computer Vision, 1995.
Google Scholar
R. Cipolla, P. A. Hadfield, and N. J. Hollinghurst, “Uncalibrated stereo vision with pointing for a man-machine interface,” in Proc. IAPR Workshop on Machine Vision Applications, 1994.
Google Scholar
A. Azarbayejani, T. Starner, B. Horowitz, and A. Pentland, “Visually controlled graphics,” IEEE Trans. Pattern Anal. Machine Intell., vol. 15, no. 6, pp. 602–605, 1993.
Google Scholar
A. Shashua, “A geometric invariant for visual recognition and 3D reconstruction from two perspective/orthographic views,” in Proc. IEEE Workshop on Qualitative Vision, pp. 107–117, 1993.
Google Scholar
E. B. Barrett, M. H. Brill, N. N. Haag, and P. M. Payton, “Invariant linear methods in photogrammetry and model-matching,” in Geometric Invariance in Computer Vision, pp. 277–292, MIT Press, 1992.
Google Scholar
P. A. Beardsley, I. D. Reid, A. Zisserman, and D. W. Murray, “Active visual navigation using non-metric structure,” in Proc. 5th Int. Conf. Computer Vision, pp. 58–64, 1995.
Google Scholar
Y. Lamdan, J. T. Schwartz, and H. J. Wolfson, “Object recognition by affine invariant matching,” in Proc. Computer Vision and Pattern Recognition, pp. 335–344, 1988.
Google Scholar
J. D. Foley, A. van Dam, S. K. Feiner, and J. F. Hughes, Computer Graphics Principles and Practice. Addison-Wesley Publishing Co., 1990.
Google Scholar
Y. Bar-Shalom and T. E. Fortmann, Tracking and Data Association. Academic Press, 1988.
Google Scholar
M. Gleicher and A. Witkin, “Through-the-lens camera control,” in Proc. SIGGRAPH'92, pp. 331–340, 1992.
Google Scholar
J. L. Mundy and A. Zisserman, eds., Geometric Invariance in Computer Vision. MIT Press, 1992.
Google Scholar
D. Weinshall and C. Tomasi, “Linear and incremental acquisition of invariant shape models from image sequences,” in Proc. 4th Int. Conf. on Computer Vision, pp. 675–682, 1993.
Google Scholar
O. D. Faugeras, Three-Dimensional Computer Vision: A Geometric Viewpoint. MIT Press, 1993.
Google Scholar
S. M. Seitz and C. R. Dyer, “Complete scene structure from four point correspondences,” in Proc. 5th Int. Conf. on Computer Vision, pp. 330–337, 1995.
Google Scholar
A. Blake and A. Yuille, eds., Active Vision. MIT Press, 1992.
Google Scholar
C. M. Brown and D. Terzopoulos, eds., Real-Time Computer Vision. Cambridge University Press, 1994.
Google Scholar
A. Blake and M. Isard, “3D position, attitude and shape input using video tracking of hands and lips,” in ACM SIGGRAPH'94, pp. 185–192, 1994.
Google Scholar
K. Toyama and G. D. Hager, “Incremental focus of attention for robust visual tracking,” in Proc. Computer Vision and Pattern Recognition, 1996. To appear.
Google Scholar
C. Harris, “Tracking with rigid models,” in Active Vision (A. Blake and A. Yuille, eds.), pp. 21–38, MIT Press, 1992.
Google Scholar
R. Horaud, F. Dornaika, B. Boufama, and R. Mohr, “Self calibration of a stereo head mounted onto a robot arm,” in Proc. 3rd European Conf. on Computer Vision, pp. 455–462, 1994.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, University of Rochester, 14627-0226, Rochester, NY
Kiriakos N. Kutulakos & James R. Vallino

Authors

Kiriakos N. Kutulakos
View author publications
You can also search for this author in PubMed Google Scholar
James R. Vallino
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Jean Ponce Andrew Zisserman Martial Hebert

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kutulakos, K.N., Vallino, J.R. (1996). Non-Euclidean object representations for calibration-free video overlay. In: Ponce, J., Zisserman, A., Hebert, M. (eds) Object Representation in Computer Vision II. ORCV 1996. Lecture Notes in Computer Science, vol 1144. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61750-7_38

Download citation

DOI: https://doi.org/10.1007/3-540-61750-7_38
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61750-1
Online ISBN: 978-3-540-70673-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics