Frame Decimation for Structure and Motion

Nistér, David

doi:10.1007/3-540-45296-6_2

David Nistér⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2018))

Included in the following conference series:

European Workshop on 3D Structure from Multiple Images of Large-Scale Environments

472 Accesses
19 Citations
3 Altmetric

Abstract

A frame decimation scheme is proposed that makes automatic extraction of Structure and Motion (SaM) from handheld sequences more practical. Decimation of the number of frames used for the actual SaM calculations keeps the size of the problem manageable, regardless of the input frame rate. The proposed preprocessor is based upon global motion estimation between frames and a sharpness measure. With these tools, shot boundary detection is first performed followed by the removal of redundant frames. The frame decimation makes it feasible to feed the system with a high frame rate, which in turn avoids loss of connectivity due to matching difficulties. A high input frame rate also enables robust automatic detection of shot boundaries. The development of the preprocessor was prompted by experience with a number of test sequences, acquired directly from a handheld camera. The preprocessor was tested on this material together with a SaM algorithm. The scheme is conceptually simple and still has clear benefits.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

P. Beardsley, P. Torr, A. Zisserman, 3D model acquisition from extended image sequences, Proc. ECCV 96, pp. 683–695.
Google Scholar
P. Beardsley, A. Zisserman, D. Murray, Sequential updating of projective and affine structure from motion, IJCV, 23(3), pp. 235–259, 1997.
Article Google Scholar
D. Capel, A. Zisserman, Automated mosaicing with super-resolution zoom, Proc. CVPR 98, pp. 885–891.
Google Scholar
R. Cipolla, E. G. Boyer, 3D model acquisition from uncalibrated images, Proc IAPR Workshop on Machine Vision Applications, Chiba Japan, pp. 559–568, November 1998.
Google Scholar
P. Debevec, C. Taylor, J. Malik, Modeling and rendering architecture from photographs: A hybrid geometry-and image-based approach, Proc. SIGGRAPH’96, pp. 11–20.
Google Scholar
O. Faugeras, What can be seen in three dimensions with an uncalibrated stereo rig?, Proc.ECCV 92, pp. 563–578.
Google Scholar
P. Fua, Reconstructing complex surfaces from multiple stereo views, Proc. ICCV 95, pp. 1078–1085.
Google Scholar
A. W. Fitzgibbon, A. Zisserman, Automatic camera recovery for closed or open image sequences, Proc. ECCV 98, pp. 311–326.
Google Scholar
K. Hanna, N. Okamoto, Combining stereo and motion for direct estimation of scene structure, Proc. ICCV 93, pp. 357–365.
Google Scholar
R. Hartley, Euclidean reconstruction from uncalibrated views, Applications of Invariance in Computer Vision, LNCS 825, pp. 237–256, Springer-Verlag, 1994.
Google Scholar
A. Heyden, K. Åström, Euclidean reconstruction from image sequences with varying and unknown focal length and principal point, Proc. CVPR 97, pp. 438–443.
Google Scholar
M. Irani, P. Anandan, M. Cohen, Direct recovery of planar-parallax from multiple frames, Proc. Vision Algorithms Workshop (ICCV 99), Corfu Greece, pp. 1–8, September 1999.
Google Scholar
K. Kanatani, N. Ohta, Accuracy bounds and optimal computation of homography for image mosaicing applications, Proc. ICCV 99, pp. 73–78.
Google Scholar
P. McLauchlan, D. Murray, A unifying framework for structure from motion recovery from image sequences, Proc. ICCV 95, pp. 314–320.
Google Scholar
R. Mohr, F. Veillon, L. Quan, Relative 3D reconstruction using multiple uncalibrated images, Proc. CVPR 93, pp. 543–548.
Google Scholar
D. Nistér, Reconstruction from uncalibrated sequences with a hierarchy of trifocal tensors, Accepted to ECCV 2000.
Google Scholar
S. Peleg, J. Herman, Panoramic mosaics by manifold projection, Proc. CVPR 97, pp. 338–343.
Google Scholar
M. Pollefeys, R. Koch, L. Van Gool, Self-calibration and metric reconstruction in spite of varying and unknown internal camera parameters, IJCV, 32(1), pp. 7–26, Aug, 1999.
Article Google Scholar
W. Press, S. Teukolsky, W. Vetterling, B. Flannery, Numerical recipes in C, ISBN 0-521-43108-5, Cambridge University Press, 1988.
Google Scholar
L. Robert, O. Faugeras, Relative 3D positioning and 3D convex hull computation from a weakly calibrated stereo pair, Proc. ICCV 93, pp. 540–544.
Google Scholar
H. Sawhney, S. Hsu, R. Kumar, “Robust video mosaicing through topology inference and local to global alignment”, Proc. ECCV 98, pp.103–119.
Google Scholar
A. Shashua, Trilinearity in visual recognition by alignment, Proc. ECCV 94, pp. 479–484.
Google Scholar
M. Spetsakis, J. Aloimonos, Structure from motion using line correspondences, IJCV, pp. 171–183, 1990.
Google Scholar
P. Sturm, W. Triggs, A factorization based algorithm for multi-image projective structure and motion, Proc. ECCV 96, pp. 709–720.
Google Scholar
R. Szeliski, H.-Y. Shum, Creating full view panoramic image mosaics and environment maps, Proc. SIGGRAPH’97, pp. 251–258.
Google Scholar
C. Tomasi, T. Kanade, Shape and motion from image streams under orthography: a factorization approach, IJCV, 9(2), pp. 137–154. November 1992.
Article Google Scholar
L. Van Gool, A. Zisserman, Automatic 3D model building from video sequences, Proc. ECMAST 96, pp. 563–582.
Google Scholar

References

B. Triggs. Plane + Parallax, Tensors and Factorization. In Proc. European Conference on Computer Vision, pages 522–538, 2000.
Google Scholar

Download references

Author information

Authors and Affiliations

Visual Technology, Ericsson Research, Ericsson Radio Systems, SE-164 80, Stockholm, SWEDEN
David Nistér

Authors

David Nistér
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Processing of Speech and Images, Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, 3001, Leuven-Heverlee, Belgium
Marc Pollefeys & Luc Van Gool &
Department of Engineering Science, University of Oxford, 19 Parks Road, Oxford OX1, 3PJ, UK
Andrew Zisserman & Andrew Fitzgibbon &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nistér, D. (2001). Frame Decimation for Structure and Motion. In: Pollefeys, M., Van Gool, L., Zisserman, A., Fitzgibbon, A. (eds) 3D Structure from Images — SMILE 2000. SMILE 2000. Lecture Notes in Computer Science, vol 2018. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45296-6_2

Download citation

DOI: https://doi.org/10.1007/3-540-45296-6_2
Published: 21 March 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41845-0
Online ISBN: 978-3-540-45296-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Frame Decimation for Structure and Motion

Abstract

Access this chapter

Preview

References

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation