Abstract
We present a new structure from motion (SfM) technique based on point and vanishing point (VP) matches in images. First, all global camera rotations are computed from VP matches and from relative rotation estimates obtained from pairwise image matches. A new multi-stage linear technique is then used to estimate all camera translations and 3D points simultaneously. The method first performs pairwise reconstructions, then robustly aligns these in pairs, and finally aligns all of them globally by simultaneously estimating their unknown relative scales and translations. In doing so, measurements that are inconsistent across three views are efficiently removed. Unlike sequential SfM, the proposed method treats all images equally, is easy to parallelize, and does not require intermediate bundle adjustments. It also reduces drift and achieves speedups of up to two orders of magnitude over sequential SfM. We compare our method with a standard SfM pipeline [1] and demonstrate that our linear estimates are accurate on a variety of datasets and can serve as good initializations for a final bundle adjustment. Because we exploit VPs when available, our approach is particularly well suited to the reconstruction of man-made scenes.
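The final global alignment step can be illustrated with a small sketch. Once all rotations are known, each partial reconstruction k differs from the global frame only by an unknown scale s_k and translation t_k, so a point seen in two reconstructions a and b gives the linear constraint s_a x_a + t_a = s_b x_b + t_b. The code below is a minimal illustration of that idea under stated assumptions, not the authors' formulation: the function name `align_reconstructions`, the dictionary input format, and the gauge choice (s_0 = 1, t_0 = 0) are all hypothetical, and no robust outlier handling is included.

```python
import numpy as np

def align_reconstructions(points):
    """Estimate per-reconstruction scale and translation by linear least squares.

    points: list of dicts {point_id: 3-vector}, one dict per partial
            reconstruction, all already expressed in a common rotation frame
            (assumption: rotations were solved beforehand).
    Returns (scales, translations) with shapes (K,) and (K, 3), where
    reconstruction 0 fixes the gauge: scale 1, translation 0.
    """
    K = len(points)
    if K == 1:
        return np.ones(1), np.zeros((1, 3))

    n_unknowns = 4 * (K - 1)  # [s_1..s_{K-1}, t_1 (3), ..., t_{K-1} (3)]
    rows, rhs = [], []
    for a in range(K):
        for b in range(a + 1, K):
            # Points reconstructed in both a and b tie the two frames together.
            for pid in points[a].keys() & points[b].keys():
                xa = np.asarray(points[a][pid], dtype=float)
                xb = np.asarray(points[b][pid], dtype=float)
                for axis in range(3):
                    # Constraint: s_a*xa + t_a - s_b*xb - t_b = 0 (per axis).
                    row = np.zeros(n_unknowns)
                    r = 0.0
                    if a == 0:
                        r = -xa[axis]  # known term (s_0 = 1, t_0 = 0) moves to the right
                    else:
                        row[a - 1] = xa[axis]
                        row[(K - 1) + 3 * (a - 1) + axis] = 1.0
                    row[b - 1] = -xb[axis]
                    row[(K - 1) + 3 * (b - 1) + axis] = -1.0
                    rows.append(row)
                    rhs.append(r)

    sol, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    scales = np.concatenate([[1.0], sol[:K - 1]])
    translations = np.vstack([np.zeros(3), sol[K - 1:].reshape(K - 1, 3)])
    return scales, translations
```

A global point estimate then follows as s_k x + t_k for any reconstruction k that contains it. Because every unknown enters linearly, all scales and translations are solved in one least-squares problem, with no ordering of the images.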
References
Snavely, N.: Bundler (2007), http://phototour.cs.washington.edu/bundler/
Schaffalitzky, F., Zisserman, A.: Multi-view Matching for Unordered Image Sets, or How Do I Organize My Holiday Snaps? In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part I. LNCS, vol. 2350, pp. 414–431. Springer, Heidelberg (2002)
Pollefeys, M., Van Gool, L.J., Vergauwen, M., Verbiest, F., Cornelis, K., Tops, J., Koch, R.: Visual modeling with a hand-held camera. IJCV 59, 207–232 (2004)
Snavely, N., Seitz, S.M., Szeliski, R.: Photo Tourism: exploring photo collections in 3D. ACM Trans. Graph. 25, 835–846 (2006)
Agarwal, S., Snavely, N., Simon, I., Seitz, S.M., Szeliski, R.: Building Rome in a Day. In: ICCV (2009)
Snavely, N., Seitz, S.M., Szeliski, R.: Skeletal graphs for efficient structure from motion. In: CVPR, pp. 1–8 (2008)
Fitzgibbon, A.W., Zisserman, A.: Automatic Camera Recovery for Closed or Open Image Sequences. In: Burkhardt, H.-J., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1406, pp. 311–326. Springer, Heidelberg (1998)
Gherardi, R., Farenzena, M., Fusiello, A.: Improving the efficiency of hierarchical structure-and-motion. In: CVPR (2010)
Triggs, B., McLauchlan, P.F., Hartley, R.I., Fitzgibbon, A.W.: Bundle Adjustment – A Modern Synthesis. In: Triggs, B., Zisserman, A., Szeliski, R. (eds.) ICCV-WS 1999. LNCS, vol. 1883, pp. 298–372. Springer, Heidelberg (2000)
Antone, M., Teller, S.: Scalable extrinsic calibration of omnidirectional image networks. IJCV 49, 143–174 (2002)
Rother, C.: Linear multi-view reconstruction of points, lines, planes and cameras using a reference plane. In: ICCV, pp. 1210–1217 (2003)
Brand, M., Antone, M., Teller, S.: Spectral Solution of Large-Scale Extrinsic Camera Calibration as a Graph Embedding Problem. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3022, pp. 262–273. Springer, Heidelberg (2004)
Schindler, G., Krishnamurthy, P., Dellaert, F.: Line-based structure from motion for urban environments. In: 3DPVT, pp. 846–853 (2006)
Govindu, V.M.: Lie-algebraic averaging for globally consistent motion estimation. In: CVPR, vol. 1, pp. 684–691 (2004)
Uyttendaele, M., Criminisi, A., Kang, S.B., Winder, S.A.J., Szeliski, R., Hartley, R.I.: Image-based interactive exploration of real-world environments. IEEE Computer Graphics and Applications 24, 52–63 (2004)
Martinec, D., Pajdla, T.: Robust rotation and translation estimation in multiview reconstruction. In: CVPR (2007)
Sturm, P., Triggs, B.: A Factorization Based Algorithm for Multi-Image Projective Structure and Motion. In: Buxton, B.F., Cipolla, R. (eds.) ECCV 1996. LNCS, vol. 1065, pp. 709–720. Springer, Heidelberg (1996)
Tardif, J.-P., Bartoli, A., Trudeau, M., Guilbert, N., Roy, S.: Algorithms for batch matrix factorization with application to structure-from-motion. In: CVPR (2007)
Hartley, R.I., Kaucic, R., Dano, N.Y.: Plane-based projective reconstruction. In: ICCV (2001)
Sim, K., Hartley, R.: Removing outliers using the L∞ norm. In: CVPR (2006)
Agarwal, S., Snavely, N., Seitz, S.M.: Fast algorithms for L∞ problems in multiview geometry. In: CVPR (2008)
Kahl, F., Hartley, R.I.: Multiple-view geometry under the L∞-norm. PAMI 30, 1603–1617 (2008)
Bartoli, A., Sturm, P.F.: Constrained structure and motion from multiple uncalibrated views of a piecewise planar scene. IJCV 52, 45–64 (2003)
Caprile, B., Torre, V.: Using vanishing points for camera calibration. IJCV 4, 127–140 (1990)
Winder, S., Hua, G., Brown, M.: Picking the best DAISY. In: CVPR (2009)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60, 91–110 (2004)
Fischler, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM 24, 381–395 (1981)
Bay, H., Ferrari, V., Gool, L.V.: Wide-baseline stereo matching with line segments. In: CVPR, vol. 1, pp. 329–336 (2005)
Torr, P.H.S., Zisserman, A.: MLESAC: a new robust estimator with application to estimating image geometry. CVIU 78, 138–156 (2000)
Niemann, T.: PTLens (2009), http://epaperpress.com/ptlens
Strecha, C., von Hansen, W., Gool, L.V., Fua, P., Thoennessen, U.: On benchmarking camera calibration and multi-view stereo. In: CVPR (2008)
Havlena, M., Torii, A., Knopp, J., Pajdla, T.: Randomized structure from motion based on atomic 3d models from camera triplets. In: CVPR, pp. 2874–2881 (2009)
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sinha, S.N., Steedly, D., Szeliski, R. (2012). A Multi-stage Linear Approach to Structure from Motion. In: Kutulakos, K.N. (eds) Trends and Topics in Computer Vision. ECCV 2010. Lecture Notes in Computer Science, vol 6554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35740-4_21
DOI: https://doi.org/10.1007/978-3-642-35740-4_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35739-8
Online ISBN: 978-3-642-35740-4
eBook Packages: Computer Science (R0)