A Dynamic Programming Approach to Maximizing Tracks for Structure from Motion

Mooser, Jonathan; You, Suya; Neumann, Ulrich; Grasset, Raphael; Billinghurst, Mark

doi:10.1007/978-3-642-12304-7_1

A Dynamic Programming Approach to Maximizing Tracks for Structure from Motion

Jonathan Mooser¹⁹,
Suya You¹⁹,
Ulrich Neumann¹⁹,
Raphael Grasset²⁰ &
…
Mark Billinghurst²⁰

Conference paper

2684 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5995))

Abstract

We present a novel algorithm for improving the accuracy of structure from motion on video sequences. Its goal is to efficiently recover scene structure and camera pose by using dynamic programming to maximize the lengths of putative keypoint tracks. By efficiently discarding poor correspondences while maintaining the largest possible set of inliers, it ultimately provides a robust and accurate scene reconstruction. Traditional outlier detection strategies, such as RANSAC and its derivatives, cannot handle high dimensional problems such as structure from motion over long image sequences. We prove that, given an estimate of the camera pose at a given frame, the outlier detection is optimal and runs in low order polynomial time. The algorithm is applied on-line, processing each frame in sequential order. Results are presented on several indoor and outdoor video sequences processed both with and without the proposed optimization. The improvement in average reprojection errors demonstrates its effectiveness.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hartley, R., Zisserman, A.: Multiple view geometry in computer vision. Cambridge University Press, New York (2003)
Google Scholar
Pollefeys, M., Van Gool, L., Vergauwen, M., Verbiest, F., Cornelis, K., Tops, J., Koch, R.: Visual modeling with a hand-held camera. International Journal of Computer Vision 59(3), 207–232 (2004)
Article Google Scholar
Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: exploring photo collections in 3d. In: SIGGRAPH 2006: ACM SIGGRAPH 2006 Papers, pp. 835–846 (2006)
Google Scholar
Fitzgibbon, A.W., Zisserman, A.: Automatic camera recovery for closed or open image sequences. In: Burkhardt, H.-J., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1406, pp. 311–326. Springer, Heidelberg (1998)
Chapter Google Scholar
Meltzer, J., Soatto, S.: Edge descriptors for robust wide-baseline correspondence (2008)
Google Scholar
Shi, J., Tomasi, C.: Good features to track. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 593–600 (1994)
Google Scholar
Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: International Joint Conference on Artificial Intelligence, pp. 674–679 (1981)
Google Scholar
Bouguet, J.Y.: Pyramidal implementation of the Lucas-Kanade feature tracker. OpenCV library (2001), http://sourceforge.net/projects/opencvlibrary
Mooser, J., Wang, Q., You, S., Neumann, U.: Fast simultaneous tracking and recognition by incremental keypoint matching. In: 3D Data Processing, Visualization and Transmission (2008)
Google Scholar
Kolmogorov, V., Zabih, R.: Multi-camera scene reconstruction via graph cuts. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 82–96. Springer, Heidelberg (2002)
Chapter Google Scholar
Gallup, D., Frahm, J.M., Mordohai, P., Yang, Q., Pollefeys, M.: Real-time plane-sweeping stereo with multiple sweeping directions, pp. 1–8 (2007)
Google Scholar
Zhu, Z., Oskiper, T., Samarasekera, S., Kumar, R., Sawhney, H.: Real-time global localization with a pre-built visual landmark database (2008)
Google Scholar
Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Article MathSciNet Google Scholar
Torr, P.H.S., Zisserman, A.: Mlesac: a new robust estimator with application to estimating image geometry. Computer Vision and Image Understanding 78(1), 138–156 (2000)
Article Google Scholar
Nistér, D.: Preemptive ransac for live structure and motion estimation. In: IEEE International Conference on Computer Vision, p. 199 (2003)
Google Scholar
Buchanan, A., Fitzgibbon, A.: Interactive feature tracking using k-d trees and dynamic programming. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 626–633 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

CGIT Lab, University of Southern California, Los Angeles, California
Jonathan Mooser, Suya You & Ulrich Neumann
HITLabNZ, University of Canterbury, Christchurch, New Zealand
Raphael Grasset & Mark Billinghurst

Authors

Jonathan Mooser
View author publications
You can also search for this author in PubMed Google Scholar
Suya You
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Neumann
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Grasset
View author publications
You can also search for this author in PubMed Google Scholar
Mark Billinghurst
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Machine Intelligence, Peking University, 100871, Beijing, China
Hongbin Zha
Department of Advanced Information Technology, Kyushu University, 819-0395, Fukuoka, Japan
Rin-ichiro Taniguchi
Department of Computer Science, University of London, Birkbeck College, WC1E 7HX, London, UK
Stephen Maybank

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mooser, J., You, S., Neumann, U., Grasset, R., Billinghurst, M. (2010). A Dynamic Programming Approach to Maximizing Tracks for Structure from Motion. In: Zha, H., Taniguchi, Ri., Maybank, S. (eds) Computer Vision – ACCV 2009. ACCV 2009. Lecture Notes in Computer Science, vol 5995. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12304-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-12304-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12303-0
Online ISBN: 978-3-642-12304-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics