3D Reconstruction of Dynamic Scenes with Multiple Handheld Cameras

Jiang, Hanqing; Liu, Haomin; Tan, Ping; Zhang, Guofeng; Bao, Hujun

doi:10.1007/978-3-642-33709-3_43


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7573)

Included in the following conference series:

European Conference on Computer Vision


Abstract

Accurate dense 3D reconstruction of dynamic scenes from natural images is still very challenging. Most previous methods rely on a large number of fixed cameras to obtain good results. Some of these methods further require separating static and dynamic points, which usually restricts them to scenes with a known background. We propose a novel dense depth estimation method that automatically recovers accurate and consistent depth maps from synchronized video sequences taken by a few handheld cameras. Unlike fixed camera arrays, our data capturing setup is much more flexible and easier to use. Our algorithm simultaneously solves bilayer segmentation and depth estimation in a unified energy minimization framework, which combines different spatio-temporal constraints for effective depth optimization and segmentation of static and dynamic points. A variety of examples demonstrate the effectiveness of the proposed framework.
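
The abstract does not state the energy function or the optimizer, so the following is only a toy sketch of what minimizing a single energy over both a static/dynamic label and a depth label can look like. It assumes a 1D chain of pixels, a hypothetical cost volume `cost[pixel, layer, depth]`, and hypothetical smoothness weights `lam_depth` and `lam_layer`; it uses exact dynamic programming (Viterbi) on the chain as a stand-in, which is not claimed to be the paper's method.

```python
import numpy as np

def joint_energy(cost, layers, depths, lam_depth=1.0, lam_layer=2.0):
    """Energy of one joint (layer, depth) labeling on a 1D chain of pixels."""
    idx = np.arange(len(layers))
    data = cost[idx, layers, depths].sum()            # per-pixel data term
    smooth = (lam_depth * np.abs(np.diff(depths)).sum()       # depth smoothness
              + lam_layer * (np.diff(layers) != 0).sum())     # layer (Potts) smoothness
    return float(data + smooth)

def minimize_chain(cost, lam_depth=1.0, lam_layer=2.0):
    """Exactly minimize joint_energy over all labelings with dynamic programming."""
    n, n_layers, n_depths = cost.shape
    n_labels = n_layers * n_depths
    # Flatten (layer, depth) into one joint label k = layer * n_depths + depth.
    unary = cost.reshape(n, n_labels)
    lay, dep = np.divmod(np.arange(n_labels), n_depths)
    # Pairwise cost between joint labels of neighbouring pixels.
    pair = (lam_depth * np.abs(dep[:, None] - dep[None, :])
            + lam_layer * (lay[:, None] != lay[None, :]))
    best = unary[0].copy()                     # best cost of a prefix ending in each label
    back = np.zeros((n, n_labels), dtype=int)  # backpointers for recovering the labeling
    for i in range(1, n):
        total = best[:, None] + pair           # total[prev_label, cur_label]
        back[i] = total.argmin(axis=0)
        best = total.min(axis=0) + unary[i]
    labels = np.empty(n, dtype=int)
    labels[-1] = best.argmin()
    for i in range(n - 1, 0, -1):
        labels[i - 1] = back[i, labels[i]]
    return lay[labels], dep[labels], float(best.min())

# Usage on a random (made-up) cost volume: 6 pixels, 2 layers, 4 depth levels.
rng = np.random.default_rng(0)
layers, depths, energy = minimize_chain(rng.random((6, 2, 4)))
```

Because the layer index selects which slice of the cost volume a pixel's depth is scored against, the two label sets cannot be optimized independently in this sketch, which is the sense in which the minimization is joint.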



Author information

Authors and Affiliations

State Key Lab of CAD&CG, Zhejiang University, China
Hanqing Jiang, Haomin Liu, Guofeng Zhang & Hujun Bao
National University of Singapore, Singapore
Hanqing Jiang & Ping Tan


Editor information

Editors and Affiliations

Microsoft Research Ltd, CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid


About this paper

Cite this paper

Jiang, H., Liu, H., Tan, P., Zhang, G., Bao, H. (2012). 3D Reconstruction of Dynamic Scenes with Multiple Handheld Cameras. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33709-3_43


DOI: https://doi.org/10.1007/978-3-642-33709-3_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33708-6
Online ISBN: 978-3-642-33709-3
eBook Packages: Computer Science, Computer Science (R0)
