Human-Centric Indoor Environment Modeling from Depth Videos

Lu, Jiwen; Wang, Gang

doi:10.1007/978-3-642-33868-7_5


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7584)

Included in the following conference series:

European Conference on Computer Vision


Abstract

We propose an approach to modeling indoor environments from depth videos recorded with a stationary camera. The approach extracts the 3-D spatial layout of a room and models the objects in it as 3-D cuboids. Unlike previous work, which relies purely on image appearance, we argue that indoor environment modeling should be human-centric: humans are an important part of indoor environments, and the interaction between humans and their environment conveys much useful information about that environment. In this paper, we develop an approach that extracts physical constraints from human poses and motion to better recover the spatial layout and model the objects inside. We observe that the cues provided by human-environment interaction are very powerful: even with little training data, our method achieves promising performance. Because our approach is built on depth videos, it is also more user friendly.
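
The abstract does not spell out how the physical constraints are computed, so the following is only an illustrative sketch of one constraint of this kind, not the authors' algorithm. It assumes that body joints tracked from the depth video are already expressed in room coordinates (metres) and that object hypotheses are axis-aligned cuboids. Since a person cannot occupy the same space as a solid object, any cuboid hypothesis whose interior contains an observed joint is rejected. The names `Cuboid`, `violates_free_space`, `filter_hypotheses`, and the `margin` parameter are hypothetical and chosen for this example.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple

# (x, y, z) in metres, in an assumed room-aligned coordinate frame.
Point = Tuple[float, float, float]


@dataclass(frozen=True)
class Cuboid:
    """Axis-aligned box given by its minimum and maximum corners (assumption)."""
    lo: Point
    hi: Point

    def contains(self, p: Point, margin: float = 0.0) -> bool:
        # A point counts as inside if every coordinate lies strictly within
        # the box after shrinking the box by `margin` on each side.
        return all(l + margin < c < h - margin
                   for l, c, h in zip(self.lo, p, self.hi))


def violates_free_space(box: Cuboid, joints: Iterable[Point],
                        margin: float = 0.05) -> bool:
    """Return True if any observed body joint falls inside the box interior."""
    return any(box.contains(j, margin) for j in joints)


def filter_hypotheses(boxes: Iterable[Cuboid], joints: Iterable[Point],
                      margin: float = 0.05) -> List[Cuboid]:
    """Keep only the cuboid hypotheses that no observed joint has entered."""
    joint_list = list(joints)  # allow a generator to be reused per box
    return [b for b in boxes
            if not violates_free_space(b, joint_list, margin)]


if __name__ == "__main__":
    # Two joints of a standing person (for example head and foot), made up
    # for illustration.
    joints = [(1.0, 0.9, 2.0), (1.0, 0.1, 2.0)]
    boxes = [
        Cuboid((0.5, 0.0, 1.5), (1.5, 1.0, 2.5)),  # overlaps the person
        Cuboid((2.0, 0.0, 1.0), (3.0, 0.8, 2.0)),  # clear of the person
    ]
    for kept in filter_hypotheses(boxes, joints):
        print(kept)
```

The `margin` tolerance is a design choice for this sketch: it shrinks each box slightly so that joints touching a surface, such as a hand resting on a table top, do not rule out the object beneath them.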



Author information

Authors and Affiliations

Advanced Digital Sciences Center, Singapore
Jiwen Lu & Gang Wang
Nanyang Technological University, Singapore
Gang Wang


Editor information

Editors and Affiliations

Dipartimento di Ingegneria Elettrica, Gestionale e Meccanica (DIEGM), Università degli Studi di Udine, Via delle Scienze, 208, 33100, Udine, Italy
Andrea Fusiello
IIT Istituto Italiano di Tecnologia, Via Morego 30, 16163, Genoa, Italy
Vittorio Murino
Dipartimento di Ingegneria dell’Informazione, Università degli Studi di Modena e Reggio Emilia, Strada Vignolese, 905, 41125, Modena, Italy
Rita Cucchiara


About this paper

Cite this paper

Lu, J., Wang, G. (2012). Human-Centric Indoor Environment Modeling from Depth Videos. In: Fusiello, A., Murino, V., Cucchiara, R. (eds) Computer Vision – ECCV 2012. Workshops and Demonstrations. ECCV 2012. Lecture Notes in Computer Science, vol 7584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33868-7_5


DOI: https://doi.org/10.1007/978-3-642-33868-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33867-0
Online ISBN: 978-3-642-33868-7
eBook Packages: Computer Science, Computer Science (R0)
