3D Object Detection Based on Vanishing Point and Prior Orientation

Gao, Yongbin; Zhao, Huaqing; Fang, Zhijun; Huang, Bo; Zhong, Cengsi

doi:10.1007/s11859-019-1408-4

3D Object Detection Based on Vanishing Point and Prior Orientation

Computer Science
Published: 12 September 2019

Volume 24, pages 369–375, (2019)
Cite this article

Wuhan University Journal of Natural Sciences

Yongbin Gao¹,
Huaqing Zhao¹,
Zhijun Fang¹,
Bo Huang¹ &
…
Cengsi Zhong¹

77 Accesses
2 Citations
Explore all metrics

Abstract

3D object detection is one of the most challenging research tasks in computer vision. In order to solve the problem of template information dependency of 3D object proposal in the method of 3D object detection based on 2.5D information, we proposed a 3D object detector based on fusion of vanishing point and prior orientation, which estimates an accurate 3D proposal from 2.5D data, and provides an excellent start point for 3D object classification and localization. The algorithm first calculates three mutually orthogonal vanishing points by the Euler angle principle and projects them into the pixel coordinate system. Then, the top edge of the 2D proposal is sampled by the preset sampling pitch, and the first one vertex is taken. Finally, the remaining seven vertices of the 3D proposal are calculated according to the linear relationship between the three vanishing points and the vertices, and the complete information of the 3D proposal is obtained. The experimental results show that this proposed method improves the Mean Average Precision score by 2.7% based on the Amodal3Det method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Method for Estimating the Camera Parameters Based on Vanishing Points

Lifting 2D Object Detections to 3D: A Geometric Approach in Multiple Views

Pedestrian and Vehicle Detection and Tracking with Object-Driven Vanishing Line Estimation

References

Ross G, Jeff D, Trevor D, et al. Rich feature hierarchies for accurate object detection and semantic segmentation [C]// 2014 Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2014: 580–587.
Google Scholar
Girshick R. Fast R-CNN [C]// 2015 IEEE International Conference on Computer Vision. Piscataway: IEEE, 2015: 1440–1448.
Google Scholar
Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2017, 39(6): 1137–1149.
Article Google Scholar
Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 779–788.
Google Scholar
Redmon J, Farhadi A. YOLO9000: Better, faster, stronger [C]//2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 6517–6525.
Google Scholar
Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector [C]// European Conference on Computer Vision. Heidelberg: Springer-Verlag, 2016: 21–37.
Google Scholar
Bao Z, Lyu C. Real-time hand gesture recognition based on Kinect [J]. Progress in Laser and Optoelectronics, 2018, 55(03): 031008.
Article Google Scholar
Gupta S, Arbelaez P, Malik J. Perceptual organization and recognition of indoor scenes from RGB-D images [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2013: 564–571.
Google Scholar
Bo L, Ren X, Fox D. Unsupervised feature learning for RGB-D based object recognition [C]// Experimental Robotics. Heidelberg: Springer-Verlag, 2013: 387–402.
Chapter Google Scholar
Bo L, Ren X, Fox D. Learning hierarchical sparse features for RGB-D object recognition [J]. The International Journal of Robotics Research, 2014, 33(4): 581–599.
Article Google Scholar
Socher R, Huval B, Bath B, et al. Convolutional-recursive deep learning for 3D object classification [C]// Advances in Neural Information Processing Systems. 2012: 656–664.
Gupta S, Girshick R, Arbeláez P, et al. Learning rich features from RGB-D images for object detection and segmentation [C]// European Conference on Computer Vision. Heidelberg: Springer-Verlag, 2014: 345–360.
Google Scholar
Gupta S, Arbeláez P, Girshick R, et al. Aligning 3D models to RGB-D images of cluttered scenes [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2015: 4731–4740.
Google Scholar
Gupta S, Hoffman J, Malik J. Cross modal distillation for supervision transfer [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 2827–2836.
Google Scholar
Su H, Maji S, Kalogerakis E, et al. Multi-view convolutional neural networks for 3D shape recognition [C]// Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE, 2015: 945–953.
Google Scholar
Deng Z, Jan L. Amodal detection of 3D objects: Inferring 3D bounding boxed from 2D ones in RGB-depth images [C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2017: 1063–6919.
Google Scholar
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition [EB/OL]. [2018-09-04]. https://arxiv.org/abs/1409.1556.
Fang Z, Zhao H, Gao Y. Prior direction angle estimation in 3D object detection [J]. Transducer and Microsystem Technologies, 2019, 38(6): 35–38.
CAS Google Scholar
Song S R, Xiao J X. Sliding shapes for 3D object detection in depth images [C]//2014 European Conference on Computer Vision. Heidelberg: Springer-Verlag, 2014: 634–651.
Google Scholar
Song S R, Xiao J X. Deep sliding shapes for amodal 3D object detection in RGB-D images [J]. Computer Science, 2015, 139(2): 808–816.
Google Scholar
Ren Z, Sudderth E B. Three-dimensional object detection and layout prediction using clouds of oriented gradients [C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, 2016: 1525–1533.
Google Scholar
Slabaugh G G. Computing Euler angles from a rotation matrix [J]. Retrieved on August, 1999, 6(2000): 39–63.
Google Scholar
Yang S, Scherer S. CubeSLAM: Monocular 3D object detection and SLAM without prior models [EB/OL]. [2018-06-01]. https://arxiv.org/abs/1806.00557.

Download references

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, Shanghai University of Engineering Science, Shanghai, 201620, China
Yongbin Gao, Huaqing Zhao, Zhijun Fang, Bo Huang & Cengsi Zhong

Authors

Yongbin Gao
View author publications
You can also search for this author in PubMed Google Scholar
Huaqing Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhijun Fang
View author publications
You can also search for this author in PubMed Google Scholar
Bo Huang
View author publications
You can also search for this author in PubMed Google Scholar
Cengsi Zhong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhijun Fang.

Additional information

Foundation item: Supported by the National Natural Science Foundation of China (61772328, 61802253, 61831018)

Biography: GAO Yongbin, male, Ph. D., Associate professor, research direction: machine learning, computer vision.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gao, Y., Zhao, H., Fang, Z. et al. 3D Object Detection Based on Vanishing Point and Prior Orientation. Wuhan Univ. J. Nat. Sci. 24, 369–375 (2019). https://doi.org/10.1007/s11859-019-1408-4

Download citation

Received: 26 September 2018
Published: 12 September 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s11859-019-1408-4

Key words

CLC number

TP 391.4

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

3D Object Detection Based on Vanishing Point and Prior Orientation

Abstract

Access this article

Similar content being viewed by others

A Method for Estimating the Camera Parameters Based on Vanishing Points

Lifting 2D Object Detections to 3D: A Geometric Approach in Multiple Views

Pedestrian and Vehicle Detection and Tracking with Object-Driven Vanishing Line Estimation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

CLC number

Navigation

3D Object Detection Based on Vanishing Point and Prior Orientation

Abstract

Access this article

Similar content being viewed by others

A Method for Estimating the Camera Parameters Based on Vanishing Points

Lifting 2D Object Detections to 3D: A Geometric Approach in Multiple Views

Pedestrian and Vehicle Detection and Tracking with Object-Driven Vanishing Line Estimation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

CLC number

Search

Navigation