Abstract
As they decrease in price and increase in fidelity, visually-textured 3D models offer a foundation for robotic spatial reasoning that can support a huge variety of platforms and tasks. This work investigates the capabilities, strengths, and drawbacks of a new sensor, the Matterport 3D camera, in the context of several robot applications. By using hierarchical 2D matching into a database of images rendered from a visually-textured 3D model, this work demonstrates that – when similar cameras are used – 2D matching into visually-textured 3D maps yields excellent performance on both global-localization and local-servoing tasks. When the 2D-matching spans very different camera transforms, however, we show that performance drops significantly. To handle this situation, we propose and prototype a map-alignment phase, in which several visual representations of the same spatial environment overlap: one to support the image-matching needed for visual localization, and the other carrying a global coordinate system needed for task accomplishment, e.g., point-to-point positioning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Matterport. matterport.com. Accessed 20 October 2015
Oliver, A., Kang, S., Wunsche, B.C., MacDonald, B.: Using the Kinect as a navigation sensor for mobile robotics. In: Proceedings of IVCNZ 2012, pp 509–514, Dunedin, New Zealand. ACM, NY
Olesk, A., Wang, J.: Geometric and error analysis for 3D map-matching. In: Proceedings of IGNSS Symposium, 14 p, Queensland, Australia (2009)
Pinto, M., Moreira, A.P., Matos, A., Sobreira, H., Santos, F.: Fast 3D map matching localisation algorithm. J. Autom. Control Eng. 1(2), 110–114 (2013)
Endres, F., Hess, J., Engelhard, N., Sturm, J.: In: Proceedings of ICRA 2012, pp. 1691–1696, 14–18 May 2012
Wu, C., Clipp, B., Li, X., Frahm, J.-M.: 3D model matching with viewpoint-invariant patches (VIP). In: Proceedings of CVPR 2008, pp. 1–8, 23–28 June 2008
Unity. unity3d.com. Accessed 20 August 2015
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Rosebrock, A.L Pyimagesearch. www.pyimagesearch.com/
The OpenCV Library [http://opencv.org/]; Bradski, G. The OpenCV Library Dr. Dobbs Journal, November 2000
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. Comput. Vis. Image Underst. (CVIU) 110(3), 346–359 (2008)
Rublee, E., et al.: ORB: an efficient alternative to SIFT or SURF. In: 2011 IEEE International Conference on Computer Vision (ICCV). IEEE (2011)
Quigley, M., Gerkey, B., Conley, K., Faust, J., Foote, T., Leibs, J., Berger, E., Wheeler, R., Ng, A.Y.: ROS: an open-source robot operating system. In: Proceedings of Open-Source Software Workshop of the International Conference on Robotics and Automation (ICRA) (2009)
Watson, O., Touretzky, D.S.: Navigating with the Tekkotsu Pilot. In: Proceedings of FLAIRS-24, pp. 591–596, Palm Beach, FL, USA, 18–20 May 2011
Hartley, R., Zisserman, A.: Multiple View Geometry. Cambridge University Press, Cambridge (2004)
Haralick, R., Lee, C., Ottenberg, K., Nolle, M.: Review and analysis of solutions of the three point perspective pose estimation problem. Int. J. Comput. Vis. 13(3), 331–356 (1994)
Lynen, S., Sattler, T., Bosse, M., Hesch, J., Pollefeys, M., Siegwart, R.: Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization. Science and Systems, Rovotics (2011)
Acknowledgments
We acknowledge and thank the generous support of the NSF, though CISE REU Site project #1359170, as well as from Harvey Mudd College and its Computer Science department.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Tenorio, D., Rivera, V., Medina, J., Leondar, A., Gaumer, M., Dodds, Z. (2015). Visual Autonomy via 2D Matching in Rendered 3D Models. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2015. Lecture Notes in Computer Science(), vol 9474. Springer, Cham. https://doi.org/10.1007/978-3-319-27857-5_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-27857-5_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27856-8
Online ISBN: 978-3-319-27857-5
eBook Packages: Computer ScienceComputer Science (R0)