Abstract
In this chapter, we describe algorithms for three-dimensional (GlossaryTerm
3-D
) vision that help robots accomplish navigation and grasping. To model cameras, we start with the basics of perspective projection and distortion due to lenses. This projection from a 3-D world to a two-dimensional (GlossaryTerm2-D
) image can be inverted only by using information from the world or multiple 2-D views. If we know the 3-D model of an object or the location of 3-D landmarks, we can solve the pose estimation problem from one view. When two views are available, we can compute the 3-D motion and triangulate to reconstruct the world up to a scale factor. When multiple views are given either as sparse viewpoints or a continuous incoming video, then the robot path can be computer and point tracks can yield a sparse 3-D representation of the world. In order to grasp objects, we can estimate 3-D pose of the end effector or 3-D coordinates of the graspable points on the object.Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Abbreviations
- 2-D:
-
two-dimensional
- 3-D:
-
three-dimensional
- 6-D:
-
six-dimensional
- GPS:
-
global positioning system
- IMU:
-
inertial measurement unit
- MRF:
-
Markov random field
- PnP:
-
prespective-n-point
- SLAM:
-
simultaneous localization and mapping
- SVD:
-
singular value decomposition
References
S. Izadi, R.A. Newcombe, D. Kim, O. Hilliges, D. Molyneaux, S. Hodges, P. Kohli, J. Shotton, A.J. Davison, A. Fitzgibbon: Kinectfusion: Real-time dynamic 3D surface reconstruction and interaction, ACM SIGGRAPH 2011 Talks (2011) p. 23
Google: Atap project tango, https://www.google.com/atap/projecttango (2014)
J.A. Hesch, D.G. Kottas, S.L. Bowman, S.I. Roumeliotis: Camera-IMU-based localization: Observability analysis and consistency improvement, Int. J.Robotics Res. 33(1), 182–201 (2014)
N. Snavely, S.M. Seitz, R. Szeliski: Modeling the world from internet photo collections, Int. J.Comput. Vis. 80(2), 189–210 (2008)
Z. Kukelova, M. Bujnak, T. Pajdla: Polynomial eigenvalue solutions to minimal problems in computer vision, IEEE Trans. Pattern Anal.Mach. Intell. 34(7), 1381–1393 (2012)
F. Kahl, S. Agarwal, M.K. Chandraker, D. Kriegman, S. Belongie: Practical global optimization for multiview geometry, Int. J.Comput. Vis. 79(3), 271–284 (2008)
R.I. Hartley, F. Kahl: Global optimization through rotation space search, Int. J.Comput. Vis. 82(1), 64–79 (2009)
Z. Zhang: A flexible new technique for camera calibration, IEEE Trans. Pattern Anal.Mach. Intell. 22, 1330–1334 (2000)
M. Pollefeys, L. Van Gool, M. Vergauwen, F. Verbiest, K. Cornelis, J. Tops, R. Koch: Visual modeling with a hand-held camera, Int. J.Comput. Vis. 59, 207–232 (2004)
M. Pollefeys, L. Van Gool: Stratified self-calibration with the modulus constraint, IEEE Trans. Pattern Anal.Mach. Intell. 21, 707–724 (1999)
O. Faugeras, Q.-T. Luong, T. Papadopoulo: The Geometry of Multiple Images: The Laws That Govern the Formation of Multiple Images of a Scene and Some of Their Applications (MIT Press, Cambridge 2001)
R. Hartley, A. Zisserman: Multiple View Geometry (Cambridge Univ. Press, Cambridge 2000)
K. Ottenberg, R.M. Haralick, C.-N. Lee, M. Nolle: Review and analysis of solutions of the three-point perspective problem, Int. J.Comput. Vis. 13, 331–356 (1994)
M.A. Fischler, R.C. Bolles: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography, ACM Commun. 24, 381–395 (1981)
R. Kumar, A.R. Hanson: Robust methods for estimaging pose and a sensitivity analysis, Comput. Vis.Image Underst. 60, 313–342 (1994)
C.-P. Lu, G. Hager, E. Mjolsness: Fast and globally convergent pose estimation from video images, IEEE Trans. Pattern Anal.Mach. Intell. 22, 610–622 (2000)
L. Quan, Z. Lan: Linear n-point camera pose determination, IEEE Trans. Pattern Anal.Mach. Intell. 21, 774–780 (1999)
A. Ansar, K. Daniilidis: Linear pose estimation from points and lines, IEEE Trans. Pattern Anal.Mach. Intell. 25, 578–589 (2003)
V. Lepetit, F. Moreno-Noguer, P. Fua: EPNP: An accurate $o(n)$ solution to the PNP problem, Int. J. Comput. Vis. 81(2), 155–166 (2009)
G.H. Golub, C.F. van Loan: Matrix Computations (Johns Hopkins Univ. Press, Baltimore 1983)
J.A. Hesch, S.I. Roumeliotis: A direct least-squares (dls) method for pnp, IEEE Int. Conf.Comput. Vis. (ICCV) (2011) pp. 383–390
C.J. Taylor, D.J. Kriegman: Minimization on the Lie Group SO(3) and Related Manifolds (Yale University, New Haven 1994)
P.-A. Absil, R. Mahony, R. Sepulchre: Optimization Algorithms on Matrix Manifolds (Princeton Univ. Press, Princeton 2009)
Y. Ma, J. Košecká, S. Sastry: Optimization criteria and geometric algorithms for motion and structure estimation, Int. J.Comput. Vis. 44(3), 219–249 (2001)
R.I. Hartley, P. Sturm: Triangulation, Comput. Vis.Image Underst. 68(2), 146–157 (1997)
B. Kitt, A. Geiger, H. Lategahn: Visual odometry based on stereo image sequences with ransac-based outlier rejection scheme, IEEE Intell. Veh. Symp. (IV) (2010)
B.K.P. Horn, H.M. Hilden, S. Negahdaripour: Closed-form solution of absolute orientation using orthonormal matrices, J. Opt. Soc. Am. A 5, 1127–1135 (1988)
A.J. Davison, I.D. Reid, N.D. Molton, O. Stasse: Monoslam: Real-time single camera SLAM, IEEE Trans.Pattern Anal.Mach. Intell. 29(6), 1052–1067 (2007)
R. Tron, K. Daniilidis: On the quotient representation for the essential manifold, Proc. IEEE Conf.Comput. Vis.Pattern Recognit. (2014) pp. 1574–1581
T.S. Huang, O.D. Faugeras: Some properties of the E matrix in two-view motion estimation, IEEE Trans. Pattern Anal.Mach. Intell. 11, 1310–1312 (1989)
D. Nister: An efficient solution for the five-point relative pose problem, IEEE Trans. Pattern Anal.Mach. Intell. 26, 756–777 (2004)
H. Li, R. Hartley: Five-point motion estimation made easy, IEEE 18th Int. Conf. Pattern Recognit. (ICPR), Vol. 1 (2006) pp. 630–633
Z. Kukelova, M. Bujnak, T. Pajdla: Polynomial eigenvalue solutions to the 5-pt and 6-pt relative pose problems, BMVC (2008) pp. 1–10
H. Stewenius, C. Engels, D. Nistér: Recent developments on direct relative orientation, ISPRS J.Photogramm.Remote Sens. 60(4), 284–294 (2006)
D. Batra, B. Nabbe, M. Hebert: An alternative formulation for five point relative pose problem, IEEE WorkshopMotionVideo Comput. (2007) pp. 21–21
Center for Machine Perception, Minimal problems in computer vision; http://cmp.felk.cvut.cz/minimal/5_pt_relative.php
S. Maybank: Theory of Reconstruction from Image Motion (Springer, Berlin, Heidelberg 1993)
S.J. Maybank: The projective geometry of ambiguous surfaces, Phil. Trans. Royal Soc. Lond. A 332(1623), 1–47 (1990)
A. Jepson, D.J. Heeger: A fast subspace algorithm for recovering rigid motion, Proc. IEEE WorkshopVis. Motion, Princeton (1991) pp. 124–131
C. Fermüller, Y. Aloimonos: Algorithmic independent instability of structure from motion, Proc. 5th Eur. Conf. Comput. Vision, Freiburg (1998)
K. Daniilidis, M. Spetsakis: Understanding noise sensitivity in structure from motion. In: Visual Navigation, ed. by Y. Aloimonos (Lawrence Erlbaum, Mahwah 1996) pp. 61–88
S. Soatto, R. Brockett: Optimal structure from motion: Local ambiguities and global estimates, IEEE Conf. Comput. Vis.Pattern Recognit., Santa Barbara (1998)
J. Oliensis: A new structure-from-motion ambiguity, IEEE Trans. Pattern Anal.Mach. Intell. 22, 685–700 (1999)
O. Naroditsky, X.S. Zhou, J. Gallier, S. Roumeliotis, K. Daniilidis: Two efficient solutions for visual odometry using directional correspondence, IEEE Trans. Patterns Anal. Mach. Intell. (2012)
Y. Ma, K. Huang, R. Vidal, J. Kosecka, S. Sastry: Rank conditions of the multiple view matrix, Int. J.Comput. Vis. 59(2), 115–139 (2004)
Y. Ma, S. Soatto, J. Kosecka, S. Sastry: An Invitation to 3-D Vision: From Images to Geometric Models (Springer, Berlin, Heidelberg 2003)
W. Triggs, P. McLauchlan, R. Hartley, A. Fitzgibbon: Bundle adjustment – A modern synthesis, Lect. Notes Comput. Sci 1883, 298–372 (2000)
M. Lourakis, A. Argyros: The Design and Implementation of a Generic Sparse Bundle Adjustment Software Package Based on the Levenberg–Marquard Method, Tech. Rep, Vol. 340 (ICS/FORTH, Heraklion 2004)
S. Teller, M. Antone, Z. Bodnar, M. Bosse, S. Coorg: Calibrated, registered images of an extended urban area, Int. Conf. Comput. Vis.Pattern Recognit., Kauai, Vol. 1 (2001) pp. 813–820
D. Kragic, M. Madry, D. Song: From object categories to grasp transfer using probabilistic reasoning, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2012) pp. 1716–1723
A.T. Miller, S. Knoop, H.I. Christensen, P.K. Allen: Automatic grasp planning using shape primitives, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2003) pp. 1824–1829
K. Hübner, D. Kragic: Selection of robot pre-grasps using box-based shape approximation, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS) (2008) pp. 1765–1770
C. Dunes, E. Marchand, C. Collowet, C. Leroux: Active rough shape estimation of unknown objects, IEEE Int. Conf.Intell. RobotsSyst. (IROS) (2008) pp. 3622–3627
M. Przybylski, T. Asfour: Unions of balls for shape approximation in robot grasping, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS), Taipei (2010) pp. 1592–1599
C. Goldfeder, P.K. Allen, C. Lackner, R. Pelossof: Grasp Planning Via Decomposition Trees, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2007) pp. 4679–4684
S. El-Khoury, A. Sahbani: Handling objects by their handles, IEEE/RSJ Int. Conf.Intell. RobotsSyst. WorkshopGraspTask Learn. Imitation (2008)
R. Pelossof, A. Miller, P. Allen, T. Jebera: An SVM learning approach to robotic grasping, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2004) pp. 3512–3518
A. Boularias, O. Kroemer, J. Peters: Learning robot grasping from 3-d images with markov random fields, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS) (2011) pp. 1548–1553
R. Detry, E. Başeski, N. Krüger, M. Popović, Y. Touati, O. Kroemer, J. Peters, J. Piater: Learning object-specific grasp affordance densities, IEEE Int. Conf.Dev.Learn. (2009) pp. 1–7
C. Papazov, S. Haddadin, S. Parusel, K. Krieger, D. Burschka: Rigid 3D geometry matching for grasping of known objects in cluttered scenes, Int. J.Robotics Res. 31(4), 538–553 (2012)
J. Weisz, P.K. Allen: Pose error robust grasping from contact wrench space metrics, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2012) pp. 557–562
D. Song, C.H. Ek, K. Hübner, D. Kragic: Multivariate discretization for bayesian network structure learning in robot grasping, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2011) pp. 1944–1950
Z.C. Marton, D. Pangercic, N. Blodow, J. Kleinehellefort, M. Beetz: General 3D modelling of novel objects from a single view, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS) (2010) pp. 3700–3705
D. Rao, V. Le Quoc, T. Phoka, M. Quigley, A. Sudsang, A.Y. Ng: Grasping novel objects with depth segmentation, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS), Taipei (2010) pp. 2578–2585
J. Bohg, M. Johnson-Roberson, B. León, J. Felip, X. Gratal, N. Bergström, D. Kragic, A. Morales: Mind the gap – Robotic grasping under incomplete observation, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2011)
G.M. Bone, A. Lambert, M. Edwards: Automated Modelling and Robotic Grasping of Unknown Three-Dimensional Objects, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2008) pp. 292–298
K. Hsiao, S. Chitta, M. Ciocarlie, E.G. Jones: Contact-reactive grasping of objects with partial shape information, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS) (2010) pp. 1228–1235
M.A. Roa, M.J. Argus, D. Leidner, C. Borst, G. Hirzinger: Power grasp planning for anthropomorphic robot hands, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2012)
M. Richtsfeld, M. Vincze: Grasping of Unknown Objects from a Table Top, ECCV WorkshopVis.Action: Effic. Strateg.Cogn. AgentsComplex Environ. (2008)
A. Maldonado, U. Klank, M. Beetz: Robotic grasping of unmodeled objects using time-of-flight range data and finger torque information, IEEE/RSJ Int. Conf.Intell. RobotsSyst. (IROS) (2010) pp. 2586–2591
J. Stückler, R. Steffens, D. Holz, S. Behnke: Real-time 3d perception and efficient grasp planning for everyday manipulation tasks, Eur. Conf.Mob. Robots (ECMR) (2011)
G. Kootstra, M. Popovic, J.A. Jørgensen, K. Kuklinski, K. Miatliuk, D. Kragic, N. Kruger: Enabling grasping of unknown objects through a synergistic use of edge and surface information, Int. J.Robotics Res. 31(10), 1190–1213 (2012)
D. Kraft, N. Pugeault, E. Baseski, M. Popovic, D. Kragic, S. Kalkan, F. Wörgötter, N. Krueger: Birth of the object: Detection of objectness and extraction of object shape through object action complexes, Int. J.Humanoid Robotics pp, 247–265 (2009)
O. Kroemer, E. Ugur, E. Oztop, J. Peters: A Kernel-based Approach to Direct Action Perception, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2012)
A. Herzog, P. Pastor, M. Kalakrishnan, L. Righetti, T. Asfour, S. Schaal: Template-based learning of grasp selection, Proc. IEEE Int. Conf.RoboticsAutom. (ICRA) (2012)
L. Montesano, M. Lopes, A. Bernardino, J. Santos-Victor: Learning object affordances: From sensory–motor coordination to imitation, IEEE Trans.Robotics 24(1), 15–26 (2008)
O. Faugeras: Three-Dimensional Computer Vision (MIT Press, Cambridge 1993)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Video-References
Video-References
- :
-
Google’s project Tango available from http://handbookofrobotics.org/view-chapter/32/videodetails/120
- :
-
Finding paths through the world’s photos available from http://handbookofrobotics.org/view-chapter/32/videodetails/121
- :
-
LIBVISO: Visual odometry for intelligent vehicles available from http://handbookofrobotics.org/view-chapter/32/videodetails/122
- :
-
Parallel tracking and mapping for small AR workspaces (PTAM) available from http://handbookofrobotics.org/view-chapter/32/videodetails/123
- :
-
DTAM: Dense tracking and mapping in real-time available from http://handbookofrobotics.org/view-chapter/32/videodetails/124
- :
-
3-D models from 2-D video – automatically available from http://handbookofrobotics.org/view-chapter/32/videodetails/125
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kragic, D., Daniilidis, K. (2016). 3-D Vision for Navigation and Grasping. In: Siciliano, B., Khatib, O. (eds) Springer Handbook of Robotics. Springer Handbooks. Springer, Cham. https://doi.org/10.1007/978-3-319-32552-1_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-32552-1_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32550-7
Online ISBN: 978-3-319-32552-1
eBook Packages: EngineeringEngineering (R0)