Abstract
We present an alternate account of human 3D processing for perception and action. Traditional views postulate that visual mechanisms extract veridical metric properties of environmental objects. Instead, we argue that local affine information, which encodes non metric aspects of a 3D structure, like the depth-order of feature points, determines both our conscious perception of the world and our motor actions. Since only this information is accurately carried on from the earliest stages of 3D processing, our theory predicts large biases in perceptual and motor tasks that require veridical metric estimates. We describe empirical results that support this prediction and show the inadequacy of metric models as computational theories of 3D processing.
Keywords
- Affinity Information
- Retinal Velocity
- Final Grip Aperture (FGA)
- Extraretinal Information
- Affinity Hypothesis
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
We arbitrarily chose a zero disparity value for the flankers, as if the observers were fixating the plane identified by the two flanking rods.
- 2.
In the specific case of disparity and velocity \(\mathit{SNR}_{c} = \sqrt{\mathit{SNR}_{d}^{2} + \mathit{SNR}_{v}^{2}}\).
- 3.
Consider binocular disparities: The relative disparity d is proportional to the depth Δz of the distal object through a scaling factor k d , related to the egocentric distance of the object (d=k d Δz) and, therefore, to the Signal to Noise Ratio \((\mathit{SNR}_{d} = \frac {k_{d}}{\sigma_{d}}\Delta z)\). For a fixed value of k d , SNR d increases with Δz. Thus, “on average”, larger values of SNR correspond to larger values of Δz.
References
Trommershauser J, Kording K, Landy M (2011) Sensory cue integration. Oxford University Press, New York
Goodale MA (2011) Transforming vision into action. Vis Res 51:1567–1587
Todd JT, Bressan P (1990) The perception of 3-dimensional affine structure from minimal apparent motion sequences. Percept Psychophys 48:419–430
Todd JT, Norman JF (2003) The visual perception of 3-d shape from multiple cues: are observers capable of perceiving metric structure? Percept Psychophys 65:31–47
Todd JT, Oomes AHJ, Koenderink JJ, Kappers AML (2001) On the affine structure of perceptual space. Psychol Sci 12:191–196
Norman JF, Lappin JS, Norman HF (2000) The perception of length on curved and flat surfaces. Percept Psychophys 62:1133–1145
Di Luca M, Domini F, Caudek C (2010) Inconsistency of perceived 3D shape. Vis Res 50:1519–1531
Norman JF, Crabtree CE, Clayton AM, Norman HF (2005) The perception of distances and spatial relationships in natural outdoor environments. Perception 34:1315–1324
Domini F, Shah R, Caudek C (2011) Do we perceive a flattened world on the monitor screen? Acta Psychol 138:359–366
Johnston EB (1991) Systematic distortions of shape from stereopsis. Vis Res 31:1351–1360
Domini F, Braunstein ML (1998) Recovery of 3-d structure from motion is neither Euclidean nor affine. J Exp Psychol Hum Percept Perform 24:1273–1295
Domini F, Caudek C (1999) Perceiving surface slant from deformation of optic flow. J Exp Psychol Hum Percept Perform 25:426–444
Domini F, Caudek C (2003) 3D structure perceived from dynamic information: a new theory. Trends Cogn Sci 7:444–449
Domini F, Caudek C (2003) Recovering slant and angular velocity from a linear velocity field: modeling and psychophysics. Vis Res 43:1753–1764
Fantoni C, Caudek C, Domini F (2010) Systematic distortions of perceived planar surface motion in active vision. J Vis 10:1–20. doi:10.1167/10.5.12
Caudek C, Fantoni C, Domini F (2011) Bayesian modeling of perceived surface slant from actively-generated and passively-observed optic flow. PLoS ONE. doi:10.1371/journal.pone.0018731
Fantoni C, Caudek C, Domini F (2012) Perceived surface slant is systematically biased in the actively-generated optic flow. PLoS ONE. doi:10.1371/journal.pone.0033911
Domini F, Caudek C (2010) Matching perceived depth from disparity and velocity: modeling and psychophysics. Acta Psychol 133:81–89
Domini F, Caudek C, Tassinari H (2006) Stereo and motion information are not independently processed by the visual system. Vis Res 46:1707–1723
Richards W (1985) Structure from stereo and motion. J Opt Soc Am 2:343–349
Foster R, Fantoni C, Caudek C, Domini F (2011) Integration of disparity and velocity information for haptic and perceptual judgments of object depth. Acta Psychol 136:300–310
Campagnoli C, Volcic R, Domini F (2012) The same object and at least three different grip apertures. J Vis. doi:10.1167/12.9.429
Saunders JA, Knill DC (2003) Humans use continuous visual feedback from the hand to control fast reaching movements. Exp Brain Res 152:341–352
Bingham G, Coats R, Mon-Williams M (2007) Natural prehension in trials without haptic feedback but only when calibration is allowed. Neuropsychologia 45:288–294
Chen M, Chen K (1982) A transformational analysis of form recognition under plane isometries. J Math Psychol 26:237–251
Van Gool M, Moons T, Pauwels E, Wagemans J Invariance from the Euclidean geometer’s perspective. Perception 23:547–561
Wagemans J, Van Gool L, Lamote C, Foster DH (2000) Minimal information to determine affine shape equivalence. J Exp Psychol Hum Percept Perform 26:443–468
Domini F, Caudek C (2011) Combining image signals before 3D reconstruction: the intrinsic constraint model of cue integration. In: Trommershauser J, Kording K, Landy M (eds) Sensory cue integration. Oxford University Press, New York, pp 120–143
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this chapter
Cite this chapter
Domini, F., Caudek, C. (2013). Perception and Action Without Veridical Metric Reconstruction: An Affine Approach. In: Dickinson, S., Pizlo, Z. (eds) Shape Perception in Human and Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-5195-1_20
Download citation
DOI: https://doi.org/10.1007/978-1-4471-5195-1_20
Publisher Name: Springer, London
Print ISBN: 978-1-4471-5194-4
Online ISBN: 978-1-4471-5195-1
eBook Packages: Computer ScienceComputer Science (R0)