Extraction of Object Representations from Stereo Image Sequences Utilizing Statistical and Deterministic Regularities in Visual Data
The human visual system is a highly interconnected machinery that acquires its stability through integration of information across modalities and time frames. This integration becomes possible by utilizing regularities in visual data, most importantly motion (especially rigid body motion) and statistical regularities reflected in Gestalt principles such as collinearity.
In this paper we describe an artificial vision system which extracts 3D- information from stereo sequences. This system uses deterministic and statistical regularities to aquire stable representations from unreliable sub-modalities such as stereo or edge detection. To make use of the above mentioned regularities we have to work within a complex machinery containing sub-modules such as stereo, pose estimation and an accumulation scheme. The interaction of these modules allows to use the statistical and deterministic regularities for feature disambiguation within a process of recurrent predictions.
KeywordsObject Representation Rigid Body Motion Consecutive Frame Accumulation Scheme Semantic Parameter
Unable to display preview. Download preview PDF.
- J. Aloimonos and D. Shulman. Integration of Visual Modules—An extension of the Marr Paradigm. Academic Press, London, 1989.Google Scholar
- O.D. Faugeras. Three-Dimensional Computer Vision. MIT Press, 1993.Google Scholar
- M. Felsberg and G. Sommer. The monogenic signal. IEEE Transactions on Signal Processing, 41(12), 2001.Google Scholar
- O. Granert. Poseschaetzung kinematischer Ketten. Diploma Thesis,Universität Kiel, 2002.Google Scholar
- D.D. Hoffman, editor. Visual Intelligence: How we create what we see. W.W. Norton and Company, 1980.Google Scholar
- Thomas Jäger. Interaktion verschiedener visueller Modalitäten zur stabilen Extraktion von Objektrepräsentationen. Diploma thesis (University of Kiel), 2002.Google Scholar
- B. Jähne. Digital Image Processing—Concepts,Algorithms,and Scientific Applications. Springer, 1997.Google Scholar
- R. Klette, K. Schlüns, and A. Koschan. Computer Vision—Three-Dimensional Data from Images. Springer, 1998.Google Scholar
- P. Kovesi. Image features from phase congruency. Videre: Journal of Computer Vision Research, 1(3):1–26, 1999.Google Scholar
-  N. Krüger, M. Felsberg, C. Gebken, and M. Pörksen. An explicit and compact coding of geometric and structural information applied to stereo processing. Proceedings of the workshop ‘Vision, Modeling and Visualization 2002’ 2002.Google Scholar
- N. Krüger and F. Wörgötter. Different degree of genetical prestructuring in the ontogenesis of visual abilities based on deterministic and statistical regularities. Proceedings of the Workshop On Growing up Artifacts that Live SAB 2002, 2002.Google Scholar
- N. Krüger and F. Wörgötter. Multi modal estimation of collinearity and parallelism in natural image sequences. To appear in Network: Computation in Neural Systems, 2002.Google Scholar
- B. Rosenhahn, N. Krüger, T. Rabsch, and G. Sommer. Automatic tracking with a novel pose estimation algorithm. Robot Vision 2001, 2001.Google Scholar
- S. Sarkar and K.L. Boyer. Computing Perceptual Organization in Computer Vision. World Scientific, 1994.Google Scholar
- C. Zetzsche and E. Barth. Fundamental limits of linear filters in the visual processing of two dimensional signals. Vision Research, 30, 1990.Google Scholar