Efficient Edge-Based Methods for Estimating Manhattan Frames in Urban Imagery
We address the problem of efficiently estimating the rotation of a camera relative to the canonical 3D Cartesian frame of an urban scene, under the so-called “Manhattan World” assumption [1,2]. While the problem has received considerable attention in recent years, it is unclear how current methods stack up in terms of accuracy and efficiency, and how they might best be improved. It is often argued that it is best to base estimation on all pixels in the image . However, in this paper, we argue that in a sense, less can be more: that basing estimation on sparse, accurately localized edges, rather than dense gradient maps, permits the derivation of more accurate statistical models and leads to more efficient estimation. We also introduce and compare several different search techniques that have advantages over prior approaches. A cornerstone of the paper is the establishment of a new public groundtruth database which we use to derive required statistics and to evaluate and compare algorithms.
KeywordsGround Truth Camera Parameter Urban Scene Vanishing Point Gauss Sphere
- 4.Schindler, G., Dellaert, F.: Atlanta world: An expectation maximization framework for simultaneous low-level edge grouping and camera calibration in complex man-made environments. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. I–203 – I–209. IEEE, Los Alamitos (2004) Google Scholar
- 5.Kos̆ecká, J., Zhang, W.: Video compass. In: Seventh European Conference on Computer Vision, pp. 476–490 (2002)Google Scholar
- 6.Wildenauer, H., Vincze, M.: Vanishing point detection in complex man-made worlds. In: 14th IEEE International Conference on Image Analysis and Processing, pp. 615–622. IEEE, Los Alamitos (2007)Google Scholar
- 7.Collins, R., Weiss, R.: Vanishing point calculation as a statistical inference on the unit sphere. In: Third International Conference on Computer Vision, pp. 400–403. IEEE, Los Alamitos (1990)Google Scholar