Manhattan World: compass direction from a single image by Bayesian inference
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 941-947 vol.2
- https://doi.org/10.1109/iccv.1999.790349
Abstract
When designing computer vision systems for the blind and visually impaired it is important to determine the orientation of the user relative to the scene. We observe that most indoor and outdoor (city) scenes are designed on a Manhattan three-dimensional grid. This Manhattan grid structure puts strong constraints on the intensity gradients in the image. We demonstrate an algorithm for detecting the orientation of the user in such scenes based on Bayesian inference using statistics which we have learnt in this domain. Our algorithm requires a single input image and does not involve pre-processing stages such as edge detection and Hough grouping. We demonstrate strong experimental results on a range of indoor and outdoor images. We also show that estimating the grid structure makes it significantly easier to detect target objects which are not aligned with the grid.Keywords
This publication has 7 references indexed in Scilit:
- Fundamental bounds on edge detection: an information theoretic evaluation of different edge cuesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Robust computation and parametrization of multiple view relationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Grouping based on projective geometry constraints and uncertaintyPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Elements of Information TheoryPublished by Wiley ,2001
- Aided and automatic target recognition based upon sensory inputs from image forming systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1997
- Contribution to the determination of vanishing points using Hough transformPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1994
- New method for vanishing point detectionCVGIP: Image Understanding, 1991