Manhattan World: compass direction from a single image by Bayesian inference

1 January 1999

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, 941-947 vol.2
https://doi.org/10.1109/iccv.1999.790349

Abstract

When designing computer vision systems for the blind and visually impaired it is important to determine the orientation of the user relative to the scene. We observe that most indoor and outdoor (city) scenes are designed on a Manhattan three-dimensional grid. This Manhattan grid structure puts strong constraints on the intensity gradients in the image. We demonstrate an algorithm for detecting the orientation of the user in such scenes based on Bayesian inference using statistics which we have learnt in this domain. Our algorithm requires a single input image and does not involve pre-processing stages such as edge detection and Hough grouping. We demonstrate strong experimental results on a range of indoor and outdoor images. We also show that estimating the grid structure makes it significantly easier to detect target objects which are not aligned with the grid.

Keywords

This publication has 7 references indexed in Scilit:

Fundamental bounds on edge detection: an information theoretic evaluation of different edge cues
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Robust computation and parametrization of multiple view relations
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Grouping based on projective geometry constraints and uncertainty
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Elements of Information Theory
Published by Wiley ,2001
Aided and automatic target recognition based upon sensory inputs from image forming systems
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1997
Contribution to the determination of vanishing points using Hough transform
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1994
New method for vanishing point detection
CVGIP: Image Understanding, 1991