Abstract
A new algorithm for registering a 3D model to a single or a sequence of 2D images without a-priori knowledge of correspondences is presented. The algorithm is composed of two iterated steps: the first step determines correspondences between image features and model data, while the second one computes the 3D rigid body transformation that minimizes the displacement of the matched points. As camera images do not yield full 3D information, the key idea proposed is to exploit the inverse perspective, i.e. the projection ray determined by the focal point of the camera and an image feature point. The inverse perspective yields the constraints for both correspondence search and pose estimation in 3D space rather than in the image plane. This not only simplifies the motion parameter estimation but also reduces matching ambiguities due to occlusions. In addition, a robust M-estimation technique reduces the impact of false correspondences. Experimental results show that complex, 3D CAD-models can be registered efficiently and accurately to images, even if image features are incomplete, fragmented and noisy.

This publication has 9 references indexed in Scilit: