Model-based interpretation of complex and variable images

Open Access

29 August 1997

journal article
research article
Published by The Royal Society in Philosophical Transactions Of The Royal Society B-Biological Sciences

Vol. 352 (1358) , 1267-1274
https://doi.org/10.1098/rstb.1997.0109

Abstract

The ultimate goal of machine vision is image understanding—the ability not only to recover image structure but also to know what it represents. By definition, this involves the use of models which describe and label the expected structure of the world. Over the past decade, model–based vision has been applied successfully to images of man–made objects. It has proved much more difficult to develop model–based approaches to the interpretation of images of complex and variable structures such as faces or the internal organs of the human body (as visualized in medical images). In such cases it has been problematic even to recover image structure reliably, without a model to organize the often noisy and incomplete image evidence. The key problem is that of variability. To be useful, a model needs to be specific—that is, to be capable of representing only ‘legal’ examples of the modelled object(s). It has proved difficult to achieve this whilst allowing for natural variability. Recent developments have overcome this problem; it has been shown that specific patterns of variability in shape and grey–level appearance can be captured by statistical models that can be used directly in image interpretation. The details of the approach are outlined and practical examples from medical image interpretation and face recognition are used to illustrate how previously intractable problems can now be tackled successfully. It is also interesting to ask whether these results provide any possible insights into natural vision; for example, we show that the apparent changes in shape which result from viewing three–dimensional objects from different viewpoints can be modelled quite well in two dimensions; this may lend some support to the ‘characteristic views’ model of natural vision.

Keywords

This publication has 29 references indexed in Scilit:

Non-linear generalization of point distribution models using polynomial regression
Image and Vision Computing, 1995
Active Shape Models-Their Training and Application
Computer Vision and Image Understanding, 1995
Use of active shape models for locating structures in medical images
Image and Vision Computing, 1994
Medical image interpretation: a generic approach using deformable templates
Medical Informatics, 1994
Boundary finding with parametrically deformable models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
Model-based image interpretation using genetic algorithms
Image and Vision Computing, 1992
Closed-form solutions for physically based shape modeling and recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991
Application of the Karhunen-Loeve procedure for the characterization of human faces
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1990
Deformable templates for feature extraction from medical images
Published by Springer Nature ,1990
Principal warps: thin-plate splines and the decomposition of deformations
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1989