Model-based interpretation of complex and variable images
Open Access
- 29 August 1997
- journal article
- research article
- Published by The Royal Society in Philosophical Transactions Of The Royal Society B-Biological Sciences
- Vol. 352 (1358) , 1267-1274
- https://doi.org/10.1098/rstb.1997.0109
Abstract
The ultimate goal of machine vision is image understanding—the ability not only to recover image structure but also to know what it represents. By definition, this involves the use of models which describe and label the expected structure of the world. Over the past decade, model–based vision has been applied successfully to images of man–made objects. It has proved much more difficult to develop model–based approaches to the interpretation of images of complex and variable structures such as faces or the internal organs of the human body (as visualized in medical images). In such cases it has been problematic even to recover image structure reliably, without a model to organize the often noisy and incomplete image evidence. The key problem is that of variability. To be useful, a model needs to be specific—that is, to be capable of representing only ‘legal’ examples of the modelled object(s). It has proved difficult to achieve this whilst allowing for natural variability. Recent developments have overcome this problem; it has been shown that specific patterns of variability in shape and grey–level appearance can be captured by statistical models that can be used directly in image interpretation. The details of the approach are outlined and practical examples from medical image interpretation and face recognition are used to illustrate how previously intractable problems can now be tackled successfully. It is also interesting to ask whether these results provide any possible insights into natural vision; for example, we show that the apparent changes in shape which result from viewing three–dimensional objects from different viewpoints can be modelled quite well in two dimensions; this may lend some support to the ‘characteristic views’ model of natural vision.Keywords
This publication has 29 references indexed in Scilit:
- Non-linear generalization of point distribution models using polynomial regressionImage and Vision Computing, 1995
- Active Shape Models-Their Training and ApplicationComputer Vision and Image Understanding, 1995
- Use of active shape models for locating structures in medical imagesImage and Vision Computing, 1994
- Medical image interpretation: a generic approach using deformable templatesMedical Informatics, 1994
- Boundary finding with parametrically deformable modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- Model-based image interpretation using genetic algorithmsImage and Vision Computing, 1992
- Closed-form solutions for physically based shape modeling and recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Application of the Karhunen-Loeve procedure for the characterization of human facesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1990
- Deformable templates for feature extraction from medical imagesPublished by Springer Nature ,1990
- Principal warps: thin-plate splines and the decomposition of deformationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989