Sensor-fusion for robust identification of persons: a field test

1 January 1995

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 3, 516-519 vol.3
https://doi.org/10.1109/icip.1995.537685

Abstract

We have presented an approach to combine optical lip motion analysis and acoustic voice analysis in order to identify the people speaking. Due to the independence of the different data sources, a higher reliability of the results in comparison with simple optical lip reading was observed. From this proposal, a system prototype has emerged. This improved setup, which is demonstrated, has shown promising results, with a false recognition rate of 0% for both surnames and entry words on a sample of 101 persons. Rejection rates of 8% and 14% respectively have been observed. For performing the recognition, the person to be identified has to speak a single word, which can either be specific of the person (e.g. surname), or be one identical entry word for all persons. Meanwhile, a field test at the entrance of our Institute has been started, which will last for a few months. We demonstrate the first results of this field test. We propose that the combination of motion and voice analysis offers a possibility for realizing robust access control systems.

Keywords

This publication has 3 references indexed in Scilit:

Testing synergetic algorithms with industrial classification problems
Neural Networks, 1994
Multi-sensorial inputs for the identification of persons with synergetic computers
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1994
Automatic lipreading by optical‐flow analysis
Systems and Computers in Japan, 1991